join.tbl_dt {dplyr} | R Documentation |
See join for a description of the general purpose of the functions.
## S3 method for class 'data.table'
inner_join(x, y, by = NULL, copy = FALSE, ...)

## S3 method for class 'data.table'
left_join(x, y, by = NULL, copy = FALSE, ...)

## S3 method for class 'data.table'
semi_join(x, y, by = NULL, copy = FALSE, ...)

## S3 method for class 'data.table'
anti_join(x, y, by = NULL, copy = FALSE, ...)
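x, y | tbls to join |
by | a character vector of variables to join by. If NULL, the default, the join is a natural join that uses all variables with common names across the two tables. To join by different variables on x and y use a named vector. For example, by = c("a" = "b") will match x.a to y.b. |
copy | If x and y are not from the same data source and copy is TRUE, y is copied into the same src as x. This allows joins across srcs, but it is potentially expensive, so you must opt in. |
... | Included for compatibility with generic; otherwise ignored. |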
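The named-vector form of by can be sketched on toy data. This is a minimal illustration, not part of the original page: the data frames x and y and their columns (a, b, x_val, y_val) are hypothetical, and plain data frames are used instead of tbl_dt objects on the assumption that the data.table methods interpret by in the same way as the generic.

```r
library(dplyr)

# Hypothetical toy tables: the key is called `a` in x and `b` in y.
x <- data.frame(a = c(1, 2, 3), x_val = c("p", "q", "r"),
                stringsAsFactors = FALSE)
y <- data.frame(b = c(2, 3, 4), y_val = c("Q", "R", "S"),
                stringsAsFactors = FALSE)

# A named vector maps the column in x (the name) to the column in y (the value).
matched <- inner_join(x, y, by = c("a" = "b"))

# Only keys present in both tables survive; the key keeps its name from x.
print(matched)
```

Naming the key columns explicitly also avoids relying on the natural-join default, which can pick up unintended columns when the two tables happen to share other names.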
if (require("data.table") && require("Lahman")) {
  batting_dt <- tbl_dt(Batting)
  person_dt <- tbl_dt(Master)

  # Inner join: match batting and person data
  inner_join(batting_dt, person_dt)

  # Left join: keep batting data even if person missing
  left_join(batting_dt, person_dt)

  # Semi-join: find batting data for top 4 teams, 2010:2012
  grid <- expand.grid(
    teamID = c("WAS", "ATL", "PHI", "NYA"),
    yearID = 2010:2012)
  top4 <- semi_join(batting_dt, grid, copy = TRUE)

  # Anti-join: find batting data without player data
  anti_join(batting_dt, person_dt)
}
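The relationship between the semi-join and anti-join in the examples above can be checked on a small scale. The following is a sketch under stated assumptions: the tables batting and people and their values are made up for illustration, and plain data frames stand in for tbl_dt objects, assuming the data.table methods filter rows the same way.

```r
library(dplyr)

# Hypothetical miniature versions of the batting and person tables.
batting <- data.frame(playerID = c("a01", "b02", "c03", "a01"),
                      HR = c(5L, 0L, 12L, 7L),
                      stringsAsFactors = FALSE)
people <- data.frame(playerID = c("a01", "c03"),
                     name = c("Ann", "Cy"),
                     stringsAsFactors = FALSE)

# semi_join keeps rows of batting that have a match in people;
# anti_join keeps the rows that do not.
kept <- semi_join(batting, people, by = "playerID")
dropped <- anti_join(batting, people, by = "playerID")

# Both are filters on x: no columns from y are added and no rows are
# duplicated, so together they account for every row of batting.
stopifnot(nrow(kept) + nrow(dropped) == nrow(batting))
```

Because both functions only filter x, they are a convenient way to inspect which rows an inner join would keep or lose before running it.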