Function reference
Core verbs - one table
arrange | Sort rows based on one or more columns. |
count | Count observations by group. |
distinct | Count observations by group. |
filter | Keep rows that match condition. |
head | Keep the first n rows of data. |
mutate, transmute | Create or replace columns. |
rename | Rename columns. |
select | Keep, drop, or rename specific columns. |
summarize | Calculate a single number per grouping. |
group_by, ungroup | Specify groups for splitting rows of data. |
Core verbs - two table
inner_join, left_join, right_join, full_join | Mutating joins |
semi_join, anti_join | Filtering joins |
Query verbs
collect | Retrieve data into a DataFrame. |
show_query | Print the query being generated. |
Tidy verbs
complete | Add rows for missing combinations in the data. |
extract | Add new columns by matching a pattern on a column of strings. |
gather, spread | Gather columns in to long format. Spread out to wide format. |
pivot_longer, pivot_wider | Change rows of data to columns, or columns to rows. More comprehensive than spread and gather. |
separate, unite | Add new columns by splitting a character column. |
nest, unnest | Create a column where each entry is a DataFrame. |
Column Operations
Forcats
fct_collapse | Rename categories. Optionally group all others. |
fct_infreq | Order categories by frequency (largest first) |
fct_inorder | Order categories by when they first appear. |
fct_lump | Lump infrequently observed categories together. |
fct_recode | Rename categories. |
fct_reorder | Reordered categories, using a calculation over another column. |
fct_rev | Reverse category levels. |
Datetime
floor_date, ceil_date | Round datetimes down or up to a specific granularity (e.g. week). |
Vector
between() | Check whether values are in a specified range. |
case_when(), if_else() | Generalized if statements. |
coalesce() | Use first non-missing element across columns. |
cumall(), cumany(), cummean() | Cumulative all, any, and mean. |
lag(), lead() | Shift values later (lag) or earlier (lead) in time. |
n() | Calculate the number of observations in a vector. |
n_distinct() | Count the number of unique values. |
na_if() | Convert a value to NA. |
near() | Check whether every pair of values in two vectors are close. |
nth(), first(), last() | Return the first, last, or nth value. |
row_number(), ntile(), min_rank(), dense_rank(), percent_rank(), cume_dist() | Windowed rank functions. |