Release dbplyr 2.4.0 · tidyverse/dbplyr

Breaking changes

Using compute(temporary = FALSE) without providing a name is now
deprecated (@mgirlich, #1154).
ntile()'s first argument has been renamed from order_by to x to
match the interface of dplyr::ntile() (@mgirlich, #1242).
simulate_vars() and simulate_vars_is_typed() were removed as they weren't
used and tidyselect now offers tidyselect_data_proxy() and
tidyselect_data_has_predicates() (@mgirlich, #1199).
sql_not_supported() now expects a function name without parentheses.
sql_query_append(), sql_query_insert(), sql_query_update(),
sql_query_upsert(), and sql_query_delete() changed their arguments to
make them more consistent with the other sql_query_*() functions:
- x_name was renamed to table.
- y was renamed to from and must now be a table identifier or SQL instead
  of a lazy table.
- sql_query_append() and sql_query_insert() have gained the argument cols.
remote_name() now returns a string with the name of the table. To get the
qualified identifier use the newly added remote_table() (@mgirlich, #1280).
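As a rough illustration of the split between remote_name() and remote_table(), the sketch below builds a throwaway table with memdb_frame(). It assumes the RSQLite package is installed, and the table name "blorp" is only a placeholder.

```r
library(dbplyr)

# memdb_frame() copies the data into a temporary in-memory SQLite database
# (assumes RSQLite is available); "blorp" is an arbitrary placeholder name
mf <- memdb_frame(x = 1:3, .name = "blorp")

# remote_name() returns just the table name as a string
tbl_name <- remote_name(mf)
print(tbl_name)

# remote_table() returns the table identifier, including any qualification
tbl_id <- remote_table(mf)
print(tbl_id)
```

For a table created via in_schema(), the difference should be more visible, since remote_table() keeps the schema part and remote_name() does not.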
tbl_lazy() loses the src argument, which has been deprecated for years
(@mgirlich, #1208).
translate_sql() now requires the con argument (@mgirlich, #1311).
The vars argument has been removed after it threw an error for the last 7
years (@mgirlich).
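A minimal sketch of what calling translate_sql() looks like now that con is required. The simulated connections here are stand-ins used so that the example needs no database; real code would pass an actual DBI connection.

```r
library(dbplyr)

# simulate_dbi() stands in for a real DBI connection in this sketch
con <- simulate_dbi()

# con must be passed explicitly
plus_one <- translate_sql(x + 1, con = con)
print(plus_one)

# The same idea against a simulated Postgres backend
as_chr <- translate_sql(as.character(x), con = simulate_postgres())
print(as_chr)
```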

Improved SQL

Preliminary Databricks Spark SQL backend (#1377).
Joins
- *_join() after full_join() works again (@mgirlich, #1178).
- *_join() now allows specifying the relationship argument. It must be
  NULL or "many-to-many" (@bairdj, #1305).
- Queries now qualify * with the table alias for better compatibility
  (@mgirlich, #1003).
- full_join() can now handle column names that only differ in case
  (@ejneer, #1255).
- The na_matches argument of semi_join() and anti_join() works again
  (@mgirlich, #1211).
- A semi/anti_join() on filtered y is inlined when possible (@mgirlich, #884).
- Joins now work again for Pool and Oracle connections (@mgirlich, #1177, #1181).
A sequence of union() or union_all() calls now produces a flat query
instead of subqueries (@mgirlich, #1269).
Added translations for:
- nzchar() (@MichaelChirico, @mgirlich, #1094).
- str_detect(), str_starts() and str_ends() with fixed patterns
  (@mgirlich, #1009).
- runif() (@mgirlich, #1200).
if_any() and if_all() translations are now wrapped in parentheses,
so that they can be combined via & with other conditions
(@mgirlich, #1153).
nth(), first(), and last() now support the na_rm argument
(@mgirlich, #1193).

Minor improvements and bug fixes

across() now supports namespaced functions, e.g.
across(x, dplyr::dense_rank) (@mgirlich, #1231).
db_copy_to(overwrite = TRUE) now actually works.
db_copy_to()'s ... are now passed to db_write_table() (@mgirlich, #1237).
Added db_supports_table_alias_with_as() to customise whether a backend
supports specifying a table alias with AS or not (@mgirlich).
db_write_table() and db_save_query() gain the overwrite argument.
dbplyr_pivot_wider_spec() is now exported. Unlike pivot_wider() this can
be lazy. Note that this will be removed soon after pivot_wider_spec()
becomes a generic (@mgirlich).
filter()ing with window functions now generates columns called col01
rather than q01 (@mgirlich, #1258).
pivot_wider() now matches tidyr NA column handling (@ejneer, #1238).
select() can once again be used after arrange(desc(x)) (@ejneer, #1240).
show_query() and remote_query() gain the argument sql_options, which
controls how the SQL is generated. It can be created via sql_options(),
which has the following arguments:
- cte: use common table expressions?
- use_star: use SELECT * or explicitly select every column?
- qualify_all_columns: qualify all columns in a join or only the ambiguous ones?
  (@mgirlich, #1146).
Consequently the cte argument of show_query() and remote_query() has
been deprecated (@mgirlich, #1146).
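The sql_options argument described above might be used along these lines. This is a sketch on a lazy_frame() with a simulated connection, so no database is needed; the column names and the particular option values are arbitrary choices for illustration.

```r
library(dplyr, warn.conflicts = FALSE)
library(dbplyr)

# lazy_frame() is backed by a simulated connection; nothing is executed
lf <- lazy_frame(x = 1, y = 2, con = simulate_dbi())

# Ask for common table expressions and an explicit column list instead of *
opts <- sql_options(cte = TRUE, use_star = FALSE)

# Print the generated SQL
lf %>%
  filter(x > 1) %>%
  mutate(z = x + y) %>%
  show_query(sql_options = opts)

# remote_query() returns the SQL instead of printing it
query <- lf %>%
  filter(x > 1) %>%
  remote_query(sql_options = opts)
```

Passing one options object keeps the settings together, which is presumably why the separate cte argument was deprecated.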
slice_min/max() can now order by multiple variables like dplyr, e.g. use
slice_min(lf, tibble(x, y)) (@mgirlich, #1167).
slice_*() now supports the data masking pronouns .env and .data (@mgirlich, #1294).
sql_join_suffix() gains the argument suffix so that methods can check
whether the suffix is valid for the backend (@mgirlich).
sql_random() is now deprecated. It was used to power slice_sample() which
is now done via the translation for runif() (@mgirlich, #1200).
tbl() now informs when the user probably forgot to wrap the table identifier
with in_schema() or sql() (@mgirlich, #1287).

Backend specific improvements

Access
- Added translation for != to <> (@erikvona, #1219).
DuckDB
- Now supports the returning argument of rows_*().
MySQL/MariaDB:
- rows_update() and rows_patch() now give an informative error when the
  unsupported returning argument is used (@mgirlich, #1279).
- rows_upsert() now gives an informative error that it isn't supported
  (@mgirlich, #1279).
- rows_*() use the column types of x when auto copying y (@mgirlich, #1327).
- copy_inline() now works (@mgirlich, #1188).
- Fix translation of as.numeric(), as.POSIXct(), as_datetime(), and
  as.integer64() (@avsdev-cw, #1189).
MS SQL:
- row_number() now works when no order is specified (@ejneer, @fh-mthomson, #1332).
Oracle:
- Fix translation of rows_upsert() (@mgirlich, @TBlackmore, #1286).
- head(n) is now translated to WHERE ROWNUM <= n to also support old
  versions <= 11.2 (@JeremyPasco, #1292).
Postgres
- The rows_*() functions now also work inside a transaction (@mgirlich, #1183).
SQLite
- Subqueries now also get an alias. This makes it consistent with other
  backends and simplifies the implementation.
SQL Server
- distinct(.keep_all = TRUE) now works (@mgirlich, #1053).
- The translation of between() now also works when used in mutate()
  (@mgirlich, #1241).
- any() and all() now work (@ejneer, #1273).
- Fixed negation of bit (boolean) fields (@ejneer, #1239).
Snowflake:
- na.rm = TRUE is now respected in pmin() and pmax() instead of being silently ignored (@fh-mthomson, #1329).
- row_number() now works when no order is specified (@fh-mthomson, #1332).
Teradata
- distinct() + head() now work (@mgirlich, #685).
- as.Date(x) is now translated to CAST(x AS DATE) again unless x is a
  string (@mgirlich, #1285).
- row_number() no longer defaults to partitioning by groups. It is now aligned with other databases when no order is specified: ROW_NUMBER() defaults to ORDER BY (SELECT NULL) (@fh-mthomson, #1331).
