Is tidypolars slower than polars?
No, or just marginally. The objective of tidypolars is
not to modify the data, simply to translate the
tidyverse syntax to polars syntax.
polars is still in charge of doing all the data
manipulations under the hood.
Therefore, there might be minor overhead because we still need to
parse the expressions and rewrite them in polars syntax
(see the Parsing
expressions vignette) but this should be marginal.
Am I stuck with tidypolars?
No, tidypolars will always return
DataFrames, LazyFrames or Series.
Therefore, if at some point you want to use polars because
you need more control or because you want to reduce your number of
dependencies, you can easily do so.
Do I still need to load polars?
Yes, because tidypolars doesn’t provide any functions to
create polars DataFrame or
LazyFrame or to read data. You’ll still need to use
polars for this.
Can I see some benchmarks with other tools?
Making accurate benchmarks of data wrangling tools is difficult and I won’t try to do it here (I just put one of the homepage to give an idea of the performance, but it is by no means comprehensive). You should refer to DuckDB benchmarks.
