Stream output to an NDJSON file — sink

This writes the output of a query directly to an NDJSON file without collecting it in the R session first. This is useful when the output of the query is larger than the available RAM, since collecting it into R would crash the session.

Usage

sink_ndjson(
  .data,
  path,
  ...,
  maintain_order = TRUE,
  type_coercion = TRUE,
  predicate_pushdown = TRUE,
  projection_pushdown = TRUE,
  simplify_expression = TRUE,
  slice_pushdown = TRUE,
  no_optimization = FALSE
)

Arguments

.data: A Polars LazyFrame.
path: Output file.
...: Ignored.
maintain_order: Whether to maintain the order in which the data was processed (default is TRUE). Setting this to FALSE will be slightly faster.
type_coercion: Coerce types such that operations succeed and run on minimal required memory (default is TRUE).
predicate_pushdown: Applies filters as early as possible at scan level (default is TRUE).
projection_pushdown: Select only the columns that are needed at the scan level (default is TRUE).
simplify_expression: Various optimizations, such as constant folding and replacing expensive operations with faster alternatives (default is TRUE).
slice_pushdown: Only load the required slice from the scan level. Don't materialize sliced outputs (default is TRUE).
no_optimization: Sets the following optimizations to FALSE: predicate_pushdown, projection_pushdown, slice_pushdown, simplify_expression. Default is FALSE.

Value

The input LazyFrame.
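
Examples

The following is a minimal sketch of using sink_ndjson() at the end of a piped workflow. It assumes that the tidypolars and dplyr packages are installed and that the input is read lazily with scan_ndjson_polars(). The file paths, column names (id, value, doubled), and data are hypothetical and chosen only for illustration. The data here would easily fit in memory, so the example shows the shape of the workflow rather than a case where streaming is required.

```r
library(dplyr)       # provides the filter() and mutate() generics
library(tidypolars)  # assumed to provide scan_ndjson_polars() and sink_ndjson()

# Hypothetical temporary files for the NDJSON input and output
in_path <- tempfile(fileext = ".ndjson")
out_path <- tempfile(fileext = ".ndjson")

# Write a small NDJSON input by hand: one JSON object per line
writeLines(
  c(
    '{"id": 1, "value": 10.5}',
    '{"id": 2, "value": 3.2}',
    '{"id": 3, "value": 8.1}'
  ),
  in_path
)

# Read the file as a LazyFrame, transform it, and stream the result to
# another NDJSON file without collecting it in the R session
scan_ndjson_polars(in_path) |>
  filter(value > 5) |>
  mutate(doubled = value * 2) |>
  sink_ndjson(path = out_path)

# Inspect the written file (small enough here to read back directly)
readLines(out_path)
```

If row order in the output does not matter, passing maintain_order = FALSE to sink_ndjson() may be slightly faster, as described under Arguments.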