Rushikesh Padia
07/17/2024, 11:57 PMjay
07/18/2024, 1:30 AMstrategy
keyword arg: https://www.getdaft.io/projects/docs/en/latest/api_docs/doc_gen/dataframe_methods/daft.DataFrame.join.html
Daft’s speedups over Spark come from:
• Vectorized execution
• Much faster/optimized reads from cloud storage (specifically AWS S3), written in async Rust
• Lower overhead wrt JVMjay
07/18/2024, 1:30 AMRushikesh Padia
07/18/2024, 6:46 PM