Slackbot
02/22/2024, 7:45 AMSammy Sidhu
02/22/2024, 9:02 AMdf = daft.read_iceberg(my_table)
# Partition Filter to prune out much needed data
df = df.where(df["partition_col"] < SOME_VAL)
# selected columns (all others will be pruned out)
df = df.select("i", "j", "k")
con = duckdb.connect()
# Zero copy version to arrow
my_arrow_table = df.to_arrow()
results = con.execute("SELECT * FROM my_arrow_table WHERE i = 2")
We plan on specing out SQL support soon! We want to ensure that we do a good job!Ashish Yadav
02/22/2024, 9:11 AM