Phil Chen
09/20/2024, 7:16 PMDesmond Cheong
09/20/2024, 7:26 PMPhil Chen
09/20/2024, 7:38 PMColin Ho
09/20/2024, 8:12 PMnum_partitions=1 , could you try removing it and see if it works?Phil Chen
09/20/2024, 8:42 PMPhil Chen
09/20/2024, 8:54 PMPhil Chen
09/20/2024, 8:56 PMPhil Chen
09/22/2024, 11:23 PMDesmond Cheong
09/22/2024, 11:25 PMDesmond Cheong
09/23/2024, 12:22 AMPhil Chen
09/26/2024, 7:06 PMDesmond Cheong
09/26/2024, 7:07 PMPhil Chen
09/26/2024, 7:15 PMDesmond Cheong
09/26/2024, 7:24 PMColin Ho
09/26/2024, 7:30 PMschema = {
"path": daft.DataType.string(), # This should override the "path" column to be a String
}
df = daft.read_sql(f"SELECT * FROM TABLE", create_conn, schema=schema)
which should help override the column type for 'path' to always be a StringPhil Chen
09/26/2024, 7:31 PMColin Ho
09/26/2024, 7:34 PMwrite_parquet issue then. Got it.Colin Ho
09/26/2024, 7:35 PMimport daft
df = daft.from_pydict({"foo": [1, 2, 3], "bar": ["a", "b", "c"]}).into_partitions(4).write_parquet("z"))
print(df)Colin Ho
09/26/2024, 7:36 PMPhil Chen
09/26/2024, 8:00 PMColin Ho
10/02/2024, 2:17 AMPhil Chen
10/02/2024, 2:31 AM