Phil Chen
08/15/2024, 8:11 PMjay
08/15/2024, 8:19 PMdaft.context.set_runner_ray(address="<ray://address-to-ray-head-with-port-usually-at:10001>")
Running as a Ray job (recommended for production jobs)
You can write a script (e.g. my_script.py
) and then run your script as a Ray job:
ray job submit \
--working-dir wd \
--address "http://<head_node_host>:8265" \
--runtime-env-json '{"pip": ["getdaft"]}' \
-- python job.py
# From your script, it should automatically detect the Ray cluster since it is being run from inside the job
daft.context.set_runner_ray()
Colin Ho
08/15/2024, 8:25 PMPhil Chen
08/15/2024, 8:29 PMPhil Chen
08/15/2024, 8:32 PMColin Ho
08/15/2024, 8:39 PMPhil Chen
08/16/2024, 1:37 AMPhil Chen
08/16/2024, 1:41 AMPhil Chen
08/16/2024, 1:45 AMPhil Chen
08/16/2024, 1:59 AMPhil Chen
08/16/2024, 2:06 AMjay
08/16/2024, 2:33 AMPhil Chen
08/16/2024, 2:53 AMjay
08/16/2024, 2:54 AMjay
08/16/2024, 2:55 AMPhil Chen
08/16/2024, 8:27 AMPhil Chen
08/16/2024, 2:01 PMPhil Chen
08/16/2024, 2:14 PMPhil Chen
08/16/2024, 2:40 PMjay
08/16/2024, 2:40 PMjay
08/16/2024, 2:42 PMPhil Chen
08/16/2024, 6:47 PMColin Ho
08/16/2024, 6:48 PMschema
and infer_schema
right now! should have a PR out soonColin Ho
08/16/2024, 8:06 PMColin Ho
08/16/2024, 8:25 PMColin Ho
08/16/2024, 8:28 PMPhil Chen
08/16/2024, 9:00 PMColin Ho
08/21/2024, 3:39 AMinfer_schema
and schema
features should be ready for you to try. let me know how it goes!Phil Chen
08/21/2024, 9:48 AMPhil Chen
08/21/2024, 2:54 PMPhil Chen
08/27/2024, 11:57 PMPhil Chen
08/28/2024, 12:18 AMColin Ho
08/28/2024, 1:16 AMcount_big
instead.Phil Chen
08/28/2024, 2:00 AMColin Ho
08/28/2024, 11:38 PMcount_big
semantics: https://github.com/tobymao/sqlglot/pull/3996 Will update once everything goes throughPhil Chen
08/29/2024, 12:24 AMColin Ho
09/04/2024, 6:10 PMpip install sqlglot==25.19.0
, this should allow Daft to use count_big nowPhil Chen
09/04/2024, 6:16 PMPhil Chen
09/09/2024, 2:33 PMColin Ho
09/09/2024, 5:01 PM