How would it be possible to load in data from a hd...
# general
k
How would it be possible to load in data from a hdfs address? It works in pyarrow's parquetdataset as just the hdfs filepath but i wasn't able to use the same path in daft
It's kind of like a mounted path but daft rejects it because it starts with hdfs://.. pandas also allows the path to be read
c
We don't yet support hdfs, if you want to open up an issue on github, we can work on getting it added in an upcoming release!
j
hdfs is very interesting… I’m not sure what the best way of supporting it would be tbh 😬 and I don’t think we can squeeze out a ton of performance from it since I believe we have to go through the JVM. Something like: https://github.com/Xuanwo/hdrs Feel free to open up an issue @Kyle!
k
Okay thanks!!!