Ryan Sudhakaran
06/13/2024, 8:36 PMwith_column
and am consistently running into PanicException: ('not implemented: List casting not implemented for dtype: Embedding[Float32; 1536]',)
Ryan Sudhakaran
06/13/2024, 8:37 PMdf.with_column(
"embedding",
df["words"].apply(lambda word: get_embedding(word), return_dtype=daft.DataType.embedding(dtype=daft.DataType.float32(), size=1536))
).show(2)
Ryan Sudhakaran
06/13/2024, 8:37 PMget_embedding
rn outputs a list of float32sjay
06/13/2024, 11:47 PMList -> Embedding
!
I just took a look and it seems like our Python -> Embedding
casting is working. So something like this seems to workβ¦
df.with_column(
"embedding",
df["words"].apply(
lambda word: get_embedding(word),
return_dtype=daft.DataType.python()
).cast(
daft.DataType.embedding(
dtype=daft.DataType.float32(),
size=1536,
)
)
).show(2)
jay
06/13/2024, 11:56 PMRyan Sudhakaran
06/14/2024, 4:16 AMjay
06/14/2024, 4:17 AMjay
06/14/2024, 4:18 AMRyan Sudhakaran
06/14/2024, 4:19 AMjay
06/14/2024, 4:57 PMEmbedding
case, which likely will just go through the FixedSizeList
case and then become wrapped in an Embedding
logical type π