-
|
there must be a better way ? |
Beta Was this translation helpful? Give feedback.
Replies: 9 comments 6 replies
-
|
Perhaps this? TABLE_NAMES = ["lineitem", "nation", "region", "customer", "supplier", "orders", "partsupp", "part"]
table_dfs = {
name: daft.read_parquet(f"{path}/{sf}/{name}/*.parquet") for name in TABLE_NAMES
} |
Beta Was this translation helpful? Give feedback.
-
|
that does not help much I am getting Table not found: lineitem |
Beta Was this translation helpful? Give feedback.
-
|
in polars as an example, I can just do this |
Beta Was this translation helpful? Give feedback.
-
|
Oh! I see what you're trying to do now. @universalmind303 can probably advise better, but you might want to use our SQLCatalog abstraction here. from daft.sql import SQLCatalog
TABLE_NAMES = ["lineitem", "nation", "region", "customer", "supplier", "orders", "partsupp", "part"]
table_dfs = {
name: daft.read_parquet(f"{path}/{sf}/{name}/*.parquet") for name in TABLE_NAMES
}
catalog = SQLCatalog(table_dfs)
daft.sql("SELECT * FROM lineitem", catalog=catalog)Do bear with us as we work on a better catalog API. We have some proposals in the works here for unifying the story around iceberg/delta/HMS etc. |
Beta Was this translation helpful? Give feedback.
-
|
please can we have this |
Beta Was this translation helpful? Give feedback.
-
|
that's perfect !!! |
Beta Was this translation helpful? Give feedback.
-
|
can we register iceberg catalog please |
Beta Was this translation helpful? Give feedback.
-
|
@jaychia maybe just start with register_iceberg_rest first then figure out HMS later, the combo daft/polaris is very interesting actually |
Beta Was this translation helpful? Give feedback.
-
|
It is !! I am in the next stage 😛 |
Beta Was this translation helpful? Give feedback.
I'm trying to move towards #3036 which is more like:
This also opens up opportunities for something like
Which lets us register entire catalog services to start reading tables from