Replies: 3 comments 4 replies
-
Thanks for starting this discussion @Xuanwo. I personally am very interested in exploring this. I am not fully up-to-speed on the Comet/Iceberg integration work, so I don't have a mental model for how Comet would integrate with iceberg-rs. Perhaps @huaxingao or @parthchandra may have thoughts/questions on this proposal. |
Beta Was this translation helpful? Give feedback.
-
This is worth exploring, but one must first understand what Comet brings to the table -
Since Datafusion has its own parquet reader, it is unlikely that (1) would be immediately usable. One can potentially look into implementing similar file reading improvements into the datafusion parquet reader but we do not have plans to do that in Comet as yet. For the native decoding, does the Iceberg Rust implementation intend to integrate with Spark. If so, then we have one potential point of integration. |
Beta Was this translation helpful? Give feedback.
-
@parthchandra gave a good intro. It is important to know what Comet did on the native reader before considering this. Comet's native reader is not a fully native implementation that can be easily replaced or integrated with other native implementations from DataFusion or Iceberg Rust. It is a hybrid of Java implementation and Rust implementation. |
Beta Was this translation helpful? Give feedback.
-
Hi, following @andygrove's encouragement, I've started this thread to continue the discussion from the Iceberg Slack regarding the overlap between this project and the efforts to integrate Iceberg support into DataFusion Comet.
Current Status
From @andygrove
Future Possibility
This discussion inspired me to consider the possibility that a comet can collaborate with iceberg-rust and datafusions to implement read/write operations natively.
From @Xuanwo
From @liurenjie1024
What do you think?
Beta Was this translation helpful? Give feedback.
All reactions