Python: support get partition table with filter #8619

Closed
puchengy opened this issue Sep 22, 2023 · 4 comments
Comments

@puchengy
Contributor

Feature Request / Improvement

Support retrieving a table's partitions, especially with a filter.

This is especially helpful for Airflow sensor jobs.

Query engine

None

@puchengy
Contributor Author

@Fokko Hi, may I know if we have all the building blocks to implement this feature? Thanks!

@Fokko
Contributor

Fokko commented Sep 22, 2023

@puchengy Can you provide more context? Would you like to block on a partition using an Airflow sensor? I've been thinking about this for a while. For the CDC use case, we have a property that we block on: apache/airflow#32786. We could also check whether a particular partition exists.
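
A minimal sketch of the "check whether a particular partition exists" idea, assuming partitions can be listed as plain dictionaries of partition-field values. The names `partition_exists` and `list_partitions` are hypothetical and used only for illustration; they are not PyIceberg or Airflow APIs. A sensor could run a check like this on each poke.

```python
from typing import Any, Callable, Iterable, Mapping

Partition = Mapping[str, Any]


def partition_exists(
    list_partitions: Callable[[], Iterable[Partition]],
    wanted: Mapping[str, Any],
) -> bool:
    """Return True if any listed partition matches every key/value in `wanted`."""
    return any(
        all(key in partition and partition[key] == value for key, value in wanted.items())
        for partition in list_partitions()
    )


# In-memory stand-in for a real partition listing (hypothetical data).
def list_partitions() -> list[dict[str, Any]]:
    return [
        {"dt": "2023-09-21", "region": "us"},
        {"dt": "2023-09-22", "region": "eu"},
    ]


# The filter is a partial match: only the fields named in `wanted` are compared.
ready = partition_exists(list_partitions, {"dt": "2023-09-22"})
missing = partition_exists(list_partitions, {"dt": "2023-09-23"})
```

Passing the listing as a callable keeps the check independent of how partitions are actually fetched, so the same function could be reused once a real partition listing is available.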

@puchengy
Contributor Author

Hi Fokko, so you are saying that, instead of computing the partitions, we want to store which partitions are produced by a given snapshot in the snapshot summary metadata? If so, how do we deal with snapshot expiration? (This might also be a question for your Airflow PR.)

@Fokko
Contributor

Fokko commented Oct 2, 2023

@puchengy Could you maybe elaborate on the use case in apache/iceberg-python#24? We've moved PyIceberg to a separate repository.

@Fokko Fokko closed this as completed Oct 2, 2023