Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pyiceberg_core] Expose IcebergTableProvider to python #865

Open
kevinjqliu opened this issue Jan 2, 2025 · 3 comments
Open

[pyiceberg_core] Expose IcebergTableProvider to python #865

kevinjqliu opened this issue Jan 2, 2025 · 3 comments
Assignees

Comments

@kevinjqliu
Copy link
Contributor

Inspired by #650

I want to expose IcebergTableProvider to Datafusion as python binding using Custom Table Provider

Integration with Python might look something like,

from pyiceberg_core import table_provider
from datafusion import SessionContext

ctx = SessionContext()
iceberg_table_provider = table_provider.create_table_provider(
    metadata_location=metadata_location
)
ctx.register_table_provider("test", iceberg_table_provider)
table = ctx.table("test")
table.show()
@kevinjqliu kevinjqliu self-assigned this Jan 2, 2025
@kevinjqliu
Copy link
Contributor Author

Possibly blocked by apache/datafusion#13851

@kevinjqliu
Copy link
Contributor Author

kevinjqliu commented Jan 8, 2025

Got an example working by building the latest datafusion* libraries locally.
Requires apache/datafusion#13937 and new versions of datafusion* libraries with the PR. Possibly included in the next datafusion release, v45

@Xuanwo
Copy link
Member

Xuanwo commented Jan 8, 2025

That's nice! Thank you @kevinjqliu for pushing forward on this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants