Python write support #23


Closed
3 of 4 tasks
Fokko opened this issue Oct 2, 2023 · 2 comments · Fixed by #41

Comments

@Fokko
Contributor

Fokko commented Oct 2, 2023

Feature Request / Improvement

This is a placeholder ticket for implementing write support for PyIceberg.

Since we don't want PyIceberg to write the actual data itself, but only to handle the metadata part of the Iceberg table format, we need an overview of the frameworks we most likely want to integrate with (PyArrow, Dask (fastparquet?), etc.).

I would suggest the following first steps to keep it simple: Write using PyArrow (since that's the most commonly used FileIO) and start with unpartitioned tables.

What we need:

@mgmarino
Contributor

It looks like this is partially done (🎉), but writing to Parquet files is not yet supported, even though the associated ticket is closed. Am I reading this correctly? Thanks!

@Fokko
Contributor Author

Fokko commented Oct 11, 2023

@mgmarino Writing is part of #41 👍 Writing the parquet file is actually quite trivial.


2 participants