Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Arrow: Support int8 and int16 types #391

Merged
merged 1 commit into from
Feb 8, 2024
Merged

Arrow: Support int8 and int16 types #391

merged 1 commit into from
Feb 8, 2024

Conversation

Fokko
Copy link
Contributor

@Fokko Fokko commented Feb 7, 2024

I've checked with Spark, and here byte and short types are converted to integers. I think it makes sense to do this for Arrow as well.

Closes #378

@Fokko Fokko changed the title Arrow: Support int8 and int16 types Arrow: Support int8 and int16 types Feb 7, 2024
I've checked with Spark, and here byte and short types are converted
to integers. I think it makes sense to do this for Arrow as well.
@geruh
Copy link
Contributor

geruh commented Feb 7, 2024

Does this imply when we convert an Iceberg table to an arrow schema there will be a loss in the int8 and int 16 types?

@Fokko
Copy link
Contributor Author

Fokko commented Feb 7, 2024

@geruh The types will be widened. But when you write the data, the wider type will be used (resulting in more memory/disk usage).

@Fokko Fokko merged commit d7dcd99 into apache:main Feb 8, 2024
6 checks passed
@Fokko Fokko deleted the fd-int16 branch February 8, 2024 07:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add support for writing large_string and int16
4 participants