Use Polars to avoid data conversion from arrow to pandas #4033
Yicong-Huang
started this conversation in
Ideas
Replies: 1 comment 2 replies
Thanks for the suggestion. Do you think it's easy to adopt Polars?
Currently, our Tuple and Table in Python are implemented as a pandas Series and DataFrame, respectively. We need to convert the Arrow data to pandas to process the UDF, then convert the results back to Arrow to send them back to the JVM.
Polars is a DataFrame API that supports Arrow as its (columnar) data format. I think we could switch to Polars DataFrames (possibly as a drop-in replacement for pandas DataFrames), for two benefits: