GH-46611: [Python][C++] Allow building float16 arrays without numpy #46618

raulcd · 2025-05-27T17:17:29Z

Rationale for this change

When we added Float16 we did not update pyarrow to be able to convert from Python objects to Arrow. Float16 required numpy and it crashed if numpy was not present.

What changes are included in this PR?

Allow to not require numpy to generate float16 scalars and arrays on pyarrow and do not fail if numpy is not present.

Are these changes tested?

Yes, new tests have been added

Are there any user-facing changes?

No changes for old functionality. Users will be allowed to use float16 without requiring to use np.float16 and directly from Python objects

GitHub Issue: [Python] Float16 conversion crashes if NumPy is not installed #46611

…umpy

…alar.as_py

raulcd · 2025-05-27T18:15:14Z

@WillAyd @pitrou what would be your expectation here:

        arr = np.array([1.5, np.nan], dtype=np.float16)
        a = pa.array(arr, type=pa.float16())
        x, y = a.to_pylist()

isinstance(x, float) or isinstance(x, np.float16)?
Currently we convert to np.float16 but in my opinion we should return a Python Float otherwise we unnecessarily require numpy.

pitrou · 2025-05-27T18:18:09Z

I agree a Python float should be returned. There is no reason for float16 to be different from float32 in that regard.

python/pyarrow/tests/test_array.py

python/pyarrow/src/arrow/python/helpers.cc

python/pyarrow/src/arrow/python/helpers.h

python/pyarrow/src/arrow/python/python_to_arrow.cc

python/pyarrow/tests/test_array.py

Co-authored-by: Antoine Pitrou <[email protected]>

python/pyarrow/src/arrow/python/helpers.cc

python/pyarrow/src/arrow/python/helpers.h

python/pyarrow/src/arrow/python/python_to_arrow.cc

pitrou · 2025-05-28T14:32:46Z

python/pyarrow/src/arrow/python/type_traits.h

@@ -87,15 +86,18 @@ NPY_INT_DECL(ULONGLONG, UInt64, uint64_t);

 template <>
 struct npy_traits<NPY_FLOAT16> {
-  typedef npy_half value_type;
+  typedef uint16_t value_type;


Note this could also be arrow::util::Float16, if that's easy to do.

This is requiring quite a lot of changes around pyarrow/src/arrow/python/python_to_arrow.cc, pyarrow/src/arrow/python/arrow_to_pandas.cc and haven't been able to make it work yet. I would prefer to explore updating it on a different issue

Co-authored-by: Antoine Pitrou <[email protected]>

raulcd added 2 commits May 27, 2025 18:25

apacheGH-46611: [Python][C++] Allow building float16 arrays without n…

5c0b2d1

…umpy

Add some tests exercising new conversions

833fb20

github-actions bot added Component: Python awaiting committer review Awaiting committer review labels May 27, 2025

raulcd mentioned this pull request May 27, 2025

[Python] pa.array(..., type=float16) should accept Python floats #46608

Open

Use arrow::util::Float16 instead of numpy when converting HalfFloatSc…

fea6b8e

…alar.as_py

raulcd added 3 commits May 28, 2025 09:58

Convert to float instead of np.float16

c0cca40

Fix test for numpy2

1fe86f9

Create new function PyFloat_FromHalf and deprecate PyHalf_FromHalf

e56e6ec

rok reviewed May 28, 2025

View reviewed changes

python/pyarrow/tests/test_array.py Outdated Show resolved Hide resolved

github-actions bot added awaiting changes Awaiting changes and removed awaiting committer review Awaiting committer review labels May 28, 2025

raulcd marked this pull request as ready for review May 28, 2025 09:58

raulcd requested a review from AlenkaF as a code owner May 28, 2025 09:58

raulcd requested review from pitrou and WillAyd May 28, 2025 09:58

pitrou reviewed May 28, 2025

View reviewed changes

Apply suggestions from code review

a84fffe

Co-authored-by: Antoine Pitrou <[email protected]>

github-actions bot added awaiting change review Awaiting change review and removed awaiting changes Awaiting changes labels May 28, 2025

Review comments

53b8657

github-actions bot added awaiting changes Awaiting changes and removed awaiting change review Awaiting change review labels May 28, 2025

Remove unnecessary include for numpy halfloat headers

566fe4f

github-actions bot added awaiting change review Awaiting change review and removed awaiting changes Awaiting changes labels May 28, 2025

Remove another unnecessary header

ae27eef

raulcd requested a review from pitrou May 28, 2025 14:19

raulcd requested a review from rok May 28, 2025 14:19

pitrou reviewed May 28, 2025

View reviewed changes

raulcd and others added 3 commits May 28, 2025 16:50

Apply suggestions

619b628

Co-authored-by: Antoine Pitrou <[email protected]>

Remove unnecessary variable and update error to TypeError

4624274

Remove unnecessary comment

74eedf6

github-actions bot added awaiting changes Awaiting changes and removed awaiting change review Awaiting change review labels May 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GH-46611: [Python][C++] Allow building float16 arrays without numpy #46618

GH-46611: [Python][C++] Allow building float16 arrays without numpy #46618

Uh oh!

raulcd commented May 27, 2025 •

edited by github-actions bot

Loading

Uh oh!

raulcd commented May 27, 2025

Uh oh!

pitrou commented May 27, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pitrou May 28, 2025

Uh oh!

raulcd May 29, 2025

Uh oh!

Uh oh!

GH-46611: [Python][C++] Allow building float16 arrays without numpy #46618

Are you sure you want to change the base?

GH-46611: [Python][C++] Allow building float16 arrays without numpy #46618

Uh oh!

Conversation

raulcd commented May 27, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

raulcd commented May 27, 2025

Uh oh!

pitrou commented May 27, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pitrou May 28, 2025

Choose a reason for hiding this comment

Uh oh!

raulcd May 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

raulcd commented May 27, 2025 •

edited by github-actions bot

Loading