-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DataTable init does not not replace NaNs with pd.NA with float data type #128
Comments
Looked into this issue a little bit. It seems like the problem is happening when the columns get cast to type
|
And it looks like this is getting inferred as a |
Any time you put And then there's the interesting behavior with |
@tamargrey So, I think this means we cannot have a column that has a |
@thehomebrewnerd there's a difference between these two cases below that means that the physical type conversion would, I think, be possible:
But this works and converts
|
I wonder if pandas's convert_dtypes function can help us for the process of converting NaN to pd.NA |
@tamargrey Hmmm. The conversion succeeds, but the |
As @thehomebrewnerd pointed out, pd.NA is not support for the categoircal dtype.
import numpy as np
import pandas as pd
d = {'col4': pd.Series([1, 2, pd.NA], dtype='string')}
df = pd.DataFrame(data=d)
df['col4'].astype(pd.CategoricalDtype()) |
Waiting on pandas to support pd.NA with new FloatDtype |
This issue should:
|
EvalML is currently adding support for pandas 1.2.0 |
replace_none
, which defaults to True. However, this is not working for some data types inputted into the DataTable
- The expected behavior is that all NaN-like values in the DataFrame would be pd.NAThe text was updated successfully, but these errors were encountered: