You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Initialization of Woodwork DataTable using pandas DataFrame and then numpy array causes different behavior from just initalization using numpy array.
#481
Open
angela97lin opened this issue
Jan 7, 2021
· 3 comments
If I initialize a Woodwork DataTable using a pandas DataFrame and then initialize another Woodwork DataTable using the numpy array underneath, it creates a Woodwork DataTable with category types. However, skipping the first step creates a Woodwork DataTable of ints/floats.
Printing out the underlying numpy array, it looks like the numpy array gets converted s.t. it has a dtype=object, causing Woodwork to convert to category types, but this behavior is confusing / inconsistent.
Code Sample, a copy-pastable example to reproduce your bug.
If I initialize a Woodwork DataTable using a pandas DataFrame and then initialize another Woodwork DataTable using the numpy array underneath, it creates a Woodwork DataTable with category types. However, skipping the first step creates a Woodwork DataTable of ints/floats.
Printing out the underlying numpy array, it looks like the numpy array gets converted s.t. it has a dtype=object, causing Woodwork to convert to category types, but this behavior is confusing / inconsistent.
Code Sample, a copy-pastable example to reproduce your bug.
vs
The text was updated successfully, but these errors were encountered: