Pandas and the IngestionManagerPandas class for the FeatureStore class #2032

Open
@aarsanjani

Description

Describe the bug
To write from a Pandas dataframe to the Feature Store, IngestionManagerPandas iterates through the dataframe using the .iterrows() method. Because .iterrows() returns each row as a Series with a single dtype, integer values can be upcast to floats (for example, when the row also contains float columns), regardless of the column's own dtype. When the Feature Store tries to save such a float to a feature that was configured as an Integer, it throws an error. A possible fix is to read values from the dataframe with .loc or .iloc instead, so that each value keeps its column's dtype. As it stands, this behavior will likely confuse customers when they attempt to use the .ingest method in the SDK.
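The dtype behavior described above can be seen in pandas alone, without the SDK. The following is a minimal sketch, not the SDK's actual ingestion code: the dataframe and the column names `record_id` and `score` are hypothetical, and it assumes the frame holds only integer and float columns, which is the case where `.iterrows()` upcasts integers.

```python
import numpy as np
import pandas as pd

# Hypothetical frame: one integer column and one float column.
df = pd.DataFrame({"record_id": [1, 2], "score": [0.5, 1.5]})

# iterrows() builds one Series per row. With int and float columns the
# Series gets a single float64 dtype, so the integer value is upcast.
_, first_row = next(df.iterrows())
iterrows_value = first_row["record_id"]

# Scalar access by position reads from the column, keeping its integer dtype.
iloc_value = df.iloc[0, df.columns.get_loc("record_id")]

# itertuples() is another option that keeps per-column types.
tuple_value = next(df.itertuples(index=False)).record_id

for name, value in [
    ("iterrows", iterrows_value),
    ("iloc", iloc_value),
    ("itertuples", tuple_value),
]:
    print(name, type(value).__name__, value)
```

Note that selecting a whole row with `df.iloc[0]` also returns a single-dtype Series, so the integer is only preserved when individual cells or columns are read.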

To reproduce
A clear, step-by-step set of instructions to reproduce the bug.

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots or logs
If applicable, add screenshots or logs to help explain your problem.

System information
A description of your system. Please provide:

  • SageMaker Python SDK version:
  • Framework name (eg. PyTorch) or algorithm (eg. KMeans):
  • Framework version:
  • Python version:
  • CPU or GPU:
  • Custom Docker image (Y/N):

Additional context
Add any other context about the problem here.
