You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As I'm creating demos using curator for fine-tuning function-calling models, I found that Huggingface Dataset seems to automatically process the dictionary entries. Specifically, it will impute keys with None value so that all entries in the column has the same keys I guess for compatibility.
Since curator deals with Dataset objects throughout, for function-calling settings this could be a potential issue as the argument key can be expanded with arguments from all other unrelated functions and mess up fine-tuning.
The text was updated successfully, but these errors were encountered:
richardzhuang0412
changed the title
Bug: HuggingFace Dataset Automatically
Bug: HuggingFace Dataset Automatically Expand Dictionary Keys
Jan 8, 2025
As I'm creating demos using curator for fine-tuning function-calling models, I found that Huggingface Dataset seems to automatically process the dictionary entries. Specifically, it will impute keys with None value so that all entries in the column has the same keys I guess for compatibility.
Function-Calling Example:
Output:
Since curator deals with Dataset objects throughout, for function-calling settings this could be a potential issue as the argument key can be expanded with arguments from all other unrelated functions and mess up fine-tuning.
The text was updated successfully, but these errors were encountered: