MBPP Dataset Preprocessing for Code-Optimize #105

@JieWu02

Thank you for your promising work on Code-Optimize. We greatly appreciate the effort you've put into it.

However, we've had difficulty reproducing the second step of annotation. Specifically, the MBPP dataset provided in the code contains the following keys:
['prompt', 'test', 'entry_point'].

On the other hand, the MBPP datasets we have found online typically contain keys such as:
['task_id', 'text', 'code', 'test_list', 'test_setup_code'].

It seems there might be a missing or unclear preprocessing step that is causing this discrepancy. Could you kindly clarify this step for us, or point us in the right direction?
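
To make the question concrete, here is a minimal sketch of the kind of conversion we would guess at. It is only our assumption, not the preprocessing actually used in Code-Optimize: `convert_record` and `find_entry_point` are hypothetical helper names, the `prompt` layout (signature plus the task text as a docstring) and the `test` layout (setup code followed by the assert lines) are guesses, and `sample` is a made-up record rather than a real MBPP entry. It needs Python 3.9+ for `ast.unparse`.

```python
import ast


def find_entry_point(code, test_list):
    """Guess the entry point: the first top-level function in `code`
    that is called by name somewhere in the test assertions."""
    functions = [n for n in ast.parse(code).body if isinstance(n, ast.FunctionDef)]
    called = {
        node.func.id
        for test in test_list
        for node in ast.walk(ast.parse(test))
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Name)
    }
    for fn in functions:
        if fn.name in called:
            return fn
    raise ValueError("no defined function is called in the tests")


def convert_record(record):
    """Map a record with the public MBPP keys to {prompt, test, entry_point}.

    Assumption: the target format is HumanEval-like. This has not been
    confirmed by the Code-Optimize authors.
    """
    fn = find_entry_point(record["code"], record["test_list"])
    signature = ast.unparse(fn.args)  # e.g. "a, b"
    prompt = f'def {fn.name}({signature}):\n    """{record["text"]}"""\n'
    setup = record.get("test_setup_code") or ""
    test_lines = ([setup] if setup else []) + list(record["test_list"])
    return {
        "prompt": prompt,
        "test": "\n".join(test_lines),
        "entry_point": fn.name,
    }


# Made-up record using the key names found in the public MBPP releases.
sample = {
    "task_id": 0,
    "text": "Write a function to add two numbers.",
    "code": "def add(a, b):\n    return a + b",
    "test_list": ["assert add(1, 2) == 3", "assert add(0, 0) == 0"],
    "test_setup_code": "",
}

converted = convert_record(sample)
print(sorted(converted))
```

If the actual preprocessing differs (for example, in how `prompt` is built or whether `test` wraps the asserts in a `check` function), knowing the exact format would let us match it.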

Looking forward to your response, and thank you once again for your valuable contributions.

Best regards,
