-
|
Hi all, naive question: in the eval data configuration the icl_task entries use a variable I'm trying to use a custom eval dataset by adding an icl_task entry to config.yaml in eval_bundle. I'm just wondering what parameters I should use. The custom eval data is similar to Thanks! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
|
Hi! For evaluation purposes, it matters, because a zero-shot setting will typically produce worse results than let's say a 5-shot setting. If you're defining your own evaluation/dataset, you might want to try both to see the difference in performance. |
Beta Was this translation helpful? Give feedback.
Hi!
num_fewshotrefers to how many examples are given up-front to the LLM. Incore_evalyou can see thattask_meta['num_fewshot']number of such examples are sampled from the dataset (ensuring, of course, that the current example we're testing is not being sampled).For evaluation purposes, it matters, because a zero-shot setting will typically produce worse results than let's say a 5-shot setting. If you're defining your own evaluation/dataset, you might want to try both to see the difference in performance.