Hi! num_fewshot is the number of solved examples placed in the prompt ahead of the example being evaluated. In core_eval you can see that task_meta['num_fewshot'] such examples are sampled from the dataset, with the example currently under test excluded from the sampling.
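To make the sampling step concrete, here is a minimal sketch of the idea. It is not the repository's actual core_eval code: the function names, the seed handling, and the "Q:/A:" prompt layout are all assumptions made for illustration.

```python
import random


def sample_fewshot_indices(num_examples, current_idx, num_fewshot, seed=1234):
    """Pick num_fewshot distinct dataset indices, never the one under test.

    Hypothetical helper; the seeding scheme is an assumption of this sketch.
    """
    if num_fewshot <= 0:
        return []  # zero-shot: no demonstrations in the prompt
    # Seed per example so the same demonstrations are chosen on every run.
    rng = random.Random(seed + current_idx)
    # Exclude the example being evaluated so its answer cannot leak in.
    candidates = [i for i in range(num_examples) if i != current_idx]
    return rng.sample(candidates, num_fewshot)


def build_prompt(data, current_idx, num_fewshot):
    """Put the sampled demonstrations first, then the unanswered test question."""
    shot_indices = sample_fewshot_indices(len(data), current_idx, num_fewshot)
    blocks = [f"Q: {data[i]['question']}\nA: {data[i]['answer']}" for i in shot_indices]
    blocks.append(f"Q: {data[current_idx]['question']}\nA:")
    return "\n\n".join(blocks)
```

With num_fewshot set to 0 the prompt contains only the test question; with 5 it contains five solved examples followed by the test question. Note that random.Random.sample raises ValueError if num_fewshot exceeds the number of remaining examples.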

For evaluation purposes this matters, because a zero-shot setting will typically produce worse results than, say, a 5-shot setting. If you're defining your own evaluation or dataset, you might want to try both to see the difference in performance.

Answer selected by svlandeg