Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Paper -> CoT pipeline: Optimize prompt for paper grading rubric #55

Open
thehunmonkgroup opened this issue Sep 29, 2024 · 3 comments
Open
Assignees

Comments

@thehunmonkgroup
Copy link
Collaborator

Requirements:

  • Finalize rubric questions
  • Optimize rest of prompt if necessary

Deliverable: the final prompt

@thehunmonkgroup thehunmonkgroup self-assigned this Sep 29, 2024
@thehunmonkgroup
Copy link
Collaborator Author

Here are the current rubric questions, the model is instructed to provide a simple yes/no answer to the question, along with an explanation of the reasoning for the answer:

  • Is there a clear, well-defined central question explicitly stated in the paper?
  • Does the paper provide a definitive answer to this central question?
  • Is the answer derived through multi-step reasoning that includes at least 3 distinct logical steps or connections?
  • Is the reasoning leading to the answer logically coherent and well-structured?
  • Can the reasoning be explained to a layperson (defined as an educated adult without specific expertise in the paper's field) with some effort?
  • Does the paper minimize jargon in the reasoning process, or does it explain necessary technical terms used to derive the answer?
  • Are there illustrative examples or analogies in the reasoning that aid in understanding the answer?
  • Does the reasoning provide significant insights or depth specifically related to the question and its answer?
  • Does the paper provide sufficient information for the key reasoning steps to be independently verified or reproduced?
  • Is the paper suitable for extracting a clear question and an answer arrived at by comprehensible, complex reasoning?

@thehunmonkgroup
Copy link
Collaborator Author

Here is the full archive of artifacts (inference data for profiling and CoT extraction, JSONL training files) for a test run of 100 papers through the pipeline:

paper-cot-extraction-test-data.tar.gz

The CoT extraction logs are in results/inference with -paper-profiling.txt extension.

@thehunmonkgroup
Copy link
Collaborator Author

We've decided the current list of questions is sufficient until we need to run a larger number of papers, we'll need funding for that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant