This project explores various approaches to improving how large language models focus on the provided context. The folders are described below:
- `contrastive_activation_addition`: The main folder, containing our experiments that use activation steering (Contrastive Activation Addition) to steer large language models towards the context. The README inside this folder explains how to run the scripts to reproduce the results. A minimal sketch of the steering idea appears after this list.
- `entropy_decoding`: The entropy decoding approach and some of the results obtained on the MemoTrap dataset.
- `multitoken_entropy_decoding`: The entropy decoding approach applied to a few multi-token examples.
- `baseline_comparison`: All baseline methods (including CAD, negative decoding, and filtered negative decoding) run on the same multi-token examples to compare the results.
- `tutorials`: The initial explorations and the basics of using LLMs for decoding are documented here.
- `context_aware_decoding`: The code for our context-aware decoding approach and the experiments that go with it; a sketch of the decoding rule appears after this list.
- `my_refusal_code`: An attempt to recreate the paper "Refusal in Language Models Is Mediated by a Single Direction" from scratch. There is also some code that relies on dependencies imported from the official repository to recreate the steering approach as the authors did it.
- `auto_evaluation`: Scripts for performing GPT evaluations using LangChain and OpenAI; a minimal judge-call sketch appears after this list.
- `instruction_following_eval`: The code for running IFEval on the outputs of any model or pipeline. Instructions on how to use it are in that folder's README. (We use this in the contrastive activation addition approach.)
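
For orientation, here is a minimal sketch of the activation-steering idea behind Contrastive Activation Addition: extract a steering vector as the difference of layer activations on a contrastive prompt pair, then add it to the residual stream during generation. The model name, layer index, prompts, and multiplier are illustrative assumptions, not the exact setup used in `contrastive_activation_addition`.

```python
# A minimal sketch, not the project's implementation: the model, layer,
# prompts, and multiplier below are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-7b-chat-hf"  # hypothetical choice of model
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
LAYER = 13  # which layer to steer at; a tunable hyperparameter

def mean_activation(prompt: str) -> torch.Tensor:
    """Mean residual-stream activation at LAYER for a single prompt."""
    ids = tok(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        hidden = model(ids, output_hidden_states=True).hidden_states[LAYER]
    return hidden[0].mean(dim=0)

# Steering vector: context-faithful behaviour minus context-ignoring behaviour.
steer = mean_activation("Answer using only the given context: ...") - \
        mean_activation("Answer from your own knowledge: ...")

def add_steering(module, args, output):
    # Llama-style decoder layers return a tuple whose first element is
    # the hidden states; add the (scaled) steering vector to them.
    return (output[0] + 2.0 * steer.to(output[0].dtype),) + output[1:]

# hidden_states[LAYER] is the output of decoder layer LAYER - 1.
handle = model.model.layers[LAYER - 1].register_forward_hook(add_steering)
ids = tok("Context: ... Question: ...", return_tensors="pt").input_ids
out = model.generate(ids, max_new_tokens=50)
handle.remove()
print(tok.decode(out[0], skip_special_tokens=True))
```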
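
Similarly, a sketch of the context-aware decoding rule used as a baseline: next-token logits are a contrast between a context-conditioned pass and a context-free pass, i.e. sampling from softmax of (1 + alpha) * logits(y | context, x) - alpha * logits(y | x). The model, the alpha value, and the greedy loop below are illustrative assumptions, not the folder's exact code.

```python
# A minimal sketch of context-aware decoding (greedy variant); the model
# and alpha below are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

def cad_generate(context: str, question: str, alpha: float = 0.5,
                 max_new_tokens: int = 30) -> str:
    with_ctx = tok(context + " " + question, return_tensors="pt").input_ids
    no_ctx = tok(question, return_tensors="pt").input_ids
    generated = []
    for _ in range(max_new_tokens):
        with torch.no_grad():
            l_ctx = model(with_ctx).logits[0, -1]    # logits with the context
            l_plain = model(no_ctx).logits[0, -1]    # logits without it
        # Amplify what the context adds: (1 + alpha) * l_ctx - alpha * l_plain.
        next_id = ((1 + alpha) * l_ctx - alpha * l_plain).argmax()
        if next_id.item() == tok.eos_token_id:
            break
        generated.append(next_id.item())
        step = next_id.view(1, 1)
        with_ctx = torch.cat([with_ctx, step], dim=-1)
        no_ctx = torch.cat([no_ctx, step], dim=-1)
    return tok.decode(generated)

print(cad_generate("The capital of Freedonia is Vaduzopolis.",
                   "What is the capital of Freedonia?"))
```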
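
And for `auto_evaluation`, a minimal sketch of a GPT-as-judge call through LangChain; the model name and grading rubric are assumptions, so see the folder itself for the actual evaluation pipeline. Running it requires an `OPENAI_API_KEY` in the environment.

```python
# A minimal GPT-as-judge sketch; the model name and rubric are
# illustrative assumptions, not the auto_evaluation pipeline itself.
from langchain_openai import ChatOpenAI

judge = ChatOpenAI(model="gpt-4o-mini", temperature=0)

def grade(context: str, answer: str) -> str:
    prompt = (
        "You are grading whether an answer is faithful to its context.\n"
        f"Context: {context}\nAnswer: {answer}\n"
        "Reply with a score from 1 to 5 and a one-line justification."
    )
    return judge.invoke(prompt).content

print(grade("Paris is the capital of France.",
            "The capital of France is Paris."))
```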