Contrastive Skip-Layer Guidance (CSLG)

This repository contains the official implementation of our paper:

Contrastive Skip-Layer Guidance for Controlling Semantic Coherence in Diffusion Models
Isabell Hans, Nikolai Röhrich — LMU Munich

Overview

Current diffusion models, including Stable Diffusion 3 and FLUX, often struggle with semantically complex prompts, such as those that require rendering visible, legible text, because of a trade-off between prompt adherence and image fidelity.

Contrastive Skip-Layer Guidance (CSLG) is a training-free, prompt-agnostic method for enhancing semantic coherence in such scenarios. It does so by:

Automatically identifying task-relevant layers using contrastive prompt pairs,
Selectively skipping these layers during inference,
Combining the result with standard Classifier-Free Guidance (CFG) for improved output quality at lower guidance scales.
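The last step can be sketched as below. This is a minimal illustration that assumes the additive skip-layer guidance update used by the Stable Diffusion 3 pipeline in diffusers, where the CFG result is pushed away from a prediction made with the selected layers skipped. The function name `combine_guidance`, the two scale parameters, and the toy values are assumptions for illustration and are not taken from this repository.

```python
def combine_guidance(uncond, cond, cond_skipped, cfg_scale, skip_scale):
    """Combine CFG with a skip-layer term, element by element.

    uncond:       prediction for the empty (unconditional) prompt
    cond:         prediction for the text prompt with all layers active
    cond_skipped: prediction for the text prompt with selected layers skipped
    """
    combined = []
    for u, c, s in zip(uncond, cond, cond_skipped):
        cfg = u + cfg_scale * (c - u)                # standard classifier-free guidance
        combined.append(cfg + skip_scale * (c - s))  # move away from the degraded prediction
    return combined


# Toy 3-element "noise predictions" standing in for real model outputs.
uncond = [0.0, 1.0, 2.0]
cond = [1.0, 1.0, 4.0]
cond_skipped = [0.5, 1.0, 3.0]

guided = combine_guidance(uncond, cond, cond_skipped, cfg_scale=2.0, skip_scale=1.0)
print(guided)
```

With `skip_scale=0.0` the function reduces to plain CFG, which makes it easy to compare both settings under the same `cfg_scale`.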

Setup

Requirements

Python 3.9+
diffusers
OpenCV (for OCR-based evaluation)
FLUX or Stable Diffusion 3

Install dependencies:

pip install -r requirements.txt

Usage

This section provides a step-by-step guide to using CSLG with diffusion models like FLUX or Stable Diffusion 3.

1. Run Diffusion Model on Prompt Pairs

Generate images from pairs of prompts that differ only in one specific semantic feature (e.g. the presence of visible text, as in prompt_datasets/prompts_text_notext.json). While generating, hook into the model's forward pass to capture the intermediate activations of each layer and save them as tensors.

# FLUX
python FLUX_LayerExperiment_pairwise.py

# Stable Diffusion 3
python SD3_LayerExperiment_pairwise.py
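For readers unfamiliar with forward hooks, the capture step might look roughly like the sketch below. It uses PyTorch's `register_forward_hook` on a toy `nn.Sequential` model, which stands in for the transformer blocks of FLUX or SD3. The helper `record_activations` and the layer names are hypothetical, and real transformer blocks may return tuples rather than a single tensor, in which case the hook would need to select the relevant element.

```python
import torch
from torch import nn


def record_activations(layers):
    """Attach forward hooks that store the output tensor of each named layer."""
    activations = {}
    handles = []
    for name, layer in layers.items():
        def hook(module, inputs, output, name=name):
            # Detach and move to CPU so stored tensors do not keep the graph alive.
            activations[name] = output.detach().cpu()
        handles.append(layer.register_forward_hook(hook))
    return activations, handles


# Toy stand-in for a stack of transformer blocks (assumption, not the real model).
torch.manual_seed(0)
model = nn.Sequential(nn.Linear(8, 8), nn.ReLU(), nn.Linear(8, 8))
layers = {"block_0": model[0], "block_1": model[2]}

activations, handles = record_activations(layers)
with torch.no_grad():
    model(torch.randn(2, 8))

for handle in handles:
    handle.remove()  # remove hooks once the activations are captured

# torch.save(activations, path) would persist the dictionary for the next step.
print({name: tuple(t.shape) for name, t in activations.items()})
```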

2. Identify Task-Relevant Layers

Run the contrastive analysis to compute, for each layer, the cosine similarity differences between the activations of the two prompts in a pair. The result indicates which layers are most relevant to the task at hand.

python visualize.py # set MODEL to 'FLUX' or 'SD3' and specify the path to the saved activations

This creates a plot showing the relative cosine similarity for each layer, helping you identify which layers to skip during inference.
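As a rough illustration of the per-layer comparison, the sketch below computes the cosine similarity between the paired activations of each layer and ranks the layers from least to most similar. The exact metric used in visualize.py is not restated here, so treat this as one plausible reading. The activation values and layer names are made up, and a real run would operate on the saved tensors rather than Python lists.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two flat vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical flattened activations per layer for one prompt pair.
acts_text = {
    "layer_0": [1.0, 0.0, 1.0],
    "layer_1": [1.0, 2.0, 0.0],
    "layer_2": [0.0, 1.0, 1.0],
}
acts_notext = {
    "layer_0": [1.0, 0.0, 1.0],
    "layer_1": [2.0, -1.0, 0.0],
    "layer_2": [0.0, 1.0, 0.5],
}

similarities = {
    name: cosine_similarity(acts_text[name], acts_notext[name]) for name in acts_text
}

# Layers where the two prompts diverge most come first.
ranked = sorted(similarities, key=similarities.get)
print(ranked)
```

In this toy data the two prompts produce orthogonal activations at `layer_1`, so it ranks first as the layer where the prompts diverge most.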

3. Apply CSLG during Inference

Using the identified layers, you can now apply CSLG during inference. For comparison, the scripts generate images without any guidance, with standard Classifier-Free Guidance (CFG), and with CSLG for every possible combination of the specified layers.

# FLUX
python FLUX_final_experiments.py

# Stable Diffusion 3
python SD3_final_experiment.py # specify the layers to skip in the script
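Enumerating the layer combinations could be done as in the sketch below, which lists every non-empty subset of a set of candidate layers. The helper `layer_subsets` and the layer indices are hypothetical. Note that the number of subsets grows as 2^n - 1, so this is only practical for a handful of candidate layers.

```python
from itertools import combinations


def layer_subsets(candidate_layers):
    """Return every non-empty subset of the candidate layers, smallest first."""
    subsets = []
    for size in range(1, len(candidate_layers) + 1):
        subsets.extend(combinations(candidate_layers, size))
    return subsets


candidate_layers = [7, 8, 9]  # hypothetical layer indices picked from the plot
for skip_layers in layer_subsets(candidate_layers):
    print(list(skip_layers))
```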

OCR-based Evaluation

To evaluate the effectiveness of CSLG in generating visible, legible text, we use an OCR-based approach. This involves running an OCR model on the generated images and comparing the results with the expected text extracted from the given prompts.

python eval.py 
# set MODEL to 'FLUX' or 'SD3' and specify the path to the generated images, as well as the IMAGE_TYPES to be considered
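The comparison step can be pictured with the sketch below. It assumes that the target text appears in double quotes inside the prompt and scores the OCR output with a character-level similarity ratio from the standard library. Both choices are assumptions for illustration and may differ from what eval.py does. The OCR call itself is replaced by hard-coded strings.

```python
import re
from difflib import SequenceMatcher


def expected_text(prompt):
    """Return the first double-quoted span in the prompt, or an empty string."""
    match = re.search(r'"([^"]+)"', prompt)
    return match.group(1) if match else ""


def normalize(text):
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    return " ".join(re.sub(r"[^a-z0-9 ]", " ", text.lower()).split())


def text_match_score(prompt, ocr_output):
    """Similarity in [0, 1] between the prompt's target text and the OCR output."""
    target = normalize(expected_text(prompt))
    if not target:
        return 0.0
    return SequenceMatcher(None, target, normalize(ocr_output)).ratio()


prompt = 'A street sign that says "Open Late"'
print(text_match_score(prompt, "OPEN LATE"))  # exact match after normalization
print(text_match_score(prompt, "0PEN LATE"))  # one misread character lowers the score
```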
