Ruiyang Zhang, Hu Zhang, Hao Fei, Zhedong Zheng*
- 2025.3.11: 🐣 Source code of Uncertainty-o is released!
- ✏️ Method
- 🛠️ Install
- 💻 Dependency
- 📚 Data Preparation
- 📈 Run
- 🏄 Examples
- ⌨️ Code Structure
- ✨ Acknowledgement
- 📎 Citation
Pipeline of Our Uncertainty-o. Given a multimodal prompt and large multimodal models, we perform multimodal prompt perturbation to generate diverse responses. Due to the inherent epistemic uncertainty of these models under perturbation, varied responses are typically obtained. To quantify this uncertainty, we apply semantic clustering on the collected responses and compute their entropy. Specifically, responses are grouped into semantically similar clusters, and the entropy across these clusters is calculated as the final uncertainty measure. Higher entropy indicates greater variability in responses, suggesting lower confidence, while lower entropy reflects higher consistency and thus higher confidence.
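The clustering-and-entropy step described above can be sketched as follows. This is a minimal illustration with a toy string-match equivalence; the real pipeline (see `uncertainty/`) performs model-based semantic matching over perturbed responses, so names here are illustrative only:

```python
import math

def cluster_responses(responses, are_equivalent):
    """Greedy semantic clustering: each response joins the first
    cluster whose representative it is equivalent to, else it
    starts a new cluster."""
    clusters = []
    for r in responses:
        for c in clusters:
            if are_equivalent(c[0], r):
                c.append(r)
                break
        else:
            clusters.append([r])
    return clusters

def semantic_entropy(responses, are_equivalent):
    """Entropy over cluster frequencies: higher value means more
    diverse responses, i.e. higher epistemic uncertainty."""
    clusters = cluster_responses(responses, are_equivalent)
    n = len(responses)
    probs = [len(c) / n for c in clusters]
    return -sum(p * math.log(p) for p in probs)

# Toy equivalence: normalized exact match (a real system would use
# an NLI model or LLM judge to compare meanings).
equiv = lambda a, b: a.strip().lower().rstrip(".") == b.strip().lower().rstrip(".")

responses = ["A cat.", "a cat", "A dog.", "a cat"]
# Two clusters of sizes 3 and 1 -> entropy = -(0.75*ln0.75 + 0.25*ln0.25) ~= 0.5623
print(round(semantic_entropy(responses, equiv), 4))
```

Consistent responses collapse into one cluster (entropy 0, high confidence), while disagreeing responses spread across clusters and push the entropy up.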
- Create conda environment.
conda create -n Uncertainty-o python=3.11;
conda activate Uncertainty-o;
- Install dependency.
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121;
pip install transformers datasets flash-attn accelerate timm numpy sentencepiece protobuf qwen_vl_utils;
(Tested on NVIDIA H100, NVIDIA A100)
Refer to Dependency.md.
Refer to Data.md.
- For Comprehension Hallucination Detection, Hallucination Detection for Closed-Source LMMs, and Hallucination Detection for Safety-Critical Tasks
bash run/comprehension_hallucination_detection.sh;
- For Generation Hallucination Detection
bash run/generation_hallucination_detection.sh;
- For Hallucination Mitigation
bash run/hallucination_mitigation.sh;
- For Uncertainty-Aware Chain-of-Thought
bash run/uncertainty_aware_cot.sh;
- Uncertainty-o successfully detects both comprehension and generation hallucinations:
- Code structure of this repository is as follows:
├── Uncertainty-o/
│ ├── .asset/
│ ├── args/ # Args parser
│ ├── benchmark/
│ │ ├── comprehension/ # Benchmark for comprehension task
│ │ ├── generation/ # Benchmark for generation task
│ ├── dependency/ # Downstream source code for LMMs
│ ├── factory/ # Builder for benchmarks, models
│ ├── llm/
│ │ ├── Qwen.py # LLM class
│ ├── metric/ # Metric for hallucination detection
│ ├── mllm/
│ │ ├── comprehension/ # LMM for comprehension task
│ │ ├── generation/ # LMM for generation task
│ ├── perturbation/ # Multimodal prompt perturbation
│ ├── run/ # Experiment scripts
│ ├── uncertainty/ # Multimodal semantic uncertainty
│ ├── util/
│ ├── .gitignore
│ ├── hallucination_detection.py
│ ├── hallucination_mitigation.py
│ ├── README.md
│ ├── uncertainty_aware_cot.py
- AnyGPT, OneLLM, InternVL, PointLLM: Thanks a lot for these foundational efforts!
- semantic_uncertainty: We are greatly inspired by this work!
- VL-Uncertainty: We build our codebase upon this work!
If you find our work useful for your research and application, please cite using this BibTeX:
@article{zhang2025uncertainty,
title={Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models},
author={Zhang, Ruiyang and Zhang, Hu and Fei, Hao and Zheng, Zhedong},
year={2025}
}