AEGIS: Automated Error Generation and Attribution for Multi-Agent Systems

🎯 Overview

AEGIS Framework

AEGIS is a large-scale dataset and benchmark for detecting errors in Multi-Agent Systems (MAS). It provides systematically generated failure scenarios with verifiable ground-truth labels across multiple MAS frameworks, enabling development and evaluation of robust error detection methods.

🏆 Key Results

Main Results

🏗️ Architecture

AEGIS/
├── aegis_core/           # Core AEGIS framework
│   ├── malicious_factory/    # Error injection system
│   ├── agent_systems/        # MAS wrapper interfaces
│   ├── utils/               # Utility functions
│   └── core/                # Core detection modules
├── mas_frameworks/       # Multi-Agent System implementations
│   ├── agentverse/          # AgentVerse framework
│   ├── dylan/               # DyLAN framework
│   ├── llm_debate/          # LLM Debate framework
│   ├── macnet/              # MacNet framework
│   └── ...                  # Other MAS frameworks
├── magnetic_one/         # Magnetic-One specific integration
├── configs/              # Configuration files
├── examples/             # Usage examples
├── evaluation/           # Evaluation utilities
└── scripts/              # Helper scripts

🚀 Quick Start

1. Installation

# Clone the repository
git clone <repository-url>
cd AEGIS

# Install dependencies
pip install -r requirements.txt

2. Configuration

Copy and modify the configuration files:

cp configs/model_api_config.json configs/your_config.json
# Edit your_config.json with your API keys and settings

3. Basic Usage

from aegis_core.malicious_factory import MaliciousFactory
from aegis_core.agent_systems import BaseMASWrapper

# Initialize AEGIS
factory = MaliciousFactory()

# Load a MAS framework
mas_wrapper = BaseMASWrapper("dylan", config_path="configs/config_main.yaml")

# Inject errors and generate dataset
results = factory.generate_error_dataset(
    mas_wrapper=mas_wrapper,
    tasks=["task1", "task2"],
    error_modes=["FM-1.1", "FM-2.3"],
    num_samples=100
)

🧪 Core Components

Error Injection Factory

The malicious factory provides sophisticated error injection capabilities:

Prompt Injection: Modifies agent inputs to trigger specific errors
Response Corruption: Directly manipulates agent outputs
Context-Aware Attacks: Adapts injection strategies based on task domain and agent role

Multi-Agent System Wrappers

Standardized interfaces for various MAS frameworks:

DyLAN: Dynamic graph-based agent interactions
AgentVerse: Hierarchical role-based collaboration
LLM Debate: Multi-agent debate mechanisms
MacNet: Configurable network topologies
Magnetic-One: Orchestrator-executor patterns

Error Taxonomy

AEGIS supports 14 failure modes based on the MAST taxonomy:

Specification Issues (FM-1.x)
- Task specification deviation
- Role specification deviation
- Add redundant steps
- Remove conversation history
- Remove termination conditions
Inter-Agent Misalignment (FM-2.x)
- Repeat handled tasks
- Make request ambiguous
- Deviate from main goal
- Hide important information
- Ignore other agents
- Inconsistent reasoning
Task Verification Failures (FM-3.x)
- Premature termination
- Remove verification steps
- Incorrect verification

📊 Evaluation

Run evaluations using the provided scripts:

# Evaluate on AEGIS-Bench
python evaluation/evaluate.py --dataset aegis_bench --model your_model

# Evaluate on Who&When benchmark
python evaluation/evaluate.py --dataset whowhen --model your_model

🔧 Configuration

⚠️ Security Notice

IMPORTANT: This repository does NOT contain any API keys or credentials. You must configure your own API keys before use.

1. Environment Setup

First, copy the environment template and configure your credentials:

# Copy environment template
cp .env.template .env

# Edit .env file with your actual API keys
nano .env  # or use your preferred editor

Required environment variables (add to .env file):

# OpenAI Configuration
OPENAI_API_KEY=your_openai_api_key_here
OPENAI_BASE_URL=https://api.openai.com/v1

# Google/Gemini Configuration  
GOOGLE_API_KEY=your_google_api_key_here

# Azure OpenAI (if using Azure)
AZURE_OPENAI_API_KEY=your_azure_api_key_here
AZURE_OPENAI_ENDPOINT=your_azure_endpoint_here

# Anthropic/Claude (if using Claude)
ANTHROPIC_API_KEY=your_anthropic_api_key_here

# Dataset Paths (optional, defaults provided)
GAIA_VALIDATION_DIR=./data/gaia/validation
GAIA_TEST_DIR=./data/gaia/test

2. Model API Configuration

Copy and configure the model API template:

# Copy model configuration template
cp configs/model_api_config.json.template configs/model_api_config.json

# Edit with your actual endpoints and keys
nano configs/model_api_config.json

Example configuration:

{
  "gpt-4o-mini": {
    "model_list": [
      {
        "model_name": "gpt-4o-mini-2024-07-18", 
        "model_url": "https://api.openai.com/v1", 
        "api_key": "YOUR_OPENAI_API_KEY"
      }
    ],
    "max_workers_per_model": 20
  },
  "gemini-2.0-flash": {
    "model_list": [
      {
        "model_name": "gemini-2.0-flash", 
        "model_url": "https://generativelanguage.googleapis.com/v1beta/openai/", 
        "api_key": "YOUR_GEMINI_API_KEY"
      }
    ],
    "max_workers_per_model": 1
  }
}

3. Obtaining API Keys

OpenAI: Visit OpenAI API Keys
Google/Gemini: Visit Google AI Studio
Anthropic/Claude: Visit Anthropic Console
Azure OpenAI: Visit Azure OpenAI Service

4. Security Best Practices

✅ Never commit .env or configs/model_api_config.json files
✅ Use environment variables instead of hardcoding keys
✅ Rotate API keys regularly
✅ Set usage limits on your API accounts
✅ Monitor API usage to detect unauthorized access

MAS Framework Configuration

Each MAS framework has its own configuration file in configs/:

config_main.yaml: General configuration
config_humaneval.yaml: Code generation tasks
config_math.yaml: Mathematical reasoning tasks

🎯 Examples

See the examples/ directory for detailed usage examples:

basic_error_injection.py: Simple error injection workflow
multi_framework_evaluation.py: Comparing different MAS frameworks
custom_error_modes.py: Implementing custom error types

📚 Citation

If you use AEGIS in your research, please cite:

@article{kong2025aegis,
  title={AEGIS: Automated Error Generation and Attribution for Multi-Agent Systems},
  author={Kong, Fanqi and Zhang, Ruijie and Yin, Huaxiao and Zhang, Guibin and Zhang, Xiaofei and Chen, Ziang and Zhang, Zhaowei and Zhang, Xiaoyuan and Zhu, Song-Chun and Feng, Xue},
  journal={arXiv preprint arXiv:2509.14295},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
aegis_core		aegis_core
assests		assests
configs		configs
data_processing		data_processing
evaluation		evaluation
examples		examples
magnetic_one		magnetic_one
mas_frameworks		mas_frameworks
smolagents		smolagents
verl		verl
.env.template		.env.template
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
inference.py		inference.py
main.py		main.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AEGIS: Automated Error Generation and Attribution for Multi-Agent Systems

🎯 Overview

🏆 Key Results

🏗️ Architecture

🚀 Quick Start

1. Installation

2. Configuration

3. Basic Usage

🧪 Core Components

Error Injection Factory

Multi-Agent System Wrappers

Error Taxonomy

📊 Evaluation

🔧 Configuration

⚠️ Security Notice

1. Environment Setup

2. Model API Configuration

3. Obtaining API Keys

4. Security Best Practices

MAS Framework Configuration

🎯 Examples

📚 Citation

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

kfq20/AEGIS

Folders and files

Latest commit

History

Repository files navigation

AEGIS: Automated Error Generation and Attribution for Multi-Agent Systems

🎯 Overview

🏆 Key Results

🏗️ Architecture

🚀 Quick Start

1. Installation

2. Configuration

3. Basic Usage

🧪 Core Components

Error Injection Factory

Multi-Agent System Wrappers

Error Taxonomy

📊 Evaluation

🔧 Configuration

⚠️ Security Notice

1. Environment Setup

2. Model API Configuration

3. Obtaining API Keys

4. Security Best Practices

MAS Framework Configuration

🎯 Examples

📚 Citation

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages