Skip to content

arena-ai/arena

Folders and files

NameName
Last commit message
Last commit date
May 13, 2024
Apr 4, 2024
May 2, 2024
Jan 6, 2025
May 23, 2024
Jan 8, 2025
Sep 27, 2024
Dec 17, 2024
Oct 16, 2024
Oct 14, 2024
May 22, 2024
Mar 20, 2024
Oct 14, 2024
Sep 20, 2024
Sep 20, 2024
May 22, 2024
Oct 2, 2024
Aug 29, 2024
Oct 14, 2024

Repository files navigation

Arena Logo

Arena Frontend

Arena Backend

Python Client

X (formerly Twitter) Follow

Sarus Arena Framework

If you use public AI services such as OpenAI, Anthropic or Mistral, Sarus Arena is an agent you can easily deploy in your infrastructure to do:

  • LLM evaluation: AB-testing, user-feedback evaluation, formula-based evaluation and LLM as a Judge
  • LLM compliance: Request and response filtering and redacting (PII removal, guardrailing), evaluation-based routing
  • LLM distillation: Train your own model based on the best evaluated responses

Installation

A test instance is hosted by Sarus: arena.sarus.app.

You can deploy your own instance using the provided helm arena chart and following the deployment instructions.

A document describing the installation process is available there.

Dev environment

To start the test environment, run:

docker compose --profile "*" up

Docker compose uses compose.yml and overrides it with compose.override.yml.