Universal LLM Provider Runtime

One interface for every model. Authentication, routing, streaming, retries, caching — handled.

Quick Start · Features · Providers · Usage · Architecture · Contributing

What is eyrie

eyrie is the LLM provider runtime that powers the hawk coding agent. It handles everything between your application and LLM APIs — authentication, model resolution, streaming, retries, rate limiting, and caching.

When your app calls a model, eyrie figures out which provider to use, how to talk to it, and how to stream the response back. Switch from Anthropic to Ollama? eyrie handles the translation. API returns 529? eyrie retries with backoff. Response hits max_tokens? eyrie continues automatically.

Your app never talks to an LLM API directly. eyrie does.

Quick Start

go get github.com/GrayCodeAI/eyrie

Requires Go 1.26+. Zero external dependencies.

import "github.com/GrayCodeAI/eyrie/client"

// Create a client — provider auto-detected from environment
c := client.NewEyrieClient(&client.EyrieConfig{
    Provider: client.DetectProvider(),
})

// Stream a response
sr, err := c.StreamChat(ctx, messages, client.ChatOptions{
    Model: "claude-sonnet-4-6",
})
if err != nil {
    return err // check before deferring Close: sr may be nil on error
}
defer sr.Close()

for evt := range sr.Events {
    switch evt.Type {
    case "content":   // stream text
    case "tool_call": // execute tool
    case "done":      // response complete
    }
}

Features

Provider Routing

Automatically detects and routes to the right provider based on environment variables, config files, or explicit selection.

Model Resolution

Maps abstract tiers (opus/sonnet/haiku) to concrete model IDs per provider. Ships with an embedded catalog of pricing, context windows, and capabilities.

Streaming

Parses SSE for Anthropic and OpenAI formats — text, tool calls, and thinking blocks.
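To illustrate what parsing SSE involves, here is a minimal sketch of a server-sent events reader. It is not eyrie's parser: `sseEvent`, `parseSSE`, and `parseSSEString` are hypothetical names, and the logic is a simplified reading of the SSE wire format (an `event:` line, one or more `data:` lines, a blank line to dispatch).

```go
package main

import (
	"bufio"
	"fmt"
	"io"
	"strings"
)

// sseEvent is a hypothetical container for one dispatched event.
type sseEvent struct {
	Name string // value of the "event:" field, empty if absent
	Data string // "data:" lines joined with newlines
}

// parseSSE reads the whole stream and returns the events it contains.
// An event is dispatched when a blank line follows at least one data line.
func parseSSE(r io.Reader) ([]sseEvent, error) {
	var events []sseEvent
	var name string
	var data []string

	sc := bufio.NewScanner(r) // default limit: 64 KiB per line
	for sc.Scan() {
		line := sc.Text()
		switch {
		case line == "":
			if len(data) > 0 {
				events = append(events, sseEvent{Name: name, Data: strings.Join(data, "\n")})
			}
			name, data = "", nil
		case strings.HasPrefix(line, ":"):
			// Comment line (often used as a keep-alive); ignore it.
		case strings.HasPrefix(line, "event:"):
			name = strings.TrimPrefix(strings.TrimPrefix(line, "event:"), " ")
		case strings.HasPrefix(line, "data:"):
			data = append(data, strings.TrimPrefix(strings.TrimPrefix(line, "data:"), " "))
		}
	}
	return events, sc.Err()
}

// parseSSEString is a convenience wrapper for in-memory input.
func parseSSEString(s string) ([]sseEvent, error) {
	return parseSSE(strings.NewReader(s))
}

func main() {
	stream := "event: content_block_delta\ndata: {\"text\":\"hi\"}\n\n: ping\ndata: [DONE]\n\n"
	events, err := parseSSEString(stream)
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	for _, e := range events {
		fmt.Printf("event=%q data=%q\n", e.Name, e.Data)
	}
}
```

A production parser would process events incrementally rather than collecting them in a slice, and would decode each `data` payload according to the provider's JSON schema.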

Reliability

Retries on 429/500/529 with exponential backoff and Retry-After support
Auto-continuation when stop_reason == max_tokens
Provider fallback chains for high availability
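The retry behavior listed above can be sketched roughly as follows. This is an illustration of the technique, not eyrie's middleware: `shouldRetry` and `retryDelay` are hypothetical helpers, the base and maximum delays are arbitrary, only the whole-seconds form of `Retry-After` is handled, and no jitter is applied.

```go
package main

import (
	"fmt"
	"strconv"
	"time"
)

// shouldRetry reports whether a status code is in the retryable set
// named above: 429, 500, and 529.
func shouldRetry(status int) bool {
	return status == 429 || status == 500 || status == 529
}

// retryDelay returns how long to wait before the given retry attempt
// (0-based). A Retry-After value in whole seconds takes precedence over
// the computed backoff; the HTTP-date form is ignored in this sketch.
func retryDelay(attempt int, retryAfter string, base, maxDelay time.Duration) time.Duration {
	if secs, err := strconv.Atoi(retryAfter); err == nil && secs >= 0 {
		return time.Duration(secs) * time.Second
	}
	d := base
	for i := 0; i < attempt && d < maxDelay; i++ {
		d *= 2 // double the delay on every attempt
	}
	if d > maxDelay {
		d = maxDelay
	}
	return d
}

func main() {
	for attempt := 0; attempt < 5; attempt++ {
		fmt.Println(attempt, retryDelay(attempt, "", 500*time.Millisecond, 4*time.Second))
	}
	fmt.Println("retry 529?", shouldRetry(529))
}
```

Capping the delay keeps a long outage from producing unbounded waits; real clients usually also add random jitter so that many callers do not retry in lockstep.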

Rate Limiting

Token bucket rate limiter per provider, which throttles outgoing requests before they reach the provider's API limits.
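For readers unfamiliar with the technique, a token bucket can be sketched in a few lines. The type and method names below are hypothetical and do not reflect eyrie's internals; the capacity and refill rate are arbitrary, and time is passed in as an offset from the limiter's creation so the logic stays deterministic.

```go
package main

import (
	"fmt"
	"time"
)

// tokenBucket holds up to capacity tokens and refills at refillPerSec.
// It is not safe for concurrent use; guard it with a mutex if shared.
type tokenBucket struct {
	capacity     float64
	tokens       float64
	refillPerSec float64
	last         time.Duration // offset of the most recent refill
}

func newTokenBucket(capacity, refillPerSec float64) *tokenBucket {
	return &tokenBucket{capacity: capacity, tokens: capacity, refillPerSec: refillPerSec}
}

// allow refills the bucket for the time elapsed since the last call, then
// spends one token if one is available. now is the time since the limiter
// was created.
func (b *tokenBucket) allow(now time.Duration) bool {
	if elapsed := (now - b.last).Seconds(); elapsed > 0 {
		b.tokens += elapsed * b.refillPerSec
		if b.tokens > b.capacity {
			b.tokens = b.capacity
		}
		b.last = now
	}
	if b.tokens >= 1 {
		b.tokens--
		return true
	}
	return false
}

func main() {
	start := time.Now()
	// One bucket per provider; the provider key is illustrative.
	limiters := map[string]*tokenBucket{
		"anthropic": newTokenBucket(5, 1),
	}
	for i := 0; i < 7; i++ {
		fmt.Println(i, limiters["anthropic"].allow(time.Since(start)))
	}
}
```

The capacity sets the burst size and the refill rate sets the sustained request rate, so the two can be tuned independently for each provider.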

Caching

Response caching with configurable TTL
Semantic similarity caching for repeated prompts
Anthropic prompt caching breakpoints on system prompt and conversation prefix
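As an illustration of the first item, a response cache with a TTL might look like the sketch below. It is an assumption about the general technique, not eyrie's cache: `ttlCache`, `cacheKey`, and the choice of hashing the model name together with the prompt are all hypothetical.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
	"sync"
	"time"
)

// entry is one cached response and the moment it stops being valid.
type entry struct {
	response  string
	expiresAt time.Time
}

// ttlCache is a mutex-guarded in-memory cache with a single TTL.
type ttlCache struct {
	mu      sync.Mutex
	ttl     time.Duration
	entries map[string]entry
}

func newTTLCache(ttl time.Duration) *ttlCache {
	return &ttlCache{ttl: ttl, entries: make(map[string]entry)}
}

// cacheKey hashes the model and prompt so keys have a fixed size.
func cacheKey(model, prompt string) string {
	sum := sha256.Sum256([]byte(model + "\x00" + prompt))
	return hex.EncodeToString(sum[:])
}

func (c *ttlCache) put(model, prompt, response string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.entries[cacheKey(model, prompt)] = entry{response: response, expiresAt: time.Now().Add(c.ttl)}
}

// get returns the cached response, evicting it if the TTL has passed.
func (c *ttlCache) get(model, prompt string) (string, bool) {
	c.mu.Lock()
	defer c.mu.Unlock()
	key := cacheKey(model, prompt)
	e, ok := c.entries[key]
	if !ok {
		return "", false
	}
	if !time.Now().Before(e.expiresAt) {
		delete(c.entries, key)
		return "", false
	}
	return e.response, true
}

func main() {
	cache := newTTLCache(5 * time.Minute)
	cache.put("claude-sonnet-4-6", "What is a token bucket?", "A rate limiting algorithm.")
	resp, hit := cache.get("claude-sonnet-4-6", "What is a token bucket?")
	fmt.Println(hit, resp)
}
```

An exact-match cache like this only helps when the model and prompt are byte-for-byte identical; semantic similarity caching relaxes that by comparing embeddings instead of hashes.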

Cost Tracking

Built-in cost estimation per call, with per-provider pricing from the embedded model catalog.
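Per-call cost estimation typically reduces to multiplying token counts by per-million-token prices. The sketch below shows that arithmetic under stated assumptions: `pricing` and `estimateCost` are hypothetical names, and the prices are placeholders, not values from eyrie's catalog or from any provider.

```go
package main

import "fmt"

// pricing holds prices in US dollars per million tokens.
type pricing struct {
	InputPerMTok  float64
	OutputPerMTok float64
}

// estimateCost returns the dollar cost of one call given its token usage.
func estimateCost(p pricing, inputTokens, outputTokens int) float64 {
	const perMillion = 1_000_000.0
	return float64(inputTokens)/perMillion*p.InputPerMTok +
		float64(outputTokens)/perMillion*p.OutputPerMTok
}

func main() {
	// Placeholder prices for a made-up model; look up real values in
	// the model catalog.
	example := pricing{InputPerMTok: 3, OutputPerMTok: 15}
	cost := estimateCost(example, 12_000, 800)
	fmt.Printf("estimated cost: $%.4f\n", cost)
}
```

Input and output tokens are priced separately because providers usually charge more for generated tokens than for prompt tokens.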

Supported Providers

Provider	Env Variable	Notes
Anthropic	`ANTHROPIC_API_KEY`	Default · thinking, caching
OpenAI	`OPENAI_API_KEY`	Full tool use + reasoning
OpenRouter	`OPENROUTER_API_KEY`	200+ models via one key
Grok (xAI)	`XAI_API_KEY`
Gemini	`GEMINI_API_KEY`
CanopyWave	`CANOPYWAVE_API_KEY`
Ollama	`OLLAMA_BASE_URL`	Local models, no key needed
OpenCodeGo	`OPENCODEGO_API_KEY`

Providers are detected automatically in the order listed above.
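Ordered detection of this kind can be sketched as a walk over the table, returning the first provider whose environment variable is set. This is an illustration only: `detectProvider` here is a hypothetical standalone function (not `client.DetectProvider`), and the lowercase provider identifiers are assumed.

```go
package main

import (
	"fmt"
	"os"
)

// providerEnv pairs a provider with the environment variable that
// signals it is configured.
type providerEnv struct {
	Name   string
	EnvVar string
}

// detectionOrder mirrors the row order of the table above.
var detectionOrder = []providerEnv{
	{"anthropic", "ANTHROPIC_API_KEY"},
	{"openai", "OPENAI_API_KEY"},
	{"openrouter", "OPENROUTER_API_KEY"},
	{"grok", "XAI_API_KEY"},
	{"gemini", "GEMINI_API_KEY"},
	{"canopywave", "CANOPYWAVE_API_KEY"},
	{"ollama", "OLLAMA_BASE_URL"},
	{"opencodego", "OPENCODEGO_API_KEY"},
}

// detectProvider returns the first provider whose variable is non-empty.
// lookup is injected so the function can be exercised without touching
// the real environment; pass os.Getenv in normal use.
func detectProvider(lookup func(string) string) (string, bool) {
	for _, p := range detectionOrder {
		if lookup(p.EnvVar) != "" {
			return p.Name, true
		}
	}
	return "", false
}

func main() {
	if name, ok := detectProvider(os.Getenv); ok {
		fmt.Println("detected provider:", name)
	} else {
		fmt.Println("no provider configured")
	}
}
```

With both `OPENAI_API_KEY` and `OLLAMA_BASE_URL` set, this sketch picks OpenAI because it appears earlier in the list.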

Usage

Basic Chat

resp, err := c.Chat(ctx, messages, client.ChatOptions{
    Model: "gpt-4o",
})

Streaming with Continuation

// Auto-continues when max_tokens is hit
resp, err := client.ChatWithContinuation(ctx, provider, messages,
    client.ChatOptions{Model: model},
    client.DefaultContinuationConfig(),
)
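Conceptually, continuation is a loop that keeps requesting more output while the stop reason is `max_tokens`. The sketch below shows that loop in isolation; `reply`, `chatFunc`, and the lowercase `chatWithContinuation` are hypothetical stand-ins and do not reproduce the signature of `client.ChatWithContinuation`.

```go
package main

import (
	"fmt"
	"strings"
)

// reply is a hypothetical minimal response: text plus the stop reason.
type reply struct {
	Text       string
	StopReason string
}

// chatFunc performs one model call. soFar is the text accumulated from
// earlier calls, which a real implementation would send back as context.
type chatFunc func(soFar string) (reply, error)

// chatWithContinuation calls chat until the model stops for a reason
// other than "max_tokens" or the continuation budget is exhausted.
func chatWithContinuation(chat chatFunc, maxContinuations int) (string, error) {
	var sb strings.Builder
	for i := 0; i <= maxContinuations; i++ {
		r, err := chat(sb.String())
		if err != nil {
			return sb.String(), err // return partial text with the error
		}
		sb.WriteString(r.Text)
		if r.StopReason != "max_tokens" {
			break
		}
	}
	return sb.String(), nil
}

func main() {
	// A fake model that needs two calls to finish its answer.
	parts := []reply{{"Hello, ", "max_tokens"}, {"world", "end_turn"}}
	call := 0
	out, err := chatWithContinuation(func(string) (reply, error) {
		r := parts[call]
		call++
		return r, nil
	}, 3)
	fmt.Println(out, err)
}
```

Bounding the number of continuations matters: without a cap, a model that always hits its token limit would loop indefinitely and accumulate cost.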

Mock Provider for Testing

mock := client.NewMockProvider(client.MockModeFixed)
mock.Response = "Here is the code you asked for..."

resp, _ := mock.Chat(ctx, messages, opts)
// No real API calls — perfect for tests

Model Catalog

cat := catalog.DefaultModelCatalog()

// Get the best model for a tier
model := catalog.GetPreferredProviderModel("anthropic", catalog.TierSonnet, &cat)
// → "claude-sonnet-4-6"

// Check deprecation warnings
warn := catalog.GetModelDeprecationWarning("claude-3-7-sonnet", "anthropic")

Provider Configuration

cfg := config.LoadProviderConfig("")             // load from disk
config.ApplyProviderConfigToEnv(cfg, false, nil) // apply to environment
config.SaveProviderConfig(cfg, "")               // save changes

Architecture

eyrie/
├── cmd/eyrie/              # CLI binary
├── internal/
│   ├── client/             # Provider implementations (51 files)
│   │   ├── providers/      # Anthropic, OpenAI, Azure, Vertex, etc.
│   │   ├── middleware/     # Retry, rate limit, cache, fallback
│   │   └── metrics/        # Cost, call, analytics
│   ├── server/             # HTTP API and gateway
│   ├── config/             # Provider configuration & routing
│   ├── catalog/            # Model catalog & tier system
│   ├── registry/           # Runtime manifest & routing policies
│   ├── routing/            # Weighted provider router
│   ├── storage/            # SQLite conversation DAG store
│   ├── conversation/       # Conversation engine with branching
│   ├── observability/      # OpenTelemetry spans & metrics
│   ├── health/             # Provider health checker
│   ├── cache/              # Response cache warmer
│   ├── types/              # Branded types & API errors
│   ├── errors/             # Error message constants
│   ├── constants/          # API limits
│   ├── utils/              # Error utilities
│   └── sdk/                # Go, Python, TypeScript client SDKs
└── assets/                 # Logo and branding

Ecosystem

eyrie is part of the hawk-eco ecosystem:

Component	Repository	Purpose
hawk	GrayCodeAI/hawk	AI coding agent
eyrie	This repo	LLM provider runtime
tok	GrayCodeAI/tok	Tokenizer & compression
yaad	GrayCodeAI/yaad	Graph-based memory
trace	GrayCodeAI/trace	Session capture

Development

Prerequisites

Go 1.26+

Build & Test

go build ./cmd/eyrie          # Build binary
go test -race ./...           # Run all tests with race detector
make ci                       # Run full CI suite (lint, test, security)
make cover                    # Generate coverage report

Contributing

We welcome contributions! Please see CONTRIBUTING.md for development setup, commit conventions, and the PR process.

Quick start:

Fork and create a branch: git checkout -b feat/short-description
Make changes in small, focused commits
Run make ci locally
Open a pull request

Use Conventional Commits for commit messages — release-please uses them for versioning.

License

MIT — see LICENSE for details.
