feat: support custom LLM endpoints for local AI servers by max-baeumler · Pull Request #54 · knostic/OpenAnt

max-baeumler · 2026-05-06T11:06:03Z

Add configurable base_url, model names, SSL verification, and extended timeouts so OpenAnt can run against any Anthropic-compatible API server (llama-swap, llama-server, vLLM, LM Studio, etc.) instead of requiring Anthropic's cloud API.

Changes:

Config (Go CLI):

Add base_url, opus_model, sonnet_model, verify_ssl to config.json
New config keys: base-url, opus-model, sonnet-model, verify-ssl
Pass settings to Python core via environment variables
Skip Anthropic API key validation when custom base_url is set

LLM client (Python core):

New utilities/config.py: resolve_model(), create_anthropic_client(), extract_text() helpers
All 8 direct anthropic.Anthropic() calls routed through factory
All 15+ hardcoded Claude model strings replaced with resolve_model()
Extended connect timeout (300s) for model cold-start / swap
Configurable SSL verification for self-signed certs
extract_text() handles thinking/reasoning model responses (ThinkingBlock before TextBlock)

Usage:

openant config set base-url # http://localhost:8080
openant config set api-key # any non-empty value
openant config set opus-model # e.g. qwen3:32b
openant config set sonnet-model # e.g. qwen3:8b
openant config set verify-ssl # false (for self-signed certs)
openant scan /path/to/repo

Fully backwards compatible — defaults to Anthropic cloud API with original Claude model names when no custom endpoint is configured.

Add configurable base_url, model names, SSL verification, and extended timeouts so OpenAnt can run against any Anthropic-compatible API server (llama-swap, llama-server, vLLM, LM Studio, etc.) instead of requiring Anthropic's cloud API. Changes: Config (Go CLI): - Add base_url, opus_model, sonnet_model, verify_ssl to config.json - New config keys: base-url, opus-model, sonnet-model, verify-ssl - Pass settings to Python core via environment variables - Skip Anthropic API key validation when custom base_url is set LLM client (Python core): - New utilities/config.py: resolve_model(), create_anthropic_client(), extract_text() helpers - All 8 direct anthropic.Anthropic() calls routed through factory - All 15+ hardcoded Claude model strings replaced with resolve_model() - Extended connect timeout (300s) for model cold-start / swap - Configurable SSL verification for self-signed certs - extract_text() handles thinking/reasoning model responses (ThinkingBlock before TextBlock) Usage: openant config set base-url # http://localhost:8080 openant config set api-key # any non-empty value openant config set opus-model # e.g. qwen3:32b openant config set sonnet-model # e.g. qwen3:8b openant config set verify-ssl # false (for self-signed certs) openant scan /path/to/repo Fully backwards compatible — defaults to Anthropic cloud API with original Claude model names when no custom endpoint is configured.

max-baeumler requested review from ar7casper, dgeyshis, shahar-davidson, sounil and yotamleo as code owners May 6, 2026 11:06

Merge branch 'knostic:master' into master

569e738

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support custom LLM endpoints for local AI servers#54

feat: support custom LLM endpoints for local AI servers#54
max-baeumler wants to merge 2 commits intoknostic:masterfrom
schutzpunkt:master

max-baeumler commented May 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

max-baeumler commented May 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant