Add OpenWebUI + LMStudio integration (CORS, docker compose, env-var host) #10
Draft
offbyonebit wants to merge 1 commit into
…ding
- server.py: add CORSMiddleware (allow_origins=["*"]) so browser-based clients like Open WebUI can call arc-llama from any origin
- server.py: populate `created` from file mtime instead of a hard-coded 0, so model pickers sort and display timestamps correctly
- cli.py: `arc-llama serve` now reads the ARC_LLAMA_HOST / ARC_LLAMA_PORT env vars, making it container-friendly without editing config.toml
- docker-compose.yml: new file wiring arc-llama + Open WebUI together; `docker compose up` gives a working chat UI on http://localhost:3000
- README: add Open WebUI and LMStudio sections with bare-metal and compose quick-starts, including the LMStudio upstream pattern

https://claude.ai/code/session_016pwPb8iA8Ch62118zCNfyS
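For reviewers, the env-var override described for `arc-llama serve` might look roughly like the sketch below. Only the `ARC_LLAMA_HOST` / `ARC_LLAMA_PORT` names and the `127.0.0.1:11437` address come from this PR; the `resolve_bind` helper, its parameters, and the rule that env vars take precedence over config values are assumptions made for illustration, not the actual code in cli.py.

```python
import os

# Assumed defaults, taken from the address used in the PR's test plan.
DEFAULT_HOST = "127.0.0.1"
DEFAULT_PORT = 11437


def resolve_bind(config_host=DEFAULT_HOST, config_port=DEFAULT_PORT, env=None):
    """Return (host, port), letting ARC_LLAMA_HOST / ARC_LLAMA_PORT override config.

    `resolve_bind` is a hypothetical helper name, not part of arc-llama.
    """
    env = os.environ if env is None else env

    # An unset or empty variable falls back to the configured value.
    host = env.get("ARC_LLAMA_HOST") or config_host

    raw_port = env.get("ARC_LLAMA_PORT")
    if not raw_port:
        return host, config_port

    try:
        port = int(raw_port)
    except ValueError:
        raise ValueError(
            f"ARC_LLAMA_PORT must be an integer, got {raw_port!r}"
        ) from None
    if not 0 < port < 65536:
        raise ValueError(f"ARC_LLAMA_PORT out of range: {port}")
    return host, port


if __name__ == "__main__":
    # Simulates what a container would set to bind to all interfaces.
    print(resolve_bind(env={"ARC_LLAMA_HOST": "0.0.0.0"}))
```

Passing `env` explicitly keeps the helper easy to test without mutating the real process environment.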
Summary
- CORS middleware in `server.py`: without it, OpenWebUI (running in a browser) is blocked by the same-origin policy when calling arc-llama from a different port
- The `created` field in `/v1/models` now uses the GGUF file's mtime instead of `0`, so model pickers in OpenWebUI sort and display timestamps correctly
- `ARC_LLAMA_HOST` / `ARC_LLAMA_PORT` env vars on `arc-llama serve`: the container needs to bind to `0.0.0.0`, which is now possible without editing `config.toml`
- `docker-compose.yml`: new file; `docker compose up` brings up arc-llama + Open WebUI together, with GPU device passthrough and a shared network; chat UI at `http://localhost:3000`

Test plan
- `docker compose up` starts both services; Open WebUI at `:3000` shows arc-llama models in the model picker
- `arc-llama serve`, then configure Open WebUI with base URL `http://127.0.0.1:11437/v1`; models appear
- `GET /v1/models` returns `created` as a non-zero Unix timestamp for registered GGUFs
- A request to `http://127.0.0.1:11437/v1/models` from a different origin succeeds (CORS headers present)
- `ARC_LLAMA_HOST=0.0.0.0 arc-llama serve` binds to all interfaces
- `arc-llama upstream add lmstudio http://127.0.0.1:1234` merges LMStudio models into the model list

https://claude.ai/code/session_016pwPb8iA8Ch62118zCNfyS
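As a rough illustration of the `created` check in the test plan, a model entry could be built from the GGUF file's mtime along these lines. This is a sketch under assumptions: `model_entry` is a hypothetical helper, the `owned_by` value and the fallback to `0` for a missing file are guesses, and the real server.py may structure this differently.

```python
import os
import tempfile
from pathlib import Path


def model_entry(model_id: str, gguf_path: Path) -> dict:
    """Build one /v1/models-style entry with `created` taken from the file mtime.

    Hypothetical helper for illustration; not the PR's actual function.
    """
    try:
        # st_mtime is a float in seconds; the field is a whole-second Unix timestamp.
        created = int(gguf_path.stat().st_mtime)
    except OSError:
        # Assumed fallback: keep the old hard-coded value if the file is unreadable.
        created = 0
    return {
        "id": model_id,
        "object": "model",
        "created": created,
        "owned_by": "arc-llama",  # assumed value
    }


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "example.gguf"
        path.write_bytes(b"GGUF")
        # Pin the mtime so the result is predictable.
        os.utime(path, (1_700_000_000, 1_700_000_000))
        print(model_entry("example", path)["created"])
```

Truncating to an integer keeps the value in the whole-second form that OpenAI-style clients generally expect for `created`.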
Generated by Claude Code