# Web LLM Starter (React + Vite)

Run small LLMs fully in your browser via WebGPU using WebLLM.
## Quickstart

- Prereqs: Node 18+, Chrome/Edge (WebGPU), good GPU.
- Install: `npm i` (or `pnpm i` / `yarn`)
- Dev: `npm run dev`, then open the printed URL.
- Build: `npm run build` and `npm run preview`.
## Deploy to GitHub Pages

- Push this repo to GitHub (default branch `main`).
- The GitHub Actions workflow `.github/workflows/deploy.yml` builds and publishes to Pages.
- Enable Pages: Repo Settings → Pages → Build and deployment → Source: GitHub Actions.
- Your site will be at `https://<user>.github.io/<repo>/`.
## Notes on hosting

- Vite `base` is set from `BASE_PATH`. The workflow sets it to `/<repo>/` so assets resolve correctly on Pages (see the sketch after this list).
- WebGPU does not require COOP/COEP. If you later rely on WASM multithreading, be aware that GitHub Pages cannot set COOP/COEP response headers.
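A minimal sketch of how that wiring usually looks in `vite.config.ts` (assumptions: the workflow exports `BASE_PATH` as an environment variable, and the project uses `@vitejs/plugin-react`; the actual config in this repo may differ):

```ts
// vite.config.ts — derive the deploy base path from the BASE_PATH env var.
import { defineConfig } from "vite";
import react from "@vitejs/plugin-react";

export default defineConfig({
  // The Pages workflow sets BASE_PATH="/<repo>/"; fall back to "/" for local dev.
  base: process.env.BASE_PATH ?? "/",
  plugins: [react()],
});
```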
## Notes

- First load downloads model artifacts (hundreds of MB). They are cached for subsequent runs (a cache-clearing sketch follows this list).
- The model menu defaults to tiny options (1–2B params) for smoother UX. Larger models may be slow or fail on low‑VRAM GPUs.
- If your workload requires WASM multithreading, consider serving with COOP/COEP headers (see Troubleshooting).
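If you ever need to reclaim disk space or force a re-download outside the app's own “Clear caches” button, the artifacts live in the browser's Cache Storage. A hedged sketch; the `"webllm"` name filter is an assumption, so check DevTools → Application → Cache Storage for the actual cache names:

```ts
// Delete cached model artifacts so the next load re-downloads them.
async function clearModelCaches(): Promise<number> {
  const names = await caches.keys();
  // Assumption: WebLLM-related caches include "webllm" in their names.
  const targets = names.filter((name) => name.includes("webllm"));
  await Promise.all(targets.map((name) => caches.delete(name)));
  return targets.length; // how many caches were removed
}
```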
## Troubleshooting

- “WebGPU not supported”: Enable `chrome://flags/#enable-unsafe-webgpu` or update your browser/OS/GPU drivers. A feature-detection sketch follows this list.
- “Failed to compile/initialize”: Try a smaller model, and use the “Clear caches” button to remove stale artifacts.
- Dev server CORS/isolation: Vite works out of the box for WebGPU. For strict WASM threading you can add COOP/COEP headers via a proxy or plugin (see the config sketch below).
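For the first item, it can help to detect support up front and show a friendly message instead of a crash. A sketch (the helper name is illustrative, not from this repo; the `as any` cast avoids depending on `@webgpu/types`):

```ts
// Returns true only if the browser exposes WebGPU and an adapter is available.
export async function hasWebGPU(): Promise<boolean> {
  const gpu = (navigator as any).gpu;
  if (!gpu) return false;
  try {
    // requestAdapter resolves to null when no suitable GPU is found.
    const adapter = await gpu.requestAdapter();
    return adapter !== null;
  } catch {
    return false;
  }
}
```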
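For the last item, Vite's dev server can set the isolation headers directly, with no proxy needed. A sketch, assuming you opt in via `vite.config.ts` (only needed for WASM multithreading, not for WebGPU itself):

```ts
// vite.config.ts — serve the dev site cross-origin isolated.
import { defineConfig } from "vite";

export default defineConfig({
  server: {
    headers: {
      "Cross-Origin-Opener-Policy": "same-origin",
      "Cross-Origin-Embedder-Policy": "require-corp",
    },
  },
});
```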
## Project Structure

- `src/ui/model/useEngine.ts`: Loads models and generates text (sketched below).
- `src/ui/model/models.ts`: Model presets. Adjust as needed.
- `src/ui/chat/*`: Chat state and UI.
- `src/utils/useIndexedDB.ts`: Minimal IndexedDB helper used to persist chats per model.
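For orientation, the loading and generation in `src/ui/model/useEngine.ts` boils down to WebLLM's standard flow. A hedged sketch against WebLLM's public API; the model ID is one of WebLLM's prebuilt presets and may differ from this repo's defaults:

```ts
import { CreateMLCEngine } from "@mlc-ai/web-llm";

// Download (or read from cache) and compile a model; progress is reported
// through the callback so the UI can render a loading indicator.
const engine = await CreateMLCEngine("Llama-3.2-1B-Instruct-q4f16_1-MLC", {
  initProgressCallback: (report) => console.log(report.text),
});

// Generate a reply via the OpenAI-style chat completions API.
const reply = await engine.chat.completions.create({
  messages: [{ role: "user", content: "Hello!" }],
});
console.log(reply.choices[0]?.message.content);
```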