Build software better, together

dineshsoudagar / local-llms-on-android

Run large language models like Qwen and LLaMA locally on Android for offline, private, real-time question answering and chat - powered by ONNX Runtime.

android chatbot android-app on-device-ai mobile-ai onnx-runtime huggingface-tokenizers local-llm qwen llama3 local-llm-integration offline-inference

Updated Mar 27, 2026
Kotlin

pheonix-delta / axiom-voice-agent

Star

Run a <400ms latency Voice Agent on just 4GB VRAM. Fully offline, no API keys required. Optimized for GTX 1650 and edge robotics with zero-copy inference. (Apache 2.0)

Updated Feb 22, 2026
Python

PocketLLM / PocketLLM

Star

🚀 A powerful Flutter-based AI chat application that lets you run LLMs directly on your mobile device or connect to local model servers. Features offline model execution, Ollama/LLMStudio integration, and a beautiful modern UI. Privacy-focused, cross-platform, and fully open source.

self-hosted flutter flutter-desktop flutter-mobile llm local-llm llm-inference ollama llm-framework ollama-ui ollama-interface ollama-gui ollama-app ollama-api local-llm-integration

Updated Jan 29, 2026
Dart

benmaster82 / writher

Star

Voice-powered productivity for Windows

Updated Mar 27, 2026
Python

lelandg / ImageAI

Star

🖼️ Python Image and 🎥 Video Generator using LLM providers and models — built with Claude Code 💻 CLI

image-generation gemini-api video-ge openai-api stable-diffusion stability-ai ollama local-llm-integration

Updated Mar 3, 2026
Python

lynxai-team / goinfer

Star

Local LLM proxy, DevOps friendly

inference inference-server inference-api openai-api llm openaiapi llamacpp llama-cpp local-llm localllm local-ai llm-proxy llama-api llama-server llm-router language-model-api local-lm local-llm-integration

Updated Feb 8, 2026
Go

sanskar9999 / CodeEvolveLLM

Star

A framework for using local LLMs (Qwen2.5-coder 7B) that are fine-tuned using RL to generate, debug, and optimize code solutions through iterative refinement.

ai rl code-generation llm code-interpreter qwen2-5 local-llm-integration

Updated Mar 14, 2025
Python

dronefreak / local_rag_pipeline

Star

An advanced, fully local, and GPU-accelerated RAG pipeline. Features a sophisticated LLM-based preprocessing engine, state-of-the-art Parent Document Retriever with RAG Fusion, and a modular, Hydra-configurable architecture. Built with LangChain, Ollama, and ChromaDB for 100% private, high-performance document Q&A.

Updated Aug 11, 2025
Python

tommathewXC / lidia

Star

A fully customizable, super light-weight, cross-platform GenAI based Personal Assistant that can be run locally on your private hardware!

text-to-speech deep-neural-networks personal-assistant speech-to-text ocr-recognition huggingface llm vllm genai local-llm ollama ollama-python local-llm-integration local-genai

Updated Mar 15, 2025
Python

radlab-dev-group / llm-router

Star

LLM Router is a service that can be deployed on‑premises or in the cloud. It adds a layer between any application and the LLM provider. In real time it controls traffic, distributes a load among providers of a specific LLM, and enables analysis of outgoing requests from a security perspective (masking, anonymization, prohibited content).

security automation cloud rest-api prometheus model-management load-balancing pii on-prem llm genai local-llm llm-router llm-gateway local-llm-integration llm-gateway-system llm-balancing llm-router-models llm-router-plugins

Updated Jan 16, 2026
Python

augustine-aj / RAG-Basics

Star

🤖 An Intelligent Chatbot: Powered by the locally hosted Ollama 3.2 LLM 🧠 and ChromaDB 🗂️, this chatbot offers semantic search 🔍, session-aware responses 🗨️, and an interactive Streamlit interface 🎨 for seamless user interaction. 🚀

interactive-ui local-llm-integration chromadb-integration semantic-sear session-aware-responses dynamic-interaction

Updated Dec 12, 2024
Python

swordonfire / SuperBot

Star

An AI-powered assistant to streamline knowledge management, member discovery, and content generation across Telegram and Twitter, while ensuring privacy with local LLM deployment.

twitter-bot telegram-bot ai-agents weaviate fastapi llm chromadb retrieval-augmented-generation local-llm-integration

Updated Mar 24, 2025
Python

fvanevski / knowledge_agent

Star

An autonomous AI agent for intelligently updating, maintaining, and curating a LightRAG knowledge base.

knowledge-graph knowledge-base knowledge-management ai-agents local-ai local-llm-integration local-ai-development lightrag local-ai-agents

Updated Aug 28, 2025
Python

kmkamyk / ask-cli

Star

**Ask CLI** is a command-line tool for interacting with a local LLM (Large Language Model) server. It allows you to send queries and receive concise command-line responses.

linux bash automation command-line command-line-tool cli-tool ask-cli openai-api llms openai-api-chatbot local-llm command-line-assistant local-llm-integration bash-assistant

Updated Dec 22, 2024
Python

code2k13 / onnx_javascript_browser_inference

Star

This repository has code to securely run SLM (Small language models) locally using nodejs (servers side) or inside browser .

wasm onnx onnxruntime onnx-models small-language-models local-llm-integration

Updated Nov 25, 2025
JavaScript

Sundareeshwaran / lm-studio-chat-agent

Star

A lightweight frontend for LM Studio local server APIs. Built using React, Vite, and Tailwind CSS with full support for streaming responses and GitHub Flavored Markdown.

react ai chatbot tailwindcss llm local-llm lm-studio local-llm-integration

Updated Jan 31, 2026
JavaScript

EcomineAI / JV-Archon

Star

JV-Archon is my personal offline LLM ecosystem.

open-source machine-learning tools ai configs scripts mit-license automation-framework mistral modular-framework personal-workflow large-language-models llms local-llm local-ai ollama lmstudio local-llm-integration gpt-oss-20b

Updated Nov 14, 2025
Shell

DouglasMacKrell / namegnome

Star

Python CLI/TUI for intelligent media file organization. Features atomic operations, rollback safety, and integrity checks, with a local LLM workflow for context-aware renaming and categorization from API-sourced metadata.

python cli metadata automation ai tui file-management tmdb-api tvdb-api llm local-llm local-llm-integration

Updated Jul 6, 2025
Python

haasr / woolychat

Star

WoolyChat - open-source AI chat app for locally hosted Ollama models. Written in Flask/JavaScript.

javascript python macos flask vanilla-javascript windows-10 flask-application chat-application user-friendly flask-sqlalchemy windows-11 ai-chat openllama local-ai ollama ollama-app ollama-api llm-interface local-llm-integration

Updated Oct 16, 2025
Python

EzioDEVio / plantdeck_rag

Star

PlantDeck is an offline herbal RAG that indexes your PDF books and monographs, extracts text/images with OCR, and answers questions with page-level citations using a local LLM via Ollama. Runs on your machine; no cloud. Field guide only; not medical advice.

docker computer-vision offline docker-compose embeddings plants tesseract-ocr poppler pdfminer pymupdf herbalism rag github-actions sentence-transformers ollama rag-chatbot local-llm-integration

Updated Aug 11, 2025
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

local-llm-integration

Here are 46 public repositories matching this topic...

dineshsoudagar / local-llms-on-android

pheonix-delta / axiom-voice-agent

PocketLLM / PocketLLM

benmaster82 / writher

lelandg / ImageAI

lynxai-team / goinfer

sanskar9999 / CodeEvolveLLM

dronefreak / local_rag_pipeline

tommathewXC / lidia

radlab-dev-group / llm-router

augustine-aj / RAG-Basics

swordonfire / SuperBot

fvanevski / knowledge_agent

kmkamyk / ask-cli

code2k13 / onnx_javascript_browser_inference

Sundareeshwaran / lm-studio-chat-agent

EcomineAI / JV-Archon

DouglasMacKrell / namegnome

haasr / woolychat

EzioDEVio / plantdeck_rag

Improve this page

Add this topic to your repo