Run local LLMs like Gemma, Qwen, and LLaMA on Android for offline, private, real-time chat and question answering with LiteRT and ONNX Runtime.
Run a voice agent with <400 ms latency on just 4 GB of VRAM. Fully offline, no API keys required. Optimized for the GTX 1650 and edge robotics, with zero-copy inference. (Apache 2.0)
🚀 A powerful Flutter-based AI chat application that lets you run LLMs directly on your mobile device or connect to local model servers. Features offline model execution, Ollama/LM Studio integration, and a beautiful modern UI. Privacy-focused, cross-platform, and fully open source.
NULLA is a local-first personal AI that runs on your machine, remembers your work, helps with research and workflows, and can optionally share knowledge peer-to-peer.
Local LLM proxy, DevOps friendly
An advanced, fully local, and GPU-accelerated RAG pipeline. Features a sophisticated LLM-based preprocessing engine, state-of-the-art Parent Document Retriever with RAG Fusion, and a modular, Hydra-configurable architecture. Built with LangChain, Ollama, and ChromaDB for 100% private, high-performance document Q&A.
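As a rough illustration of the stack this pipeline builds on, here is a small local Q&A sketch with LangChain, Ollama, and ChromaDB. The model tags, sample texts, and persist directory are placeholders; the repo itself layers parent-document retrieval, RAG Fusion, and Hydra configuration on top of this basic pattern.

```python
from langchain_community.chat_models import ChatOllama
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import Chroma

# Index sample texts in a persistent local Chroma store
# (embedding model and directory are placeholder choices).
store = Chroma.from_texts(
    texts=["LiteRT runs quantized models on-device.",
           "ChromaDB persists embeddings on local disk."],
    embedding=OllamaEmbeddings(model="nomic-embed-text"),
    persist_directory="./chroma_db",
)

# Retrieve the most relevant chunks and answer from them alone,
# so the whole pipeline stays on the local machine.
llm = ChatOllama(model="llama3")
question = "Where does ChromaDB keep its embeddings?"
docs = store.similarity_search(question, k=2)
context = "\n".join(d.page_content for d in docs)
answer = llm.invoke(f"Answer using only this context:\n{context}\n\nQuestion: {question}")
print(answer.content)
```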
🖼️ Python Image and 🎥 Video Generator using LLM providers and models — built with Claude Code 💻 CLI
A framework for using a local LLM (Qwen2.5-Coder 7B) fine-tuned with RL to generate, debug, and optimize code solutions through iterative refinement.
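The iterative-refinement idea reduces to a generate-test-refine loop. A sketch under the assumption that the model is served through Ollama; the model tag, file name, and test command are placeholders, not the repo's actual interface.

```python
import subprocess
import ollama

def refine(task: str, test_cmd: list[str], max_rounds: int = 3) -> str:
    """Generate code, run the tests, and feed failures back to the model."""
    prompt, code = task, ""
    for _ in range(max_rounds):
        # Real pipelines would also strip markdown fences from the reply.
        code = ollama.generate(model="qwen2.5-coder:7b", prompt=prompt)["response"]
        with open("solution.py", "w") as f:
            f.write(code)
        result = subprocess.run(test_cmd, capture_output=True, text=True)
        if result.returncode == 0:
            break  # tests pass, stop refining
        prompt = f"{task}\n\nYour last attempt failed with:\n{result.stderr}\nFix the code."
    return code

refine("Write a Python function fib(n) returning the nth Fibonacci number.",
       ["python", "-m", "pytest", "test_solution.py", "-q"])
```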
LLM Router is a service that can be deployed on-premises or in the cloud. It adds a layer between any application and the LLM provider: in real time it controls traffic, distributes load across providers of a specific LLM, and supports security analysis of outgoing requests (masking, anonymization, prohibited content).
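At its core, that layer is a dispatcher that rewrites and forwards requests. A toy sketch of the two central ideas, round-robin load distribution and masking, against hypothetical OpenAI-compatible endpoints; the real service does far more.

```python
import itertools
import re
import requests

# Hypothetical provider pool: two OpenAI-compatible endpoints serving the same LLM.
PROVIDERS = itertools.cycle([
    "http://llm-a.internal:8000/v1/chat/completions",
    "http://llm-b.internal:8000/v1/chat/completions",
])

EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")

def mask(text: str) -> str:
    # Anonymize sensitive tokens before the request leaves the perimeter.
    return EMAIL.sub("[EMAIL]", text)

def route(prompt: str, model: str = "llama3") -> str:
    url = next(PROVIDERS)  # round-robin distribution across providers
    resp = requests.post(url, json={
        "model": model,
        "messages": [{"role": "user", "content": mask(prompt)}],
    }, timeout=60)
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]
```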
A fully customizable, super light-weight, cross-platform GenAI based Personal Assistant that can be run locally on your private hardware!
🤖 An Intelligent Chatbot: Powered by a locally hosted Llama 3.2 LLM 🧠 (via Ollama) and ChromaDB 🗂️, this chatbot offers semantic search 🔍, session-aware responses 🗨️, and an interactive Streamlit interface 🎨 for seamless user interaction. 🚀
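Session awareness here boils down to replaying the conversation history on every turn. A bare-bones sketch with the official ollama Python client; the llama3.2 tag is an assumption, and the repo's version adds ChromaDB retrieval and the Streamlit UI on top.

```python
import ollama

history = []  # the running session: every user and assistant turn

def ask(user_input: str) -> str:
    history.append({"role": "user", "content": user_input})
    reply = ollama.chat(model="llama3.2", messages=history)
    content = reply["message"]["content"]
    history.append({"role": "assistant", "content": content})
    return content

print(ask("What is semantic search?"))
print(ask("Give a one-line example of it."))  # "it" resolves via the history
```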
An AI-powered assistant to streamline knowledge management, member discovery, and content generation across Telegram and Twitter, while ensuring privacy with local LLM deployment.
Ask CLI is a command-line tool for interacting with a local LLM (large language model) server. It allows you to send queries and receive concise command-line responses.
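A tool in this shape can be only a few lines. A hypothetical sketch targeting an OpenAI-compatible local server; the Ollama port, model tag, and system prompt below are assumptions, not Ask CLI's actual defaults.

```python
#!/usr/bin/env python3
"""Sketch of an Ask-CLI-style tool: one query in, one terse answer out."""
import argparse
import requests

parser = argparse.ArgumentParser(description="Ask a local LLM server")
parser.add_argument("query")
parser.add_argument("--url", default="http://localhost:11434/v1/chat/completions")
parser.add_argument("--model", default="llama3")
args = parser.parse_args()

resp = requests.post(args.url, json={
    "model": args.model,
    "messages": [
        {"role": "system", "content": "Reply with a single shell command, no prose."},
        {"role": "user", "content": args.query},
    ],
}, timeout=60)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```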
An autonomous AI agent for intelligently updating, maintaining, and curating a LightRAG knowledge base.
A lightweight frontend for LM Studio local server APIs. Built using React, Vite, and Tailwind CSS with full support for streaming responses and GitHub Flavored Markdown.
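The server side of this is LM Studio's OpenAI-compatible API, which listens on port 1234 by default. Here is the streaming pattern such a frontend consumes, sketched in Python rather than the repo's React; whatever model is loaded in LM Studio answers regardless of the model string.

```python
from openai import OpenAI

# LM Studio's local server speaks the OpenAI API; any non-empty key works.
client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

stream = client.chat.completions.create(
    model="local-model",  # LM Studio routes this to whichever model is loaded
    messages=[{"role": "user", "content": "Explain streaming responses in one paragraph."}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)  # render tokens as they arrive
```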
Local-first AI conversation archive. Import from ChatGPT, Claude, and Gemini. Browse, search, ask, distill, export.
A high-performance desktop intelligence agent built in Electron. Pairs deep native Windows OS automation with multimodal LLM cognition and offline voice processing.
UNOFFICIAL Simple LM Studio Web UI (Docker)
This repository contains code to securely run SLMs (small language models) locally, using Node.js on the server side or inside the browser.
End-to-end RAG automation built with n8n, Ollama (local LLMs), and Pinecone. Automatically ingests documents, generates embeddings, stores vectors, and enables context-aware AI chat.
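The ingestion leg of that pipeline, stripped of the n8n orchestration, is essentially embed-then-upsert. A rough sketch with the ollama and pinecone Python clients; the index name, embedding model, and chunks are placeholders.

```python
import ollama
from pinecone import Pinecone

pc = Pinecone(api_key="YOUR_API_KEY")
index = pc.Index("docs")  # assumes an index sized for the embedding model

chunks = ["First document chunk.", "Second document chunk."]
vectors = []
for i, chunk in enumerate(chunks):
    # nomic-embed-text is one common local embedding model; swap as needed.
    emb = ollama.embeddings(model="nomic-embed-text", prompt=chunk)["embedding"]
    vectors.append({"id": f"chunk-{i}", "values": emb, "metadata": {"text": chunk}})

index.upsert(vectors=vectors)  # context-aware chat then queries this index
```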