Slimmed, cleaned, and fine-tuned oh-my-opencode fork that consumes far fewer tokens
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
Production-grade RAG API built in Rust. Hybrid search with HNSW dense vectors and BM25 sparse matching, cross-encoder reranking, layout-aware document extraction via Docling, and 94.5% accuracy on Open RAG Bench. Powered by Cerebras, Groq, Milvus, and Jina AI.
A robust Node.js proxy server that automatically rotates API keys for Gemini and OpenAI APIs when rate limits (429 errors) are encountered. Built with zero dependencies and comprehensive logging.
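The rotation mechanism described above (switch to the next key whenever the upstream API answers with HTTP 429) can be sketched as follows. This is a minimal illustration, not the project's actual code; the class and method names are hypothetical.

```typescript
// Hypothetical sketch of 429-driven API key rotation.
// Names (KeyRotator, withRetry) are illustrative, not the project's API.
class KeyRotator {
  private index = 0;

  constructor(private keys: string[]) {
    if (keys.length === 0) throw new Error("at least one API key required");
  }

  current(): string {
    return this.keys[this.index];
  }

  // Advance to the next key; called when the upstream returns HTTP 429.
  rotate(): string {
    this.index = (this.index + 1) % this.keys.length;
    return this.current();
  }

  // Run a request, retrying with the next key on a rate-limit error.
  // Gives up once every key has been tried.
  async withRetry<T>(fn: (key: string) => Promise<T>): Promise<T> {
    for (let attempt = 0; attempt < this.keys.length; attempt++) {
      try {
        return await fn(this.current());
      } catch (err: any) {
        if (err?.status === 429) {
          this.rotate(); // this key is rate-limited; try the next one
          continue;
        }
        throw err; // non-rate-limit errors propagate unchanged
      }
    }
    throw new Error("all API keys are rate-limited");
  }
}
```

A proxy built this way needs no external dependencies: the rotator wraps whatever HTTP call the handler makes and keeps per-key state in memory.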
PHP library for interacting with AI platform providers.
OpenAI-compatible gateway aggregating 8 free LLM providers with automatic failover, circuit breakers, and smart routing. 32+ models, zero cost.
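Automatic failover with circuit breakers, as described above, typically means tracking consecutive failures per provider and skipping any provider whose breaker has tripped. A minimal sketch under assumed names and thresholds (none of this is taken from the gateway itself):

```typescript
// Illustrative circuit-breaker sketch; thresholds and provider names
// are assumptions for the example, not the gateway's real configuration.
type BreakerState = "closed" | "open";

class CircuitBreaker {
  private failures = 0;
  private state: BreakerState = "closed";

  constructor(private threshold = 3) {}

  canRequest(): boolean {
    return this.state === "closed";
  }

  recordSuccess(): void {
    this.failures = 0;
    this.state = "closed";
  }

  recordFailure(): void {
    // Trip the breaker after `threshold` consecutive failures.
    if (++this.failures >= this.threshold) this.state = "open";
  }
}

// Smart routing: pick the first provider whose breaker is still closed.
function route(providers: Map<string, CircuitBreaker>): string | null {
  for (const [name, breaker] of providers) {
    if (breaker.canRequest()) return name;
  }
  return null; // every provider is currently tripped
}
```

A production gateway would also reopen breakers after a cooldown (the "half-open" state), but the skip-on-trip routing above is the core of the failover behavior.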
Ultra-fast, customizable AI voice dictation in any active app on Windows (macOS and Linux support coming soon)
A tool to keep tabs on your Cerebras Code usage limits in real time
AI-powered geopolitical news intelligence platform. Ingests 100K+ daily events from GDELT, stores in MotherDuck (DuckDB), orchestrates with Dagster, and features an AI chat interface with Text-to-SQL. Full data engineering stack at $0/month.
🦖 X—LLM: Simple & Cutting Edge LLM Finetuning
This repository features an example of how to use the xllm library. Included is a solution for a common type of assessment given to LLM engineers.
Matrix decomposition and multiplication on the Cerebras Wafer-Scale Engine (WSE) architecture
A solution that prioritizes patients based on urgency, reducing wait times and ensuring that those who need immediate care receive it.