Inference Engines

HOT

Projects and tools for efficient AI model inference in production.

Active projects 100
New this week +2,144
Total star growth +15.9k
Cross-source 56
Total Stars 2.8M
Total Forks 377.6k
Multi-Source Repos 73
Stars This Period +15.9k

Top Projects (100)

milla-jovovich/mempalace

The highest-scoring AI memory system ever benchmarked. And it's free.

Trend 53
Breakout +10288.9%
ai chromadb llm mcp memory python
25.2k 3.1k +8,699/wk
GitHub

atomicmemory/llm-wiki-compiler

The knowledge compiler. Raw sources in, interlinked wiki out. Inspired by Karpathy's LLM Wiki pattern.

Trend 45
Breakout +426.1%
cli compiler context-engineering karpathy knowledge-base knowledge-compilation llm markdown obsidian wiki
242 21 +49/wk
GitHub

mduongvandinh/llm-wiki

A fully automated personal knowledge base system, powered by LLMs. Based on Andrej Karpathy's LLM Wiki pattern.

Trend 25
Breakout +210.5%
andrej-karpathy claude-code llm
118 52 +76/wk
GitHub

huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Trend 22
audio deep-learning deepseek gemma glm hacktoberfest llm machine-learning model-hub natural-language-processing nlp pretrained-models python pytorch pytorch-transformers qwen speech-recognition transformer vlm
159.0k 32.8k +51/wk
GitHub HuggingFace PyPI arxiv 4-source

BerriAI/litellm

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]

Trend 22
ai-gateway anthropic azure-openai bedrock gateway langchain litellm llm llm-gateway llmops mcp-gateway openai openai-proxy vertex-ai
42.6k 7.1k +103/wk
GitHub PyPI 2-source

huggingface/datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Trend 22
ai artificial-intelligence computer-vision dataset-hub datasets deep-learning huggingface llm machine-learning natural-language-processing nlp numpy pandas pytorch speech tensorflow
21.4k 3.2k +1/wk
GitHub HuggingFace PyPI 3-source

R6410418/Jackrong-llm-finetuning-guide

Trend 22
Breakout +185.8%
dataset deepseek fine-tuning guide llama3 llm machine-learning nlp openai pytorch qwen unsloth
383 73 +78/wk
GitHub

ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Trend 21
data-science deep-learning deployment distributed hyperparameter-optimization hyperparameter-search large-language-models llm llm-inference llm-serving machine-learning optimization parallel python pytorch ray reinforcement-learning rllib serving tensorflow
42.0k 7.4k +20/wk
GitHub PyPI 2-source

langchain-ai/langgraph

Build resilient language agents as graphs.

Trend 21
agents ai ai-agents chatgpt deepagents enterprise framework gemini generative-ai langchain langgraph llm multiagent open-source openai pydantic python rag
28.7k 4.9k +68/wk
GitHub PyPI 2-source

joyehuang/Learn-Open-Harness

🤖 Official Interactive Tutorial for OpenHarness – Zero to Hero in 12 Chapters | Learn OpenHarness like Claude Code: Agent Loop, Tools, Memory, Multi-Agent | An interactive AI Agent tutorial for complete beginners

Trend 21
Breakout +178.9%
agent-harness agent-loop ai-agent ai-agent-tutorial ai-harness ai-infrastructure chinese claude-code generative-ai harness-engineering interactive-learning interactive-tutorial llm nextjs openharness openharness-tutorial shadcn-ui zero-to-hero
106 18 +34/wk
GitHub

santifer/cv-santiago

Interactive CV with AI chat integration. Built with React 19, TypeScript, Claude API. Chat with my AI avatar about my experience.

Trend 20
Breakout +171.1%
ai chatbot claude langfuse llm llmops observability portfolio react tailwindcss typescript vercel vite
206 77 +40/wk
GitHub

langfuse/langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

Trend 20
analytics autogen evaluation langchain large-language-models llama-index llm llm-evaluation llm-observability llmops monitoring observability open-source openai playground prompt-engineering prompt-management self-hosted ycombinator
24.6k 2.5k +76/wk
GitHub HuggingFace PyPI 3-source

lucasastorian/llmwiki

Open Source Implementation of Karpathy's LLM Wiki. Upload documents, connect your Claude account via MCP, and have it write your wiki!

Trend 20
Breakout +162.5%
agents ai-agents claude karpathy knowledge-base llm llm-wiki mcp mcp-server rag supabase
84 16 +28/wk
GitHub

ollama/ollama

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Trend 20
deepseek gemma gemma3 glm go golang gpt-oss llama llama3 llm llms minimax mistral ollama qwen
168.2k 15.4k +122/wk
GitHub PyPI arxiv 3-source

browser-use/browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Trend 19
ai-agents ai-tools browser-automation browser-use llm playwright python
86.5k 10.0k +110/wk
GitHub HuggingFace PyPI 3-source

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Trend 19
amd blackwell cuda deepseek deepseek-v3 gpt gpt-oss inference kimi llama llm llm-serving model-serving moe openai pytorch qwen qwen3 tpu transformer
75.7k 15.3k +116/wk
GitHub PyPI 2-source

run-llama/llama_index

LlamaIndex is the leading document agent and OCR platform

Trend 19
agents application data fine-tuning framework llamaindex llm multi-agents rag vector-database
48.4k 7.2k +20/wk
GitHub PyPI 2-source

cporter202/agentic-ai-apis

The ultimate collection of APIs for building autonomous AI agents — 2,036 production-ready APIs across Agents, AI Models, and MCP Servers. Stop wasting weeks building infrastructure. Plug these in and ship your agent today.

Trend 19
Breakout +150.8%
agentic-ai ai-agents ai-tools api-collection api-directory apis automation curated-list developer-tools llm mcp mcp-servers
163 60 +66/wk
GitHub

google-ai-edge/mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Trend 18
android audio-processing c-plus-plus calculator computer-vision deep-learning framework graph-based graph-framework inference machine-learning mediapipe mobile-development perception pipeline-framework stream-processing video-processing
34.6k 5.9k +24/wk
GitHub PyPI 2-source

comet-ml/opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Trend 18
evaluation hacktoberfest hacktoberfest2025 langchain llama-index llm llm-evaluation llm-observability llmops open-source openai playground prompt-engineering
18.7k 1.4k +20/wk
GitHub PyPI 2-source

Dynamis-Labs/spectralquant

3% Is All You Need: Breaking TurboQuant's Compression Limit via Spectral Structure

Trend 18
Breakout +147.1%
compression kv-cache large-language-models llm-inference machine-learning pytorch quantization research-paper spectral-analysis transformer
84 11 +9/wk
GitHub

unslothai/unsloth

Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.

Trend 17
agent deepseek fine-tuning gemma gemma3 gpt-oss llama llama3 llm llms mistral openai qwen reinforcement-learning self-hosted text-to-speech tts ui unsloth
60.2k 5.2k +197/wk
GitHub HuggingFace PyPI 3-source

open-webui/open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Trend 17
ai llm llm-ui llm-webui llms mcp ollama ollama-webui open-webui openai openapi rag self-hosted ui webui
130.7k 18.5k +155/wk
GitHub HuggingFace PyPI arxiv 4-source

Astro-Han/karpathy-llm-wiki

One skill to build your own Karpathy-style LLM wiki.

Trend 17
Breakout +130.1%
agent-skill claude-code codex cursor knowledge-base llm wiki
168 19 +63/wk
GitHub

ComposioHQ/composio

Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.

Trend 16
agentic-ai agents ai ai-agents aiagents developer-tools function-calling gpt-4 javascript js llm llmops mcp python remote-mcp-server sse typescript
27.7k 4.5k +8/wk
GitHub PyPI 2-source

firecrawl/firecrawl

🔥 The Web Data API for AI - Power AI agents with clean web data

Trend 16
ai ai-agents ai-crawler ai-scraping ai-search crawler data-extraction html-to-markdown llm markdown scraper scraping web-crawler web-data web-data-extraction web-scraper web-scraping web-search webscraping
105.9k 6.9k +368/wk
GitHub PyPI 2-source

ginlix-ai/LangAlpha

Claude Code for Finance

Trend 16
Breakout +131.7%
agent investment langchain langraph llm mcp skills trading
139 20 +12/wk
GitHub

CopilotKit/CopilotKit

The Frontend Stack for Agents & Generative UI. React + Angular. Makers of the AG-UI Protocol

Trend 16
agent agent-native agentic-ai agents ai ai-agent ai-assistant assistant assistant-chat-bots copilot copilot-chat generative-ui js llm nextjs open-source react reactjs ts typescript
30.1k 3.9k +22/wk
GitHub PyPI 2-source

JuliusBrussee/caveman

🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman

Trend 15
Breakout +119.5%
ai anthropic caveman claude claude-code llm meme prompt-engineering skill tokens
7.2k 280 +1,268/wk
GitHub

agentscope-ai/agentscope

Build and run agents you can see, understand and trust.

Trend 15
agent chatbot large-language-models llm llm-agent mcp multi-agent multi-modal react-agent
23.2k 2.4k +50/wk
GitHub HuggingFace PyPI 3-source

Tencent/ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Trend 15
android arm-neon artificial-intelligence caffe darknet deep-learning high-preformance inference ios keras mlir mxnet ncnn neural-network onnx pytorch riscv simd tensorflow vulkan
23.1k 4.4k +5/wk
GitHub PyPI 2-source

vanna-ai/vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

Trend 15
agent ai data-visualization database llm rag sql text-to-sql
23.2k 2.3k +3/wk
GitHub PyPI 2-source

yamadashy/repomix

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

Trend 14
ai anthropic artificial-intelligence chatbot chatgpt claude deepseek developer-tools gemini genai generative-ai gpt javascript language-model llama llm mcp nodejs openai typescript
23.3k 1.1k +53/wk
GitHub PyPI 2-source

volcengine/OpenViking

OpenViking is an open-source context database designed specifically for AI Agents (such as openclaw). OpenViking unifies the management of the context (memory, resources, and skills) that Agents need through a file system paradigm, enabling hierarchical context delivery and self-evolution.

Trend 14
agent agentic-rag ai-agents clawbot context-database context-engineering filesystem llm memory openclaw opencode rag skill
21.6k 1.5k +140/wk
GitHub PyPI 2-source

microsoft/graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Trend 14
gpt gpt-4 gpt4 graphrag llm llms rag
32.1k 3.4k +21/wk
GitHub HuggingFace PyPI 3-source

FoundationAgents/MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Trend 13
agent gpt llm metagpt multi-agent
66.8k 8.5k +28/wk
GitHub PyPI 2-source

OpenHands/OpenHands

🙌 OpenHands: AI-Driven Development

Trend 13
agent artificial-intelligence chatgpt claude-ai cli developer-tools gpt llm openai
70.8k 8.9k +58/wk
GitHub HuggingFace PyPI 3-source

khoj-ai/khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

Trend 13
agent ai assistant chat chatgpt emacs image-generation llama3 llamacpp llm obsidian obsidian-md offline-llm productivity rag research self-hosted semantic-search stt whatsapp-ai
34.0k 2.1k +25/wk
GitHub HuggingFace PyPI 3-source

letta-ai/letta

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

Trend 13
ai ai-agents llm llm-agent
21.9k 2.3k +7/wk
GitHub PyPI 2-source

AstrBotDevs/AstrBot

Agentic IM Chatbot infrastructure that integrates many IM platforms, LLMs, plugins, and AI features, and can be your openclaw alternative. ✨

Trend 12
agent ai chatbot chatgpt discord docker gemini gpt llama llm mcp openai python qq qqbot telegram
29.4k 2.0k +98/wk
GitHub PyPI 2-source

VectifyAI/PageIndex

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

Trend 12
agentic-ai agents ai ai-agents context-engineering information-retrieval llm rag reasoning retrieval retrieval-augmented-generation vector-database
24.6k 2.1k +95/wk
GitHub PyPI 2-source

kessler/gemma-gem

Gemma Gem runs Google's Gemma 4 model entirely on-device via WebGPU — no API keys, no cloud, no data leaving your machine.

Trend 12
Breakout +88.8%
ai chrome-extension gemma4 gemma4-2b llm
491 41 +61/wk
GitHub

hanlulong/awesome-ai-for-economists

A curated list of AI tools, libraries, and resources for economics research, teaching, and policy analysis. Maintained by the OpenEcon team.

Trend 12
Breakout +91.4%
ai artificial-intelligence awesome awesome-list causal-inference econometrics economics economists generative-ai llm machine-learning mcp research-tools social-science stata
134 41 +29/wk
GitHub

datawhalechina/hello-agents

📚 "Building Agents from Scratch": a from-scratch tutorial on agent principles and practice

Trend 12
agent llm rag tutorial
34.6k 4.0k +346/wk
GitHub PyPI 2-source

milvus-io/milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Trend 12
anns cloud-native diskann distributed embedding-database embedding-similarity embedding-store faiss golang hnsw image-search llm nearest-neighbor-search rag vector-database vector-search vector-similarity vector-store
43.7k 3.9k +28/wk
GitHub PyPI 2-source

AutoX-AI-Labs/AutoR

AI handles execution, humans own the direction, and every run becomes an inspectable research artifact on disk.

Trend 12
Breakout +87.6%
agent ai ai-scientist auto-research claude claude-code cli llm openai paper science
197 6 +43/wk
GitHub

nvk/llm-wiki

A Claude Code plugin for building LLM-compiled knowledge bases. Ingest sources, compile interconnected markdown articles, query, lint, research, and generate outputs — all from Claude Code. Optionally view in Obsidian.

Trend 12
Breakout +84.6%
agentic-ai agentic-skills agentic-workflow claude-code codex llm plugin wiki
144 12 +13/wk
GitHub

vercel/ai

The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents

Trend 11
anthropic artificial-intelligence gemini generative-ai generative-ui javascript language-model llm nextjs openai react svelte typescript vercel vue
23.3k 4.1k +11/wk
GitHub PyPI 2-source

2noise/ChatTTS

A generative speech model for daily dialogue.

Trend 11
agent chat chatgpt chattts chinese chinese-language english english-language gpt llm llm-agent natural-language-inference python text-to-speech torch torchaudio tts
39.0k 4.2k +12/wk
GitHub PyPI 2-source

deepset-ai/haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

Trend 11
agent agents ai gemini generative-ai gpt-4 information-retrieval large-language-models llm machine-learning nlp orchestration python pytorch question-answering rag retrieval-augmented-generation semantic-search summarization transformers
24.8k 2.7k +7/wk
GitHub PyPI 2-source

QwenLM/Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Trend 10
chinese flash-attention large-language-models llm natural-language-processing pretrained-models
20.9k 1.8k +8/wk
GitHub PyPI 2-source

AlexsJones/llmfit

Hundreds of models & providers. One command to find what runs on your hardware.

Trend 10
gguf llm localai mlx skill unsloth
22.0k 1.3k +136/wk
GitHub PyPI 2-source

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

Trend 10
ai big-model data-parallelism deep-learning distributed-computing foundation-models heterogeneous-training hpc inference large-scale model-parallelism pipeline-parallelism
41.4k 4.5k -1/wk
GitHub PyPI 2-source

xr843/Master-skill

Chinese Buddhist Master-skill powered by FoJin: a generator of teaching personas of Chinese Buddhist patriarchs and eminent masters, based on Buddhist canonical texts

Trend 10
Breakout +64.0%
agent-skills ai-persona buddhism chinese-buddhism claude-skills digital-humanities fojin llm rag
141 33 +9/wk
GitHub

chatchat-space/Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen, and Llama

Trend 9
chatbot chatchat chatglm chatgpt embedding faiss fastchat gpt knowledge-base langchain langchain-chatglm llama llm milvus ollama qwen rag retrieval-augmented-generation streamlit xinference
37.8k 6.2k +9/wk
GitHub PyPI 2-source

conorbronsdon/avoid-ai-writing

Skill that audits and rewrites content to remove AI writing patterns. Use it with your favorite agents including Claude Code, OpenClaw, and Hermes.

Trend 9
Breakout +59.8%
ai-writing claude claude-code llm prompt-engineering skill writing
770 79 +60/wk
GitHub

QuivrHQ/quivr

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.

Trend 9
ai api chatbot chatgpt database docker framework frontend groq html javascript llm openai postgresql privacy rag react security typescript vector
39.1k 3.7k -2/wk
GitHub PyPI 2-source

toon-format/toon

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

Trend 9
data-format llm serialization tokenization
23.7k 1.1k +10/wk
GitHub PyPI 2-source

Significant-Gravitas/AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Trend 9
agentic-ai agents ai artificial-intelligence autonomous-agents claude gpt llama-api llm openai python
183.2k 46.2k +19/wk
GitHub PyPI 2-source

rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Trend 9
ai artificial-intelligence chatbot chatgpt deep-learning from-scratch generative-ai gpt language-model large-language-models llm machine-learning neural-networks python pytorch transformers
90.3k 13.8k +56/wk
GitHub HuggingFace PyPI 3-source

linshenkx/prompt-optimizer

A prompt optimizer that helps you write high-quality prompts

Trend 9
llm prompt prompt-engineering prompt-optimization prompt-toolkit prompt-tuning
26.1k 3.1k +50/wk
GitHub PyPI 2-source

infiniflow/ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Trend 8
agent agentic agentic-ai agentic-workflow ai context-engineering context-retrieval deep-research deepseek deepseek-r1 document-understanding graphrag harness llm mcp ollama openai openclaw rag retrieval-augmented-generation
77.5k 8.7k +102/wk
GitHub PyPI 2-source

usestrix/strix

Open-source AI hackers to find and fix your app’s vulnerabilities.

Trend 8
agents artificial-intelligence cybersecurity generative-ai llm penetration-testing
23.3k 2.5k +36/wk
GitHub PyPI 2-source

voideditor/void

Trend 8
chatgpt claude copilot cursor developer-tools editor llm open-source openai visual-studio-code vscode vscode-extension
28.5k 2.4k +6/wk
GitHub PyPI 2-source

pandazki/pneuma-skills

Co-creation infrastructure for humans and code agents — visual environment, skills, continuous learning, and distribution.

Trend 7
🔥 Heating Up +47.0%
ai-agents ai-coding-assistant ai-tools bun claude claude-code co-creation developer-tools llm presentations
97 8 +6/wk
GitHub

voocel/ainovel-cli

✨ Fully automated AI novel generation using multiple agents

Trend 7
🔥 Heating Up +45.1%
agents ai ai-agents claude go llm novel openai
103 19 +2/wk
GitHub

omega-memory/omega-memory

Persistent memory for AI coding agents

Trend 7
🔥 Heating Up +42.4%
ai-agent ai-memory claude claude-code coding-agent context-engineering cursor knowledge-graph llm local-first mcp mcp-server memory model-context-protocol persistent-memory semantic-search windsurf
84 16 +4/wk
GitHub

agenmod/immortal-skill

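♾️ Open-source digital immortality framework: distill a seven-dimensional digital twin of anyone from chat logs. Supports 12+ platforms including WeChat, Feishu, iMessage, and Telegram, offers 7 persona templates, and aligns with the OpenClaw Soul Spec standard. One command teaches your AI to distill.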

Trend 7
🔥 Heating Up +36.1%
agent-skills ai-agent chatbot cursor digital-immortality digital-twin distillation feishu llm openclaw persona soul-spec wechat
362 34 +44/wk
GitHub

AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Trend 7
chatgpt finance fingpt fintech large-language-models machine-learning nlp prompt-engineering pytorch reinforcement-learning robo-advisor sentiment-analysis technical-analysis
19.0k 2.7k +2/wk
GitHub PyPI 2-source

czl9707/build-your-own-openclaw

A step-by-step guide to build your own AI agent.

Trend 7
🔥 Heating Up +35.6%
ai-agent build-your-own-x llm python tutorial
869 138 +153/wk
GitHub

fiveoutofnine/whatcanirun

Find the best models and how to run them locally.

Trend 7
🔥 Heating Up +41.5%
apple-silicon llamacpp llm local-llm mlx
116 9 +5/wk
GitHub

mudler/LocalAI

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

Trend 7
agents ai api audio-generation decentralized distributed image-generation libp2p llama llm mamba mcp musicgen object-detection rerank stable-diffusion text-generation tts
45.1k 3.9k +56/wk
GitHub HuggingFace PyPI 3-source

BlockRunAI/blockrun-mcp

Live data for AI agents — search, research, markets, crypto, X/Twitter. Pay-per-call via x402 micropayments.

Trend 7
🔥 Heating Up +35.0%
ai claude-code llm mcp mcp-server x402
251 24 +20/wk
GitHub

zhayujie/chatgpt-on-wechat

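CowAgent is a super AI assistant built on large models. It can think proactively and plan tasks, access the operating system and external resources, create and execute Skills, and keep long-term memory that grows over time, while being lighter and more convenient than OpenClaw. It supports access through WeChat, Feishu, DingTalk, WeCom, QQ, WeChat Official Accounts, and the web; works with OpenAI/Claude/Gemini/DeepSeek/Qwen/GLM/Kimi/LinkAI; handles text, voice, images, and files; and lets you quickly set up personal AI assistants and enterprise digital employees.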

Trend 7
ai ai-agent chatgpt claude deepseek dingtalk feishu-bot gemini kimi linkai llm mcp multi-agent openai openclaw python3 qwen skills wechat weixin
42.9k 9.9k +46/wk
GitHub PyPI 2-source

nextlevelbuilder/goclaw

GoClaw is OpenClaw rebuilt in Go, with multi-tenant isolation, 5-layer security, and native concurrency. Deploy AI agent teams at scale without compromising on safety.

Trend 7
🔥 Heating Up +36.6%
agent-orchestration ai-agent ai-gateway anthropic chatbot discord-bot golang llm mcp multi-agent openai postgresql telegram-bot websocket
2.3k 575 +142/wk
GitHub

rivet-dev/agent-os

A portable open-source operating system for agents. ~6 ms coldstarts, 32x cheaper than sandboxes. Powered by WebAssembly and V8 isolates.

Trend 6
🔥 Heating Up +36.5%
agent ai javascript llm sandbox v8 wasm webassembly
2.5k 98 +116/wk
GitHub

canvas-org/meta-agent

Continual harness optimization

Trend 6
🔥 Heating Up +25.0%
agent anthropic claude harness llm training tuning
40 3 +4/wk
GitHub

he-yufeng/CoreCoder

Minimal AI coding agent (~950 LoC Python) inspired by Claude Code. Works with any LLM. Think NanoGPT for coding agent. Formerly NanoCoder.

Trend 6
🔥 Heating Up +30.6%
ai-agent claude-code cli coding-agent corecoder deepseek developer-tools llm openai python
384 111 +33/wk
GitHub

quantumaikr/quant.cpp

LLM inference with 7x longer context. Pure C, zero dependencies. Lossless KV cache compression + single-header library.

Trend 6
🔥 Heating Up +32.2%
delta-compression embeddable gguf kv-cache llm llm-inference pure-c quantization transformer turboquant
337 39 +25/wk
GitHub

ix-infrastructure/Ix

Understand any codebase instantly. System intelligence for codebases, built for humans and AI.

Trend 6
🔥 Heating Up +23.0%
ai ai-memory claude-code cli code-analysis code-mapping code-memory codebase-understanding codex developer-tools llm openclaw persistent-memory program-analysis software-architecture system-intelligence system-mapping
107 10 +13/wk
GitHub

runzhliu/welink

🔍 A local AI agent for analyzing WeChat chat data (Docker/Windows/MacOS) · AI persona / LLM analysis / friend rankings / word clouds / sentiment trends / group chat profiles

Trend 6
🔥 Heating Up +24.1%
chat-analysis chat-history chat-mcp llm macos mcp-server wechat weixin
67 8 +10/wk
GitHub

tirth8205/code-review-graph

Local knowledge graph for Claude Code. Builds a persistent map of your codebase so Claude reads only what matters — 6.8× fewer tokens on reviews and up to 49× on daily coding tasks.

Trend 6
🔥 Heating Up +26.5%
ai-coding claude claude-code code-review graphrag incremental knowledge-graph llm mcp python static-analysis tree-sitter
6.5k 770 +665/wk
GitHub

jnMetaCode/agency-agents-zh

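🎭 193 plug-and-play AI expert roles, supporting 14 tools including OpenClaw/Claude Code/Cursor/Copilot and covering 18 departments such as engineering, design, marketing, and product. Includes 46 original agents for the Chinese market (Xiaohongshu, Douyin, WeChat, Feishu, DingTalk, and others).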

Trend 6
🔥 Heating Up +25.0%
agency-orchestrator agent-definitions ai-agents ai-roles chinese claude claude-code copilot-agent cursor-rules deepseek llm multi-agent no-code prompt-engineering qwen system-prompt workflow
5.0k 927 +415/wk
GitHub

antgroup/ClawAegis

ClawAegis is a lightweight plugin providing full-lifecycle runtime protection for OpenClaw.

Trend 6
🔥 Heating Up +27.1%
agent agent-security llm openclaw openclaw-plugin security skills
75 12 +3/wk
GitHub

reworkd/AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

Trend 5
agent agentgpt agents agi ai ai-agents autogpt baby-agi gpt langchain llm next openai t3 t3-stack
36.0k 9.4k +6/wk
GitHub PyPI 2-source

datawhalechina/diy-llm

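🎓 A systematic course on building large language models | 🛠️ Covers pretraining data engineering, Tokenizer, Transformer, MoE, GPU programming (CUDA/Triton), distributed training, Scaling Laws, inference optimization, and alignment (SFT/RLHF/GRPO) | 🚀 6 progressive, code-driven assignments that build a full-stack understanding of LLMs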

Trend 5
🔥 Heating Up +23.9%
gpu-programming llm nlp rl sft transformer triton
378 40 +61/wk
GitHub

pathwaycom/llm-app

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

Trend 5
chatbot hugging-face llm llm-local llm-prompting llm-security llmops machine-learning open-ai pathway rag real-time retrieval-augmented-generation vector-database vector-index
60.0k 1.4k +10/wk
GitHub PyPI 2-source

datawhalechina/self-llm

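"A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLM) and multimodal large models (MLLM) in a Linux environment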

Trend 5
chatglm chatglm3 gemma-2b-it glm-4 internlm2 llama3 llm lora minicpm q-wen qwen qwen1-5 qwen2
29.6k 2.9k +19/wk
GitHub PyPI 2-source

NVIDIA-NeMo/DataDesigner

🎨 NeMo Data Designer: Generate high-quality synthetic data from scratch or from seed data.

Trend 5
🔥 Heating Up +23.3%
agentic-ai data-augmentation data-generation llm mcp multimodal nemo nvidia sdg synthetic-data tool-use
1.5k 135 +35/wk
GitHub

jjang-ai/vmlx

vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth

Trend 5
🔥 Heating Up +22.9%
anthropic-api kvcache-compression kvcache-optimization kvcache-reuse llm lmstudio macbook mcp-server mlx mlxllm mlxstudio omlx omlx-alternative openai-api openclaw openclaw-agent persistent-memory prefix-cache vmlx
188 28 +10/wk
GitHub

Arthur-Ficial/apfel

Apple Intelligence from the command line. On-device LLM via FoundationModels framework. No API keys, no cloud, no dependencies.

Trend 5
🔥 Heating Up +22.8%
apple-intelligence apple-silicon cli foundationmodels homebrew llm macos macos-26 on-device openai-compatible swift tool-calling unix
3.7k 132 +149/wk
GitHub

f/prompts.chat

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

Trend 5
ai artificial-intelligence awesome-list chatgpt chatgpt-prompts claude gemini gpt gpt-4 llm machine-learning nextjs open-source openai prompt-engineering prompts prompts-chat typescript
158.1k 20.7k +262/wk
GitHub PyPI 2-source

ItzCrazyKns/Vane

Vane is an AI-powered answering engine.

Trend 5
ai-agents ai-search-engine answering-engine artificial-intelligence llm machine-learning open-source-ai-search-engine perplexica rag search-engine searxng searxng-copilot self-hosted-ai vane
33.7k 3.6k +17/wk
GitHub HuggingFace PyPI 3-source

math-ai-org/mathcode

MathCode: A Frontier Mathematical Coding Agent

Trend 5
🔥 Heating Up +18.5%
ai coding foundation-models llm reasoning
282 25 +19/wk
GitHub

labring/FastGPT

FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

Trend 5
agent claude deepseek llm mcp nextjs openai qwen rag workflow
27.7k 7.0k +11/wk
GitHub PyPI 2-source

Alibaba-NLP/DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Trend 5
agent alibaba artificial-intelligence deep-research deepresearch information-seeking llm tongyi web-agent
18.6k 1.4k +4/wk
GitHub HuggingFace PyPI 3-source

didilili/ai-agents-from-zero

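🚀 The most systematic AI Agent crash course of 2026 | Hands-on agent tutorial · Complete learning path + hands-on projects + interview question bank · Targets LLM application development engineer roles · Covers LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / prompts · Enterprise-grade deployment and fine-tuning · From zero to enterprise rollout + from learning to shipping a project + interview prep, all in one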

Trend 5
🔥 Heating Up +16.0%
agent agent-framework ai-agent aigc coze dify gpt langchain langgraph llm mcp rag skills tutorial
232 38 +17/wk
GitHub

SYuan03/Skill-Anything

Any source (PDF, video, web, audio, text) to interactive learning package with quizzes, flashcards and spaced repetition. One command, 12-section study guide.

Trend 5
🔥 Heating Up +20.0%
active-recall ai-learning cli-tool education flashcard-generator knowledge-extraction learning-tool llm openai pdf-to-quiz python quiz-generator skill-anything spaced-repetition study-guide
168 7 +8/wk
GitHub

ZYKJShadow/Async

IDE: a native-feeling AI coding workspace that blends chat, planning, agent execution, and project navigation into a unified desktop experience.

Trend 5
🔥 Heating Up +18.9%
ai-coding-assistant ai-development-tools code-editor cursor-alternative electron llm reactjs typescript
170 41 +4/wk
GitHub

avidevelops/claude-architect-exam-prep

Trend 5
🔥 Heating Up +16.1%
ai certification claude claude-code claude-skills exam-prep llm multi-agent
36 11 +3/wk
GitHub

Source Breakdown

GitHub
Stars 2.8M
Forks 377.6k
Repos 100
PyPI
Packages 56
HuggingFace
Linked Repos 14
