Fine-Tuning

Projects and tools related to fine-tuning pre-trained AI models.

Active projects 100
New this week +145
Total star growth +508
Cross-source 8
Total stars 614.7k
Total forks 85.5k
Multi-source repos 14
Stars this period +508

Top Projects (100)

R6410418/Jackrong-llm-finetuning-guide

Trend 22
Breakout +185.8%
dataset deepseek fine-tuning guide llama3 llm machine-learning nlp openai pytorch qwen unsloth
383 73 +78/wk
GitHub

run-llama/llama_index

LlamaIndex is the leading document agent and OCR platform

Trend 19
agents application data fine-tuning framework llamaindex llm multi-agents rag vector-database
48.4k 7.2k +20/wk
GitHub PyPI 2-source

unslothai/unsloth

Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.

Trend 17
agent deepseek fine-tuning gemma gemma3 gpt-oss llama llama3 llm llms mistral openai qwen reinforcement-learning self-hosted text-to-speech tts ui unsloth
60.2k 5.2k +197/wk
GitHub HuggingFace PyPI 3-source

hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Trend 13
agent ai deepseek fine-tuning gemma gpt instruction-tuning large-language-models llama llama3 llm lora moe nlp peft qlora quantization qwen rlhf transformers
69.8k 8.5k +61/wk
GitHub HuggingFace PyPI 3-source

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

Trend 10
audio-generation cantonese chatbot chatgpt chinese cosyvoice cross-lingual english fine-grained fine-tuning gpt-4o japanese korean multi-lingual natural-language-generation python text-to-speech tts voice-cloning
20.5k 2.3k +18/wk
GitHub HuggingFace PyPI 3-source

datawhalechina/base-llm

A full-stack algorithm tutorial from NLP to LLMs. Read online: https://datawhalechina.github.io/base-llm/

Trend 3
bert deeplearning docker fine-tuning linux llama llm lora nlp python pytorch qwen rnn tensorrt transformer tutorial
586 55 +22/wk
GitHub

NVIDIA-NeMo/Nemotron

Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end reference examples to build with Nemotron models

Trend 3
ai fine-tuning model-training nemotron nvidia reinforcement-learning
897 192 +16/wk
GitHub

amitshekhariitbhu/ai-engineering-interview-questions

Your Cheat Sheet for AI Engineering Interview – Questions and Answers.

Trend 3
agents ai ai-agents ai-engineering fine-tuning interview interview-preparation interview-questions llm mcp quantization questions-and-answers rag
988 178 +9/wk
GitHub

TYH-labs/unsloth-buddy

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc GRPO log diagnostics, evaluation, and export end-to-end. Part of the Gaslamp AI platform.

Trend 3
apple-silicon claude-code dpo fine-tuning gaslamp grpo huggingface lora qlora rlhf sft transformer unsloth
211 12 +1/wk
GitHub

vllm-project/semantic-router

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Trend 3
ai-gateway bert-classification fine-tuning golang huggingface-candle huggingface-transformers kubernetes llm llmrouter mcp mixture-of-models openclaw pii-detection prompt-engineering prompt-guard rust semantic-router vllm
3.7k 603 +11/wk
GitHub

aiming-lab/MetaClaw

🦞 Just talk to your agent — it learns and EVOLVES 🧬.

Trend 3
agent ai-agent continual-learning fine-tuning llm lora meta-learning metaclaw online-learning openclaw reinforcement-learning skill-learning tinker
3.6k 393 +8/wk
GitHub

AI-Hypercomputer/maxtext

A simple, performant, and scalable JAX LLM!

Trend 3
deepseek fine-tuning gemma2 gemma3 gpt jax large-language-models llama2 llama3 llama4 llm mistral mixtral sft
2.2k 500 +3/wk
GitHub

pykeio/ort

Fast ML inference & training for ONNX models in Rust

Trend 3
ai ai-training fine-tuning inference machine-learning onnx onnxruntime rust
2.2k 230 +4/wk
GitHub

NVIDIA-NeMo/Curator

Scalable data preprocessing and curation toolkit for LLMs

Trend 3
data data-curation data-prep data-preparation data-processing data-processing-pipelines data-quality datacuration datarecipes deduplication fast-data-processing fine-tuning large-language-models large-scale-data-processing llm llm-data-quality llmapps python semantic-deduplication
1.5k 252 +2/wk
GitHub

oxbshw/LLM-Agents-Ecosystem-Handbook

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

Trend 3
ai ai-agent ai-agents fine-tuning finetuning-llms freamework llm llmops local-development mcp-server memory rag rag-chatbot voice-agent
505 79 +1/wk
GitHub

bibinprathap/VeritasGraph

VeritasGraph: Enterprise-Grade Graph RAG for Secure, On-Premise AI with Verifiable Attribution

Trend 3
data-privacy enterprise-ai explainable-ai fine-tuning generative-ai generativeai graph-rag information-retrieval knowledge-graph langchain llamaindex llm lora multi-hop-reasoning neo4j nlp ollama on-premise question-answering rag
266 30 +1/wk
GitHub

invergent-ai/surogate

Full-Stack Development Platform for Building Reliable Agents

Trend 3
cuda deep-learning fine-tuning generative-ai llama llm llms nvidia-gpu qwen sft
218 3 +0/wk
GitHub

ai4protein/VenusFactory2

🏭 AI agent platform with skills for protein engineering, the noob-friendly AI tutorial tool for life science professionals.

Trend 3
agent database fine-tuning language-model life-science noob-friendly protein protein-science skill tutorial
212 29 +0/wk
GitHub

huggingface/optimum-habana

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Trend 3
bert fine-tuning habana hpu transformers
209 271 +0/wk
GitHub

ToolBrain/ToolBrain

A framework for agentic tool use training with reinforcement learning

Trend 3
agentic-ai dpo fine-tuning grpo langchain llm-agents reinforcement-learning smolagents tool-use unsloth
165 14 +1/wk
GitHub

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Trend 3
adapter diffusion fine-tuning llm lora parameter-efficient-learning peft python pytorch transformers
20.9k 2.2k +1/wk
GitHub HuggingFace PyPI 3-source

microsoft/onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Trend 3
ai-framework deep-learning hardware-acceleration machine-learning neural-networks onnx pytorch scikit-learn tensorflow
19.8k 3.8k +13/wk
GitHub HuggingFace PyPI 3-source

Unity-Technologies/ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

Trend 3
deep-learning deep-reinforcement-learning machine-learning neural-networks reinforcement-learning unity unity3d
19.3k 4.4k +3/wk
GitHub

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family and how to use the models on various provider services.

Trend 3
ai finetuning langchain llama llama2 llm machine-learning python pytorch vllm
18.3k 2.7k +0/wk
GitHub HuggingFace PyPI 3-source

stas00/ml-engineering

Machine Learning Engineering Open Book

Trend 3
ai debugging gpus inference large-language-models llm machine-learning machine-learning-engineering mlops network pytorch scalability slurm storage training transformers
17.6k 1.1k +4/wk
GitHub

jindongwang/transferlearning

Transfer learning, domain adaptation, domain generalization, multi-task learning, and more: papers, code, datasets, applications, and tutorials.

Trend 3
deep-learning domain-adaptation domain-adaption domain-generalization few-shot few-shot-learning generalization machine-learning meta-learning paper papers representation-learning self-supervised-learning style-transfer survey theory transfer-learning transferlearning tutorial-code unsupervised-learning
14.3k 3.8k +1/wk
GitHub

ConardLi/easy-dataset

A powerful tool for creating datasets for LLM fine-tuning, RAG, and evaluation

Trend 3
dataset fine-tuning javascript llm rag
13.8k 1.4k +16/wk
GitHub

Data-Centric-AI-Community/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Trend 3
big-data-analytics data-analysis data-exploration data-profiling data-quality data-science deep-learning eda exploration exploratory-data-analysis hacktoberfest html-report jupyter jupyter-notebook machine-learning pandas pandas-dataframe pandas-profiling python statistics
13.5k 1.8k +2/wk
GitHub

bentoml/OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Trend 3
bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna
12.3k 806 -1/wk
GitHub

axolotl-ai-cloud/axolotl

Go ahead and axolotl questions

Trend 3
fine-tuning llm
11.6k 1.3k +5/wk
GitHub

wandb/wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Trend 3
ai collaboration data-science data-versioning deep-learning experiment-track hyperparameter-optimization hyperparameter-search hyperparameter-tuning jax keras machine-learning ml-platform mlops model-versioning pytorch reinforcement-learning reproducibility tensorflow
11.0k 856 -3/wk
GitHub

aws/amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Trend 3
aws data-science deep-learning examples inference jupyter-notebook machine-learning mlops reinforcement-learning sagemaker training
10.9k 7.0k +0/wk
GitHub

oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Trend 3
dpo evaluation fine-tuning gpt-oss gpt-oss-120b gpt-oss-20b inference llama llms sft slms vlms
9.2k 744 +2/wk
GitHub

cloneofsimo/lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Trend 3
diffusion dreambooth fine-tuning lora stable-diffusion
7.5k 498 +2/wk
GitHub

flyteorg/flyte

Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows. Flyte 2 now available locally: https://github.com/flyteorg/flyte-sdk

Trend 3
data data-analysis data-science dataops declarative fine-tuning flyte golang grpc hacktoberfest kubernetes kubernetes-operator llm machine-learning mlops orchestration-engine production python scale workflow
6.9k 802 +2/wk
GitHub

Kiln-AI/Kiln

Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.

Trend 3
ai chain-of-thought collaboration dataset-generation evals evaluation evaluation-framework fine-tuning machine-learning macos mcp ml ollama openai prompt prompt-engineering python rlhf synthetic-data windows
4.7k 352 -1/wk
GitHub

LLMBook-zh/LLMBook-zh.github.io

The book "Large Language Models" (《大语言模型》), by Zhao Xin, Li Junyi, Zhou Kun, Tang Tianyi, and Wen Jirong

Trend 3
artificial-intelligence deep-learning deep-neural-networks deep-reinforcement-learning fine-tuning language-model large-language-models natural-language-processing nlp pretrained-models
4.4k 329 +4/wk
GitHub

truefoundry/cognita

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Trend 3
agent ai application data deep-learning fine-tuning framework generative-ai llm llm-ops llmops machine-learning mlops model-deployment python rag retrieval-augmented-generation typescript
4.4k 392 +0/wk
GitHub

thuml/Transfer-Learning-Library

Transfer Learning Library for Domain Adaptation, Task Adaptation, and Domain Generalization

Trend 3
adversarial-learning dann deep-learning domain-adaptation finetune image-translation out-of-distribution-generalization self-training semi-supervised-learning transfer-learning unsupervised-domain-adaptation
3.9k 592 +2/wk
GitHub

dbiir/UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Trend 3
albert bart bert chinese classification clue elmo fine-tuning gpt gpt-2 model-zoo natural-language-processing ner pegasus pre-training pytorch roberta t5 unilm xlm-roberta
3.1k 521 +1/wk
GitHub

AIDotNet/OpenDeepWiki

OpenDeepWiki is the open-source version of the DeepWiki project, aiming to provide a powerful knowledge management and collaboration platform. The project is mainly developed using C# and TypeScript, supporting modular design, and is easy to expand and customize.

Trend 3
chatgpt deepwiki docs fine-tuning
3.0k 397 +6/wk
GitHub

Nerogar/OneTrainer

OneTrainer is a one-stop solution for all your Diffusion training needs.

Trend 3
fine-tuning image-model-training lora training
2.9k 273 +2/wk
GitHub

roboflow/maestro

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Trend 3
captioning fine-tuning florence-2 multimodal objectdetection paligemma phi-3-vision qwen2-vl transformers vision-and-language vqa
2.7k 222 +0/wk
GitHub

decodingai-magazine/second-brain-ai-assistant-course

Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.

Trend 3
agents ai-systems data-engineering fine-tuning huggingface llm llmops mlops openai python rag
2.6k 469 +1/wk
GitHub

adithya-s-k/AI-Engineering.academy

Mastering Applied AI, One Concept at a Time

Trend 3
fine-tuning finetuning finetuning-llms inference large-language-models llm python quantization
2.2k 250 +1/wk
GitHub

dstackai/dstack

Control plane for agents and engineers to provision compute and run training and inference across NVIDIA, AMD, TPU, and Tenstorrent GPUs—on clouds, Kubernetes, and bare-metal clusters.

Trend 3
agent-skills agentic-orchestration amd cloud containers docker fine-tuning gpu inference k8s kubernetes llms machine-learning nvidia orchestration python slurm training
2.1k 222 +0/wk
GitHub

kubeflow/trainer

Distributed AI Model Training and LLM Fine-Tuning on Kubernetes

Trend 3
ai distributed fine-tuning gpu huggingface jax kubeflow kubernetes llm machine-learning mlops python pytorch tensorflow xgboost
2.1k 943 +2/wk
GitHub

adobe-research/custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Trend 3
computer-vision customization diffusion-models few-shot fine-tuning pytorch text-to-image-generation
2.0k 142 +0/wk
GitHub

google-deepmind/penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Trend 3
fine-tuning interpretability jax neural-networks visualization
1.9k 69 +1/wk
GitHub

THUDM/LongWriter

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Trend 3
fine-tuning llm long-context long-text
1.9k 185 +1/wk
GitHub

nyu-mll/jiant

jiant is an NLP toolkit

Trend 3
bert multitask-learning nlp sentence-representation transfer-learning transformers
1.7k 297 +0/wk
GitHub

bespokelabsai/curator

Synthetic data curation for post-training and structured data extraction

Trend 3
agents deep-learning fine-tuning instruction-tuning llm machine-learning natural-language-processing prompt python synthetic-data synthetic-dataset-generation
1.7k 136 +2/wk
GitHub

beam-cloud/beta9

Ultrafast serverless GPU inference, sandboxes, and background jobs

Trend 3
autoscaler cloudrun cuda developer-productivity distributed-computing faas fine-tuning functions-as-a-service generative-ai gpu large-language-models llm llm-inference ml-platform paas self-hosted serverless serverless-containers
1.6k 142 +0/wk
GitHub

LirongWu/awesome-graph-self-supervised-learning

Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"

Trend 3
data-augmentation deep-learning graph-neural-networks machine-learning pre-training pretext-task representation-learning self-supervised-learning transfer-learning unsupervised-learning
1.4k 164 +0/wk
GitHub

SakanaAI/text-to-lora

Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input

Trend 3
fine-tuning hypernetworks llm lora machine-learning
1.3k 87 +1/wk
GitHub

always-further/deepfabric

Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline

Trend 3
agents ai data-science dataset distillation evaluation fine-tuning huggingface huggingface-datasets machine-learning open open-source python source synthetic synthetic-data unsloth
851 80 +0/wk
GitHub

dvgodoy/FineTuningLLMs

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"

Trend 3
bitsandbytes fine-tuning finetuning finetuning-llms hugging-face huggingface large-language-models llamacpp lora ollama peft peft-fine-tuning-llm pytorch transformers
806 104 +0/wk
GitHub

MolecularAI/REINVENT4

AI molecular design tool for de novo design, scaffold hopping, R-group replacement, linker design and molecule optimization.

Trend 3
ai astrazeneca cheminformatics chemistry deep-learning denovo-design drug-design drug-discovery generative-ai ml molecule-generation neural-networks reinforcement-learning transfer-learning
720 207 +0/wk
GitHub

EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.

Trend 3
attack-defense continual-learning diffusion-models ensemble-learning federated-learning few-shot-learning foundation-models generalization generative-model knowledge-fusion large-language-models llms meta-learning model-fusion model-merging multi-domain-learning multi-task-learning robustness transfer-learning zero-shot-learning
708 41 +1/wk
GitHub

Coobiw/MPP-LLaVA

Personal project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.

Trend 3
deepspeed fine-tuning mllm model-parallel multimodal-large-language-models pipeline-parallelism pretraining qwen video-language-model video-large-language-models
669 35 +2/wk
GitHub

achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project

Complete-Life-Cycle-of-a-Data-Science-Project

Trend 3
analysis data-analysis data-science dataset deep-learning eda exploratory-data-analysis feature-engineering federated-learning machine-learning nlp-models python python-library pytorch reinforcement-learning scraper supervised-learning transfer-learning unsupervised-learning web-scraping
638 254 +1/wk
GitHub

Denis2054/RAG-Driven-Generative-AI

This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face models for generation and evaluation.

Trend 3
advanced-rag chroma chromadb embedding-models fine-tuning gpt-4o-mini gpt4-omni grok huggingface indexing-querying llama llama-index multimodal openai-api pinecone rag scaling vision-transformer xai-grok
596 202 +1/wk
GitHub

kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

Trend 3
ai buildkit chatgpt docker fine-tuning finetuning gemma gpt inference kubernetes large-language-models llama llm localllama mistral mixtral nvidia open-llm open-source-llm openai
514 59 +0/wk
GitHub

open-edge-platform/geti

Build computer vision models in a fraction of the time and with less data.

Trend 3
computer-vision deep-learning fine-tuning geti inference openvino
474 51 +0/wk
GitHub

tensorflow/tfjs

A WebGL accelerated JavaScript library for training and deploying ML models.

Trend 0
deep-learning deep-neural-network gpu-acceleration javascript machine-learning neural-network typescript wasm web-assembly webgl
19.1k 2.0k +0/wk
GitHub HuggingFace 2-source

horovod/horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Trend 0
baidu deep-learning deeplearning keras machine-learning machinelearning mpi mxnet pytorch ray spark tensorflow uber
14.7k 2.2k +0/wk
GitHub

alex000kim/nsfw_data_scraper

Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier

Trend 0
content-moderation deep-learning machine-learning nsfw nsfw-classifier pornography
12.6k 2.9k +0/wk
GitHub

ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Trend 0
computer-vision data-centric data-science deep deep-learning deeplearning fine-tuning learning llama llama2 llm llm-training machine-learning machinelearning mistral ml natural-language natural-language-processing neural-network pytorch
11.7k 1.2k -1/wk
GitHub

h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Trend 0
ai chatbot chatgpt fedramp fine-tuning finetuning generative generative-ai gpt llama llama2 llm llm-training
4.9k 523 -2/wk
GitHub

nucleuscloud/neosync

Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.

Trend 0
benthos docker etl faker fine-tuning golang kubernetes mysql nextjs open-source orchestration postgresql reactjs self-hosted synthetic-data synthetic-data-generation test-data-generator testing typescript
4.1k 231 -1/wk
GitHub

predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Trend 0
fine-tuning gpt llama llm llm-inference llm-serving llmops lora model-serving pytorch transformers
3.7k 311 +0/wk
GitHub

hiyouga/ChatGLM-Efficient-Tuning

Efficient fine-tuning of ChatGLM-6B with PEFT

Trend 0
alpaca chatglm chatglm2 chatgpt fine-tuning huggingface language-model lora peft pytorch qlora rlhf transformers
3.7k 466 +0/wk
GitHub

ZhaoJ9014/face.evoLVe

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Trend 0
artificial-intelligence computer-vision convolutional-neural-network data-augmentation deep-learning face-alignment face-detection face-landmark-detection face-recognition feature-extraction fine-tuning hard-negative-mining imbalanced-learning machine-learning model-training nus pytorch supervised-learning tencent transfer-learning
3.6k 762 -1/wk
GitHub

smallcloudai/refact

AI Agent that handles engineering tasks end-to-end: integrates with developers’ tools, plans, executes, and iterates until it achieves a successful result.

Trend 0
ai-agent developer-tools enterprise fine-tuning on-prem open-source rag self-hosted swe-bench vscode
3.5k 309 +0/wk
GitHub

tensorflow/hub

A library for transfer learning by reusing parts of TensorFlow models.

Trend 0
embeddings image-classification machine-learning ml python tensorflow transfer-learning
3.5k 1.6k +0/wk
GitHub

sentient-agi/OML-1.0-Fingerprinting

OML 1.0 via Fingerprinting: Open, Monetizable, and Loyal AI

Trend 0
fine-tuning fingerprint loyalty oml sentient verifiable-ai
3.5k 235 +0/wk
GitHub

TJU-DRL-LAB/AI-Optimizer

The next-generation deep reinforcement learning toolkit

Trend 0
deep-learning reinforcement-learning transfer-learning
3.5k 597 -1/wk
GitHub

iusztinpaul/hands-on-llms

🦖 Learn about LLMs, LLMOps, and vector DBs for free by designing, training, and deploying a real-time financial advisor LLM system ~ source code + video & reading materials

Trend 0
3-pipeline-design aws beam bytewax cicd comet-ml docker fine-tuning generative-ai huggingface langchain llmops llms mlops qdrant qlora streaming transformers
3.4k 550 -1/wk
GitHub

jingyi0000/VLM_survey

Collection of AWESOME vision-language models for vision tasks

Trend 0
clip computer-vision deep-learning knowledge-distillation multi-modal-model survey transfer-learning vision-language-model
3.1k 234 +0/wk
GitHub

ashishpatel26/LLM-Finetuning

LLM Finetuning with peft

Trend 0
falcon fine-tuning huggingface llama llama2 llm llms lora peft pytorch text-generation
2.9k 764 -1/wk
GitHub

bghira/SimpleTuner

A general fine-tuning kit geared toward image/video/audio diffusion models.

Trend 0
diffusers diffusion-models fine-tuning flux-dev machine-learning stable-diffusion
2.8k 276 -2/wk
GitHub

stochasticai/xTuring

Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

Trend 0
adapter deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization
2.7k 213 -2/wk
GitHub

AIStream-Peelout/flow-forecast

Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).

Trend 0
anomaly-detection deep-learning deep-neural-networks forecasting hacktoberfest lstm pytorch state-of-the-art-models time-series time-series-analysis time-series-forecasting time-series-regression transfer-learning transformer
2.3k 304 -1/wk
GitHub

neuralmagic/sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Trend 0
automl computer-vision-algorithms deep-learning-algorithms deep-learning-library deep-learning-models image-classification keras nlp object-detection onnx pruning pruning-algorithms pytorch smaller-models sparsification sparsification-recipes sparsity tensorflow transfer-learning
2.1k 156 -2/wk
GitHub

YiVal/YiVal

Your Automatic Prompt Engineering Assistant for GenAI Applications

Trend 0
ai ai-experiments ai-toolkit aigc api auto-prompting autogpt fine-tuning framework generative-ai gpt4 llm midjourney prompt prompt-engineering prompt-tuning promptengineering python stable-diffusion
2.1k 329 -3/wk
GitHub

chaoyanghe/Awesome-Federated-Learning

FedML - The Research and Production Integrated Federated Learning Library: https://fedml.ai

Trend 0
adversarial-attack-and-defense communication-efficiency computation-efficiency computer-vision continual-learning decentralized-federated-learning distributed-optimization federated-learning hierarchical-federated-learning incentive-mechanism interpretability machine-learning neural-architecture-search non-iid privacy semi-supervised-learning straggler-problem transfer-learning vertical-federated-learning wireless-communication
2.0k 334 +0/wk
GitHub

eosphoros-ai/DB-GPT-Hub

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

Trend 0
database datasets fine-tuning gpt hacktoberfest llm nl2sql sql text-to-sql text2sql
2.0k 247 +0/wk
GitHub

ray-project/llm-applications

A comprehensive guide to building RAG-based LLM applications for production.

Trend 0
anyscale fine-tuning llama2 llms machine-learning openai ray serving
1.9k 253 +0/wk
GitHub

huggingface/transfer-learning-conv-ai

🦄 State-of-the-Art Conversational AI with Transfer Learning

Trend 0
chatbots deep-learning dialog gpt gpt-2 neural-networks nlp pytorch transfer-learning
1.8k 431 +0/wk
GitHub

jina-ai/finetuner

🎯 Task-oriented embedding tuning for BERT, CLIP, etc.

Trend 0
bert few-shot-learning fine-tuning finetuning jina metric-learning negative-sampling neural-search openai-clip pretrained-models siamese-network similarity-learning transfer-learning triplet-loss
1.5k 68 +0/wk
GitHub

tianrun-chen/SAM-Adapter-PyTorch

Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts

Trend 0
2d-segmentation adapter camouflage-images camouflaged-object-detection camouflaged-target-detection fine-tune fine-tuning image-segmentation image-segmentation-pytorch segment-anything segment-anything-model
1.5k 122 +0/wk
GitHub

uclaml/SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Trend 0
deep-learning fine-tuning large-language-models self-play
1.2k 105 +0/wk
GitHub

AGI-Edgerunners/LLM-Adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Trend 0
adapters fine-tuning large-language-models parameter-efficient
1.2k 121 -1/wk
GitHub

datadreamer-dev/DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Trend 0
alignment deep-learning fine-tuning gpt instruction-tuning llm llmops llms machine-learning natural-language-processing nlp nlp-library openai python pytorch synthetic-data synthetic-dataset-generation transformers
1.1k 60 +0/wk
GitHub

Tencent/TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Trend 0
albert bart bert chinese classification clue elmo fine-tuning gpt gpt-2 model-zoo natural-language-processing ner pegasus pre-training pytorch roberta t5 unilm xlm-roberta
1.1k 148 +0/wk
GitHub

RL-VIG/LibFewShot

[TPAMI 2023] LibFewShot: A Comprehensive Library for Few-shot Learning.

Trend 0
few-shot-learning fine-tuning image-classification meta-learning pytorch
1.1k 199 -1/wk
GitHub

brightmart/bert_language_understanding

Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN

Trend 0
attention-is-all-you-need bert-model document-classification fasttext language-model language-understanding nlp pre-training question-answering self-attention text-classification textcnn transfer-learning transformer-encoder
967 211 +0/wk
GitHub

DmitryRyumin/ICCV-2023-25-Papers

ICCV 2023-2025 Papers: Discover cutting-edge research from ICCV 2023-25, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Trend 0
3d-graphics 3d-reconstruction biometrics computer-vision datasets deep-learning explainable-ai face-recognition gesture-recognition iccv iccv2023 iccv2025 image-processing image-synthesis multimodal-learning pattern-recognition photogrammetry pose-estimation transfer-learning video-synthesis
967 49 +0/wk
GitHub

louisfb01/start-llms

A complete guide to start and improve your LLM skills in 2026 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

Trend 0
ai fine-tuning gpt gpt-4 language-model large-language-models llama llm llms rag retrieval-augmented-generation
959 124 +0/wk
GitHub

deepdrive/deepdrive

Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving

Trend 0
competition control deep-learning deep-reinforcement-learning gym python reinforcement-learning self-driving-car sensorimotor simulation tensorflow transfer-learning unreal-engine vision
927 150 +0/wk
GitHub

Source Breakdown

GitHub
Stars 614.7k
Forks 85.5k
Repos 100
PyPI
Packages 7
HuggingFace
Linked Repos 7

Related Topics