Fine-Tuning | AISignal

JL

R6410418/Jackrong-llm-finetuning-guide

Trend 22

⚡ Breakout +185.8%

dataset deepseek fine-tuning guide llama3 llm machine-learning nlp openai pytorch qwen unsloth

383 73 +78/wk

GitHub

LI

run-llama/llama_index

LlamaIndex is the leading document agent and OCR platform

Trend 19

agents application data fine-tuning framework llamaindex llm multi-agents rag vector-database

48.4k 7.2k +20/wk

GitHub PyPI 2-source

UN

unslothai/unsloth

Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.

Trend 17

agent deepseek fine-tuning gemma gemma3 gpt-oss llama llama3 llm llms mistral openai qwen reinforcement-learning self-hosted text-to-speech tts ui unsloth

60.2k 5.2k +197/wk

GitHub HuggingFace PyPI 3-source

LL

hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Trend 13

agent ai deepseek fine-tuning gemma gpt instruction-tuning large-language-models llama llama3 llm lora moe nlp peft qlora quantization qwen rlhf transformers

69.8k 8.5k +61/wk

GitHub HuggingFace PyPI 3-source

CO

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Trend 10

audio-generation cantonese chatbot chatgpt chinese cosyvoice cross-lingual english fine-grained fine-tuning gpt-4o japanese korean multi-lingual natural-language-generation python text-to-speech tts voice-cloning

20.5k 2.3k +18/wk

GitHub HuggingFace PyPI 3-source

BL

datawhalechina/base-llm

从 NLP 到 LLM 的算法全栈教程，在线阅读地址：https://datawhalechina.github.io/base-llm/

Trend 3

bert deeplearning docker fine-tuning linux llama llm lora nlp python pytorch qwen rnn tensorrt transformer tutorial

586 55 +22/wk

GitHub

NE

NVIDIA-NeMo/Nemotron

Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end reference examples to build with Nemotron models

Trend 3

ai fine-tuning model-training nemotron nvidia reinforcement-learning

897 192 +16/wk

GitHub

AE

amitshekhariitbhu/ai-engineering-interview-questions

Your Cheat Sheet for AI Engineering Interview – Questions and Answers.

Trend 3

agents ai ai-agents ai-engineering fine-tuning interview interview-preparation interview-questions llm mcp quantization questions-and-answers rag

988 178 +9/wk

GitHub

UB

TYH-labs/unsloth-buddy

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc GRPO log diagnostics, evaluation, and export end-to-end. Part of the Gaslamp AI platform.

Trend 3

apple-silicon claude-code dpo fine-tuning gaslamp grpo huggingface lora qlora rlhf sft transformer unsloth

211 12 +1/wk

GitHub

SR

vllm-project/semantic-router

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Trend 3

ai-gateway bert-classification fine-tuning golang huggingface-candle huggingface-transformers kubernetes llm llmrouter mcp mixture-of-models openclaw pii-detection prompt-engineering prompt-guard rust semantic-router vllm

3.7k 603 +11/wk

GitHub

ME

aiming-lab/MetaClaw

🦞 Just talk to your agent — it learns and EVOLVES 🧬.

Trend 3

agent ai-agent continual-learning fine-tuning llm lora meta-learning metaclaw online-learning openclaw reinforcement-learning skill-learning tinker

3.6k 393 +8/wk

GitHub

MA

AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

Trend 3

deepseek fine-tuning gemma2 gemma3 gpt jax large-language-models llama2 llama3 llama4 llm mistral mixtral sft

2.2k 500 +3/wk

GitHub

OR

pykeio/ort

Fast ML inference & training for ONNX models in Rust

Trend 3

ai ai-training fine-tuning inference machine-learning onnx onnxruntime rust

2.2k 230 +4/wk

GitHub

CU

NVIDIA-NeMo/Curator

Scalable data pre processing and curation toolkit for LLMs

Trend 3

data data-curation data-prep data-preparation data-processing data-processing-pipelines data-quality datacuration datarecipes deduplication fast-data-processing fine-tuning large-language-models large-scale-data-processing llm llm-data-quality llmapps python semantic-deduplication

1.5k 252 +2/wk

GitHub

LA

oxbshw/LLM-Agents-Ecosystem-Handbook

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

Trend 3

ai ai-agent ai-agents fine-tuning finetuning-llms freamework llm llmops local-development mcp-server memory rag rag-chatbot voice-agent

505 79 +1/wk

GitHub

VE

bibinprathap/VeritasGraph

VeritasGraph: Enterprise-Grade Graph RAG for Secure, On-Premise AI with Verifiable Attribution

Trend 3

data-privacy enterprise-ai explainable-ai fine-tuning generative-ai generativeai graph-rag information-retrieval knowledge-graph langchain llamaindex llm lora multi-hop-reasoning neo4j nlp ollama on-premise question-answering rag

266 30 +1/wk

GitHub

SU

invergent-ai/surogate

Full-Stack Development Platform for Building Reliable Agents

Trend 3

cuda deep-learning fine-tuning generative-ai llama llm llms nvidia-gpu qwen sft

218 3 +0/wk

GitHub

VE

ai4protein/VenusFactory2

🏭 AI agent platform with skills for protein engineering, the noob-friendly AI tutorial tool for life science professionals.

Trend 3

agent database fine-tuning language-model life-science noob-friendly protein protein-science skill tutorial

212 29 +0/wk

GitHub

OH

huggingface/optimum-habana

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Trend 3

bert fine-tuning habana hpu transformers

209 271 +0/wk

GitHub

TO

ToolBrain/ToolBrain

A framework for agentic tool use training with reinforcement learning

Trend 3

agentic-ai dpo fine-tuning grpo langchain llm-agents reinforcement-learning smolagents tool-use unsloth

165 14 +1/wk

GitHub

PE

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Trend 3

adapter diffusion fine-tuning llm lora parameter-efficient-learning peft python pytorch transformers

20.9k 2.2k +1/wk

GitHub HuggingFace PyPI 3-source

ON

microsoft/onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Trend 3

ai-framework deep-learning hardware-acceleration machine-learning neural-networks onnx pytorch scikit-learn tensorflow

19.8k 3.8k +13/wk

GitHub HuggingFace PyPI 3-source

MA

Unity-Technologies/ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

Trend 3

deep-learning deep-reinforcement-learning machine-learning neural-networks reinforcement-learning unity unity3d

19.3k 4.4k +3/wk

GitHub

LC

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services

Trend 3

ai finetuning langchain llama llama2 llm machine-learning python pytorch vllm

18.3k 2.7k +0/wk

GitHub HuggingFace PyPI 3-source

ME

stas00/ml-engineering

Machine Learning Engineering Open Book

Trend 3

ai debugging gpus inference large-language-models llm machine-learning machine-learning-engineering mlops network pytorch scalability slurm storage training transformers

17.6k 1.1k +4/wk

GitHub

TR

jindongwang/transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Trend 3

deep-learning domain-adaptation domain-adaption domain-generalization few-shot few-shot-learning generalization machine-learning meta-learning paper papers representation-learning self-supervised-learning style-transfer survey theory transfer-learning transferlearning tutorial-code unsupervised-learning

14.3k 3.8k +1/wk

GitHub

ED

ConardLi/easy-dataset

A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval

Trend 3

dataset fine-tuning javascript llm rag

13.8k 1.4k +16/wk

GitHub

YP

Data-Centric-AI-Community/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Trend 3

big-data-analytics data-analysis data-exploration data-profiling data-quality data-science deep-learning eda exploration exploratory-data-analysis hacktoberfest html-report jupyter jupyter-notebook machine-learning pandas pandas-dataframe pandas-profiling python statistics

13.5k 1.8k +2/wk

GitHub

OP

bentoml/OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Trend 3

bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna

12.3k 806 -1/wk

GitHub

AX

axolotl-ai-cloud/axolotl

Go ahead and axolotl questions

Trend 3

fine-tuning llm

11.6k 1.3k +5/wk

GitHub

WA

wandb/wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Trend 3

ai collaboration data-science data-versioning deep-learning experiment-track hyperparameter-optimization hyperparameter-search hyperparameter-tuning jax keras machine-learning ml-platform mlops model-versioning pytorch reinforcement-learning reproducibility tensorflow

11.0k 856 -3/wk

GitHub

AS

aws/amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Trend 3

aws data-science deep-learning examples inference jupyter-notebook machine-learning mlops reinforcement-learning sagemaker training

10.9k 7.0k +0/wk

GitHub

OU

oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Trend 3

dpo evaluation fine-tuning gpt-oss gpt-oss-120b gpt-oss-20b inference llama llms sft slms vlms

9.2k 744 +2/wk

GitHub

LO

cloneofsimo/lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Trend 3

diffusion dreambooth fine-tuning lora stable-diffusion

7.5k 498 +2/wk

GitHub

FL

flyteorg/flyte

Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows. Flyte 2 now available locally: https://github.com/flyteorg/flyte-sdk

Trend 3

data data-analysis data-science dataops declarative fine-tuning flyte golang grpc hacktoberfest kubernetes kubernetes-operator llm machine-learning mlops orchestration-engine production python scale workflow

6.9k 802 +2/wk

GitHub

KI

Kiln-AI/Kiln

Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.

Trend 3

ai chain-of-thought collaboration dataset-generation evals evaluation evaluation-framework fine-tuning machine-learning macos mcp ml ollama openai prompt prompt-engineering python rlhf synthetic-data windows

4.7k 352 -1/wk

GitHub

LZ

LLMBook-zh/LLMBook-zh.github.io

《大语言模型》作者：赵鑫，李军毅，周昆，唐天一，文继荣

Trend 3

artificial-intelligence deep-learning deep-neural-networks deep-reinforcement-learning fine-tuning language-model large-language-models natural-language-processing nlp pretrained-models

4.4k 329 +4/wk

GitHub

CO

truefoundry/cognita

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Trend 3

agent ai application data deep-learning fine-tuning framework generative-ai llm llm-ops llmops machine-learning mlops model-deployment python rag retrieval-augmented-generation typescript

4.4k 392 +0/wk

GitHub

TL

thuml/Transfer-Learning-Library

Transfer Learning Library for Domain Adaptation, Task Adaptation, and Domain Generalization

Trend 3

adversarial-learning dann deep-learning domain-adaptation finetune image-translation out-of-distribution-generalization self-training semi-supervised-learning transfer-learning unsupervised-domain-adaptation

3.9k 592 +2/wk

GitHub

UP

dbiir/UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Trend 3

albert bart bert chinese classification clue elmo fine-tuning gpt gpt-2 model-zoo natural-language-processing ner pegasus pre-training pytorch roberta t5 unilm xlm-roberta

3.1k 521 +1/wk

GitHub

OP

AIDotNet/OpenDeepWiki

OpenDeepWiki is the open-source version of the DeepWiki project, aiming to provide a powerful knowledge management and collaboration platform. The project is mainly developed using C# and TypeScript, supporting modular design, and is easy to expand and customize.

Trend 3

chatgpt deepwiki docs fine-tuning

3.0k 397 +6/wk

GitHub

ON

Nerogar/OneTrainer

OneTrainer is a one-stop solution for all your Diffusion training needs.

Trend 3

fine-tuning image-model-training lora training

2.9k 273 +2/wk

GitHub

MA

roboflow/maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Trend 3

captioning fine-tuning florence-2 multimodal objectdetection paligemma phi-3-vision qwen2-vl transformers vision-and-language vqa

2.7k 222 +0/wk

GitHub

SB

decodingai-magazine/second-brain-ai-assistant-course

Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.

Trend 3

agents ai-systems data-engineering fine-tuning huggingface llm llmops mlops openai python rag

2.6k 469 +1/wk

GitHub

AE

adithya-s-k/AI-Engineering.academy

Mastering Applied AI, One Concept at a Time

Trend 3

fine-tuning finetuning finetuning-llms inference large-language-models llm python quantization

2.2k 250 +1/wk

GitHub

DS

dstackai/dstack

Control plane for agents and engineers to provision compute and run training and inference across NVIDIA, AMD, TPU, and Tenstorrent GPUs—on clouds, Kubernetes, and bare-metal clusters.

Trend 3

agent-skills agentic-orchestration amd cloud containers docker fine-tuning gpu inference k8s kubernetes llms machine-learning nvidia orchestration python slurm training

2.1k 222 +0/wk

GitHub

TR

kubeflow/trainer

Distributed AI Model Training and LLM Fine-Tuning on Kubernetes

Trend 3

ai distributed fine-tuning gpu huggingface jax kubeflow kubernetes llm machine-learning mlops python pytorch tensorflow xgboost

2.1k 943 +2/wk

GitHub

CD

adobe-research/custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Trend 3

computer-vision customization diffusion-models few-shot fine-tuning pytorch text-to-image-generation

2.0k 142 +0/wk

GitHub

PE

google-deepmind/penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Trend 3

fine-tuning interpretability jax neural-networks visualization

1.9k 69 +1/wk

GitHub

LO

THUDM/LongWriter

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Trend 3

fine-tuning llm long-context long-text

1.9k 185 +1/wk

GitHub

JI

nyu-mll/jiant

jiant is an nlp toolkit

Trend 3

bert multitask-learning nlp sentence-representation transfer-learning transformers

1.7k 297 +0/wk

GitHub

CU

bespokelabsai/curator

Synthetic data curation for post-training and structured data extraction

Trend 3

agents deep-learning fine-tuning instruction-tuning llm machine-learning natural-language-processing prompt python synthetic-data synthetic-dataset-generation

1.7k 136 +2/wk

GitHub

BE

beam-cloud/beta9

Ultrafast serverless GPU inference, sandboxes, and background jobs

Trend 3

autoscaler cloudrun cuda developer-productivity distributed-computing faas fine-tuning functions-as-a-service generative-ai gpu large-language-models llm llm-inference ml-platform paas self-hosted serverless serverless-containers

1.6k 142 +0/wk

GitHub

AG

LirongWu/awesome-graph-self-supervised-learning

Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"

Trend 3

data-augmentation deep-learning graph-neural-networks machine-learning pre-training pretext-task representation-learning self-supervised-learning transfer-learning unsupervised-learning

1.4k 164 +0/wk

GitHub

TT

SakanaAI/text-to-lora

Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input

Trend 3

fine-tuning hypernetworks llm lora machine-learning

1.3k 87 +1/wk

GitHub

DE

always-further/deepfabric

Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline

Trend 3

agents ai data-science dataset distillation evaluation fine-tuning huggingface huggingface-datasets machine-learning open open-source python source synthetic synthetic-data unsloth

851 80 +0/wk

GitHub

FI

dvgodoy/FineTuningLLMs

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"

Trend 3

bitsandbytes fine-tuning finetuning finetuning-llms hugging-face huggingface large-language-models llamacpp lora ollama peft peft-fine-tuning-llm pytorch transformers

806 104 +0/wk

GitHub

RE

MolecularAI/REINVENT4

AI molecular design tool for de novo design, scaffold hopping, R-group replacement, linker design and molecule optimization.

Trend 3

ai astrazeneca cheminformatics chemistry deep-learning denovo-design drug-design drug-discovery generative-ai ml molecule-generation neural-networks reinforcement-learning transfer-learning

720 207 +0/wk

GitHub

AM

EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.

Trend 3

attack-defense continual-learning diffusion-models ensemble-learning federated-learning few-shot-learning foundation-models generalization generative-model knowledge-fusion large-language-models llms meta-learning model-fusion model-merging multi-domain-learning multi-task-learning robustness transfer-learning zero-shot-learning

708 41 +1/wk

GitHub

ML

Coobiw/MPP-LLaVA

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.

Trend 3

deepspeed fine-tuning mllm model-parallel multimodal-large-language-models pipeline-parallelism pretraining qwen video-language-model video-large-language-models

669 35 +2/wk

GitHub

CL

achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project

Complete-Life-Cycle-of-a-Data-Science-Project

Trend 3

analysis data-analysis data-science dataset deep-learning eda exploratory-data-analysis feature-engineering federated-learning machine-learning nlp-models python python-library pytorch reinforcement-learning scraper supervised-learning transfer-learning unsupervised-learning web-scraping

638 254 +1/wk

GitHub

RD

Denis2054/RAG-Driven-Generative-AI

This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face models for generation and evaluation.

Trend 3

advanced-rag chroma chromadb embedding-models fine-tuning gpt-4o-mini gpt4-omni grok huggingface indexing-querying llama llama-index multimodal openai-api pinecone rag scaling vision-transformer xai-grok

596 202 +1/wk

GitHub

AI

kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

Trend 3

ai buildkit chatgpt docker fine-tuning finetuning gemma gpt inference kubernetes large-language-models llama llm localllama mistral mixtral nvidia open-llm open-source-llm openai

514 59 +0/wk

GitHub

GE

open-edge-platform/geti

Build computer vision models in a fraction of the time and with less data.

Trend 3

computer-vision deep-learning fine-tuning geti inference openvino

474 51 +0/wk

GitHub

TF

tensorflow/tfjs

A WebGL accelerated JavaScript library for training and deploying ML models.

Trend 0

deep-learning deep-neural-network gpu-acceleration javascript machine-learning neural-network typescript wasm web-assembly webgl

19.1k 2.0k +0/wk

GitHub HuggingFace 2-source

HO

horovod/horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Trend 0

baidu deep-learning deeplearning keras machine-learning machinelearning mpi mxnet pytorch ray spark tensorflow uber

14.7k 2.2k +0/wk

GitHub

ND

alex000kim/nsfw_data_scraper

Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier

Trend 0

content-moderation deep-learning machine-learning nsfw nsfw-classifier pornography

12.6k 2.9k +0/wk

GitHub

LU

ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Trend 0

computer-vision data-centric data-science deep deep-learning deeplearning fine-tuning learning llama llama2 llm llm-training machine-learning machinelearning mistral ml natural-language natural-language-processing neural-network pytorch

11.7k 1.2k -1/wk

GitHub

HL

h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Trend 0

ai chatbot chatgpt fedramp fine-tuning finetuning generative generative-ai gpt llama llama2 llm llm-training

4.9k 523 -2/wk

GitHub

NE

nucleuscloud/neosync

Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.

Trend 0

benthos docker etl faker fine-tuning golang kubernetes mysql nextjs open-source orchestration postgresql reactjs self-hosted synthetic-data synthetic-data-generation test-data-generator testing typescript

4.1k 231 -1/wk

GitHub

LO

predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Trend 0

fine-tuning gpt llama llm llm-inference llm-serving llmops lora model-serving pytorch transformers

3.7k 311 +0/wk

GitHub

CE

hiyouga/ChatGLM-Efficient-Tuning

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Trend 0

alpaca chatglm chatglm2 chatgpt fine-tuning huggingface language-model lora peft pytorch qlora rlhf transformers

3.7k 466 +0/wk

GitHub

FA

ZhaoJ9014/face.evoLVe

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Trend 0

artificial-intelligence computer-vision convolutional-neural-network data-augmentation deep-learning face-alignment face-detection face-landmark-detection face-recognition feature-extraction fine-tuning hard-negative-mining imbalanced-learning machine-learning model-training nus pytorch supervised-learning tencent transfer-learning

3.6k 762 -1/wk

GitHub

RE

smallcloudai/refact

AI Agent that handles engineering tasks end-to-end: integrates with developers’ tools, plans, executes, and iterates until it achieves a successful result.

Trend 0

ai-agent developer-tools enterprise fine-tuning on-prem open-source rag self-hosted swe-bench vscode

3.5k 309 +0/wk

GitHub

HU

tensorflow/hub

A library for transfer learning by reusing parts of TensorFlow models.

Trend 0

embeddings image-classification machine-learning ml python tensorflow transfer-learning

3.5k 1.6k +0/wk

GitHub

O1

sentient-agi/OML-1.0-Fingerprinting

OML 1.0 via Fingerprinting: Open, Monetizable, and Loyal AI

Trend 0

fine-tuning fingerprint loyalty oml sentient verifiable-ai

3.5k 235 +0/wk

GitHub

AO

TJU-DRL-LAB/AI-Optimizer

The next generation deep reinforcement learning tookit

Trend 0

deep-learning reinforcement-learning transfer-learning

3.5k 597 -1/wk

GitHub

HO

iusztinpaul/hands-on-llms

🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴

Trend 0

3-pipeline-design aws beam bytewax cicd comet-ml docker fine-tuning generative-ai huggingface langchain llmops llms mlops qdrant qlora streaming transformers

3.4k 550 -1/wk

GitHub

VS

jingyi0000/VLM_survey

Collection of AWESOME vision-language models for vision tasks

Trend 0

clip computer-vision deep-learning knowledge-distillation multi-modal-model survey transfer-learning vision-language-model

3.1k 234 +0/wk

GitHub

LF

ashishpatel26/LLM-Finetuning

LLM Finetuning with peft

Trend 0

falcon fine-tuning huggingface llama llama2 llm llms lora peft pytorch text-generation

2.9k 764 -1/wk

GitHub

SI

bghira/SimpleTuner

A general fine-tuning kit geared toward image/video/audio diffusion models.

Trend 0

diffusers diffusion-models fine-tuning flux-dev machine-learning stable-diffusion

2.8k 276 -2/wk

GitHub

XT

stochasticai/xTuring

Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

Trend 0

adapter deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization

2.7k 213 -2/wk

GitHub

FF

AIStream-Peelout/flow-forecast

Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).

Trend 0

anomaly-detection deep-learning deep-neural-networks forecasting hacktoberfest lstm pytorch state-of-the-art-models time-series time-series-analysis time-series-forecasting time-series-regression transfer-learning transformer

2.3k 304 -1/wk

GitHub

SP

neuralmagic/sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Trend 0

automl computer-vision-algorithms deep-learning-algorithms deep-learning-library deep-learning-models image-classification keras nlp object-detection onnx pruning pruning-algorithms pytorch smaller-models sparsification sparsification-recipes sparsity tensorflow transfer-learning

2.1k 156 -2/wk

GitHub

YI

YiVal/YiVal

Your Automatic Prompt Engineering Assistant for GenAI Applications

Trend 0

ai ai-experiments ai-toolkit aigc api auto-prompting autogpt fine-tuning framework generative-ai gpt4 llm midjourney prompt prompt-engineering prompt-tuning promptengineering python stable-diffusion

2.1k 329 -3/wk

GitHub

AF

chaoyanghe/Awesome-Federated-Learning

FedML - The Research and Production Integrated Federated Learning Library: https://fedml.ai

Trend 0

adversarial-attack-and-defense communication-efficiency computation-efficiency computer-vision continual-learning decentralized-federated-learning distributed-optimization federated-learning hierarchical-federated-learning incentive-mechanism interpretability machine-learning neural-architecture-search non-iid privacy semi-supervised-learning straggler-problem transfer-learning vertical-federated-learning wireless-communication

2.0k 334 +0/wk

GitHub

DG

eosphoros-ai/DB-GPT-Hub

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

Trend 0

database datasets fine-tuning gpt hacktoberfest llm nl2sql sql text-to-sql text2sql

2.0k 247 +0/wk

GitHub

LA

ray-project/llm-applications

A comprehensive guide to building RAG-based LLM applications for production.

Trend 0

anyscale fine-tuning llama2 llms machine-learning openai ray serving

1.9k 253 +0/wk

GitHub

TL

huggingface/transfer-learning-conv-ai

🦄 State-of-the-Art Conversational AI with Transfer Learning

Trend 0

chatbots deep-learning dialog gpt gpt-2 neural-networks nlp pytorch transfer-learning

1.8k 431 +0/wk

GitHub

FI

jina-ai/finetuner

:dart: Task-oriented embedding tuning for BERT, CLIP, etc.

Trend 0

bert few-shot-learning fine-tuning finetuning jina metric-learning negative-sampling neural-search openai-clip pretrained-models siamese-network similarity-learning transfer-learning triplet-loss

1.5k 68 +0/wk

GitHub

SA

tianrun-chen/SAM-Adapter-PyTorch

Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts

Trend 0

2d-segmentation adapter camouflage-images camouflaged-object-detection camouflaged-target-detection fine-tune fine-tuning image-segmentation image-segmentation-pytorch segment-anything segment-anything-model

1.5k 122 +0/wk

GitHub

SP

uclaml/SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Trend 0

deep-learning fine-tuning large-language-models self-play

1.2k 105 +0/wk

GitHub

LA

AGI-Edgerunners/LLM-Adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Trend 0

adapters fine-tuning large-language-models parameter-efficient

1.2k 121 -1/wk

GitHub

DA

datadreamer-dev/DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Trend 0

alignment deep-learning fine-tuning gpt instruction-tuning llm llmops llms machine-learning natural-language-processing nlp nlp-library openai python pytorch synthetic-data synthetic-dataset-generation transformers

1.1k 60 +0/wk

GitHub

TE

Tencent/TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Trend 0

albert bart bert chinese classification clue elmo fine-tuning gpt gpt-2 model-zoo natural-language-processing ner pegasus pre-training pytorch roberta t5 unilm xlm-roberta

1.1k 148 +0/wk

GitHub

LI

RL-VIG/LibFewShot

[TPAMI 2023] LibFewShot: A Comprehensive Library for Few-shot Learning.

Trend 0

few-shot-learning fine-tuning image-classification meta-learning pytorch

1.1k 199 -1/wk

GitHub

BL

brightmart/bert_language_understanding

Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN

Trend 0

attention-is-all-you-need bert-model document-classification fasttext language-model language-understanding nlp pre-training question-answering self-attention text-classification textcnn transfer-learning transformer-encoder

967 211 +0/wk

GitHub

I2

DmitryRyumin/ICCV-2023-25-Papers

ICCV 2023-2025 Papers: Discover cutting-edge research from ICCV 2023-25, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Trend 0

3d-graphics 3d-reconstruction biometrics computer-vision datasets deep-learning explainable-ai face-recognition gesture-recognition iccv iccv2023 iccv2025 image-processing image-synthesis multimodal-learning pattern-recognition photogrammetry pose-estimation transfer-learning video-synthesis

967 49 +0/wk

GitHub

SL

louisfb01/start-llms

A complete guide to start and improve your LLM skills in 2026 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

Trend 0

ai fine-tuning gpt gpt-4 language-model large-language-models llama llm llms rag retrieval-augmented-generation

959 124 +0/wk

GitHub

DE

deepdrive/deepdrive

Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving

Trend 0

competition control deep-learning deep-reinforcement-learning gym python reinforcement-learning self-driving-car sensorimotor simulation tensorflow transfer-learning unreal-engine vision

927 150 +0/wk

GitHub

Top Projects (100)

R6410418/Jackrong-llm-finetuning-guide

run-llama/llama_index

unslothai/unsloth

hiyouga/LlamaFactory

FunAudioLLM/CosyVoice

datawhalechina/base-llm

NVIDIA-NeMo/Nemotron

amitshekhariitbhu/ai-engineering-interview-questions

TYH-labs/unsloth-buddy

vllm-project/semantic-router

aiming-lab/MetaClaw

AI-Hypercomputer/maxtext

pykeio/ort

NVIDIA-NeMo/Curator

oxbshw/LLM-Agents-Ecosystem-Handbook

bibinprathap/VeritasGraph

invergent-ai/surogate

ai4protein/VenusFactory2

huggingface/optimum-habana

ToolBrain/ToolBrain

huggingface/peft

microsoft/onnxruntime

Unity-Technologies/ml-agents

meta-llama/llama-cookbook

stas00/ml-engineering

jindongwang/transferlearning

ConardLi/easy-dataset

Data-Centric-AI-Community/ydata-profiling

bentoml/OpenLLM

axolotl-ai-cloud/axolotl

wandb/wandb

aws/amazon-sagemaker-examples

oumi-ai/oumi

cloneofsimo/lora

flyteorg/flyte

Kiln-AI/Kiln

LLMBook-zh/LLMBook-zh.github.io

truefoundry/cognita

thuml/Transfer-Learning-Library

dbiir/UER-py

AIDotNet/OpenDeepWiki

Nerogar/OneTrainer

roboflow/maestro

decodingai-magazine/second-brain-ai-assistant-course

adithya-s-k/AI-Engineering.academy

dstackai/dstack

kubeflow/trainer

adobe-research/custom-diffusion

google-deepmind/penzai

THUDM/LongWriter

nyu-mll/jiant

bespokelabsai/curator

beam-cloud/beta9

LirongWu/awesome-graph-self-supervised-learning

SakanaAI/text-to-lora

always-further/deepfabric

dvgodoy/FineTuningLLMs

MolecularAI/REINVENT4

EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications

Coobiw/MPP-LLaVA

achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project

Denis2054/RAG-Driven-Generative-AI

kaito-project/aikit

open-edge-platform/geti

tensorflow/tfjs

horovod/horovod

alex000kim/nsfw_data_scraper

ludwig-ai/ludwig

h2oai/h2o-llmstudio

nucleuscloud/neosync

predibase/lorax

hiyouga/ChatGLM-Efficient-Tuning

ZhaoJ9014/face.evoLVe

smallcloudai/refact

tensorflow/hub

sentient-agi/OML-1.0-Fingerprinting

TJU-DRL-LAB/AI-Optimizer

iusztinpaul/hands-on-llms

jingyi0000/VLM_survey