pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
PyTorch, LLaMA, Detectron2, fairseq, and more
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environments where agents must adapt their strategies as new information becomes available, mirroring real-world challenges.
SiLK (Simple Learned Keypoint) is a self-supervised deep learning keypoint model.
An implementation of a deep learning recommendation model (DLRM)
Code & Models for 3DETR - an End-to-end transformer model for 3D object detection
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
A method to increase the speed and lower the memory footprint of existing vision transformers.
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
End-to-End Object Detection with Transformers
This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and could be used as a drop-in replacement of a semantic parsers original encoder to compute representations for utterances and table schemas (columns).
An open-source tool for LLM prompt optimization.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Datasets, Transforms and Models specific to Computer Vision
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Large Concept Models: Language modeling in a sentence representation space
A PyTorch native platform for training generative AI models
On-device AI across mobile, embedded and edge for PyTorch
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Training Large Language Model to Reason in a Continuous Latent Space
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
FAIR Chemistry's library of machine learning methods for chemistry
Official implementation of the paper "The Stable Signature Rooting Watermarks in Latent Diffusion Models"
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Hackable and optimized Transformers building blocks, supporting a composable construction.
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
A flexible, high-performance 3D simulator for Embodied AI research.
PyTorch native quantization and sparsity for training and inference
Set of tools to assess and improve LLM security.
Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset
Reinforcement learning environments for compiler and program optimization tasks
BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically grounded in its two core tenets: reproducibility and standardization.
Data manipulation and transformation for audio signal processing, powered by PyTorch
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo
Compiler for Neural Network hardware accelerators
An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
a Lightweight library for sequential learning agents, including reinforcement learning
[ECCV 2024] Code for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03299)
Serve, optimize and scale PyTorch models in production
LAnguage Model Analysis
An End-To-End, Lightweight and Flexible Platform for Game Research
DrQ-v2: Improved Data-Augmented Reinforcement Learning
Deep Learning for Symbolic Mathematics
Implementation of a classification framework from the paper Aggregated Residual Transformations for Deep Neural Networks
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
A simple and effective method for detecting out-of-distribution images in neural networks.
PyTorch original implementation of Cross-lingual Language Model Pretraining.
Run PyTorch LLMs locally on servers, desktop and mobile
Ocean is the in-house framework for Computer Vision (CV) and Augmented Reality (AR) applications at Meta. It is platform independent and is mainly implemented in C/C++.
A framework for Privacy Preserving Machine Learning
A domain specific language to express machine learning workloads.
A natural language modeling framework based on PyTorch
FAIR Sequence Modeling Toolkit 2
FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
Collection of common code that's shared among different research projects in FAIR computer vision team.
Non-local Neural Networks for Video Classification
Models, data loaders and abstractions for language processing, powered by PyTorch
Official code Cross-Covariance Image Transformer (XCiT)
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
Code release for "Avatars Grow Legs Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model", CVPR 2023
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
A deep learning library for video understanding research.
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
D2Go is a toolkit for efficient deep learning
Transformer training code for sequential tasks
Code for the Convolutional Vision Transformer (ConViT)
A library for differentiable nonlinear optimization
Code for: "And the bit goes down: Revisiting the quantization of neural networks"
Code release for "Learning Video Representations from Large Language Models"
Proposes neural networks that can generate animation of virtual characters for different actions.
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
Library for Model Based RL
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Translate - a PyTorch Language Library
New and extensible file format for storage of large columnar datasets.
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT