NVIDIA AI

company

TensorRT, NeMo, Megatron-LM, RAPIDS, cuDF

Repos 43
Total stars 168.6k
43
Total Repos
168.6k
Total Stars
0
Trending

Projects (43)

AI

NVIDIA/aicr

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Trend 3
ai argocd config gpu helm kubernetes manifest runtime
258 27 +0/wk
GitHub
OP

NVIDIA/OpenShell

OpenShell is the safe, private runtime for autonomous AI agents.

Trend 3
4.6k 480 +28/wk
GitHub
EA

NVIDIA/earth2studio

Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.

Trend 3
ai climate-science deep-learning weather
748 167 +5/wk
GitHub
KV

NVIDIA/kvpress

LLM KV cache compression made easy

Trend 3
inference kv-cache kv-cache-compression large-language-models llm long-context python pytorch transformers
1.0k 127 +0/wk
GitHub
BF

NVIDIA/bionemo-framework

BioNeMo Framework: For building and adapting AI models in drug discovery at scale

Trend 3
drug-discovery gpu machine-learning pytorch
718 136 +2/wk
GitHub
MO

NVIDIA/Model-Optimizer

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

Trend 3
2.4k 337 +10/wk
GitHub
NA

NVIDIA/NeMo-Agent-Toolkit

The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

Trend 3
2.2k 601 +2/wk
GitHub
PH

NVIDIA/physicsnemo

Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

Trend 3
deep-learning machine-learning nvidia-gpu nvidia-warp physics pytorch
2.7k 634 +7/wk
GitHub
DL

NVIDIA/DLSS

NVIDIA DLSS is a new and improved deep learning neural network that boosts frame rates and generates beautiful, sharp images for your games

Trend 3
1.3k 110 +2/wk
GitHub
NR

NVIDIA/NeMo-Retriever

NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.

Trend 3
2.9k 315 +1/wk
GitHub
PS

NVIDIA/physicsnemo-sym

Framework providing pythonic APIs, algorithms and utilities to be used with PhysicsNeMo core to physics inform model training as well as higher level abstraction for domain experts

Trend 3
deep-learning machine-learning nvidia-gpu physics pytorch
323 121 +0/wk
GitHub
AF

NVIDIA/audio-flamingo

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

Trend 3
audio-captioning audio-language-models audio-question-answering audio-reasoning multimodal-large-language-models
1.0k 87 +1/wk
GitHub
TL

NVIDIA/TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

Trend 3
blackwell cuda llm-serving moe pytorch
13.3k 2.3k +6/wk
GitHub
RU

NVIDIA/RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Trend 3
1.5k 125 +0/wk
GitHub
CH

NVIDIA/ChatRTX

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Trend 3
3.1k 430 +0/wk
GitHub
TR

NVIDIA-Merlin/Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Trend 3
bert gtp huggingface language-model nlp pytorch recommender-system recsys seq2seq session-based-recommendation tabular-data transformer xlnet
1.3k 159 +2/wk
GitHub
TE

NVIDIA/TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Trend 3
deep-learning gpu-acceleration inference nvidia tensorrt
12.9k 2.3k +2/wk
GitHub
WA

NVIDIA/warp

A Python framework for GPU-accelerated simulation, robotics, and machine learning.

Trend 3
cuda differentiable-programming gpu gpu-acceleration nvidia nvidia-warp python
6.5k 477 +8/wk
GitHub
DA

NVIDIA/DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Trend 3
audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch
5.7k 661 +0/wk
GitHub
GE

NVIDIA/GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Trend 3
gpu-acceleration large-language-models llm llm-inference microservice nemo rag retrieval-augmented-generation tensorrt triton-inference-server
3.9k 1.0k +0/wk
GitHub
NC

NVIDIA/nccl

Optimized primitives for collective multi-GPU communication

Trend 3
communications cpp cuda deep-learning gpu nvidia
4.6k 1.2k +6/wk
GitHub
WA

NVIDIA/waveglow

A Flow-based Generative Network for Speech Synthesis

Trend 3
2.3k 537 +1/wk
GitHub
ML

NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Trend 3
large-language-models model-para transformers
16.0k 3.8k +15/wk
GitHub
DE

NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Trend 3
computer-vision deep-learning drug-discovery forecasting large-language-models mxnet nlp paddlepaddle pytorch recommender-systems speech-recognition speech-synthesis tensorflow tensorflow2 translation
14.8k 3.4k +2/wk
GitHub
TR

NVIDIA/TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

Trend 3
cuda deep-learning fp4 fp8 gpu jax machine-learning python pytorch
3.3k 687 +1/wk
GitHub
FA

NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

Trend 3
bert gpt pytorch transformer
6.4k 934 +1/wk
GitHub
ME

NVIDIA-Merlin/Merlin

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.

Trend 3
deep-learning end-to-end gpu-acceleration machine-learning recommendation-system recommender-system
886 129 +0/wk
GitHub
FA

NVIDIA/FastPhotoStyle

Style transfer, deep learning, feature transform

Trend 3
11.2k 1.2k +0/wk
GitHub
GA

NVIDIA/garak

the LLM vulnerability scanner

Trend 3
ai llm-evaluation llm-security security-scanners vulnerability-assessment
7.5k 864 +8/wk
GitHub
MI

NVIDIA/MinkowskiEngine

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

Trend 3
3d-convolutional-network 3d-vision 4d-convolutional-neural-network auto-differentiation computer-vision convolutional-neural-networks cuda deep-learning high-dimensional-data high-dimensional-inference minkowski-engine neural-network pytorch semantic-segmentation space-time sparse-convolution sparse-tensor-network sparse-tensors spatio-temporal-analysis trilateral-filter
2.9k 467 +2/wk
GitHub
CU

NVIDIA/cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

Trend 3
cpp cuda deep-learning deep-learning-library gpu nvidia python
9.5k 1.8k +4/wk
GitHub
DI

NVIDIA/DIGITS

Deep Learning GPU Training System

Trend 3
caffe deep-learning gpu machine-learning torch
4.2k 1.4k +1/wk
GitHub
NV

NVIDIA/nvvl

A library that uses hardware acceleration to load sequences of video frames to facilitate machine learning training

Trend 0
694 84 +0/wk
GitHub
RU

NVIDIA/runx

Deep Learning Experiment Management

Trend 0
641 39 +0/wk
GitHub
DS

NVIDIA/Dataset_Synthesizer

NVIDIA Deep learning Dataset Synthesizer (NDDS)

Trend 0
computer-vision deep-learning domain-randomization object-detection pose-estimation synthetic-dataset-generation
600 133 +0/wk
GitHub
FL

NVIDIA/flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Trend 0
speech-synthesis
897 174 +0/wk
GitHub
RE

NVIDIA/retinanet-examples

Fast and accurate object detection with end-to-end GPU optimization

Trend 0
deep-learning neural-network object-detection python pytorch retinanet tensorrt
899 265 +0/wk
GitHub
OP

NVIDIA/OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Trend 0
deep-learning float16 language-model mixed-precision multi-gpu multi-node neural-machine-translation seq2seq sequence-to-sequence speech-recognition speech-synthesis speech-to-text tensorflow text-to-speech
1.6k 371 +0/wk
GitHub
DE

NVIDIA/DeepRecommender

Deep learning for recommender systems

Trend 0
collaborative-filtering deep-autoencoders deep-learning gpu recommendation-engine
1.7k 339 +0/wk
GitHub
PI

NVIDIA/pix2pixHD

Synthesizing and manipulating 2048x1024 images with conditional GANs

Trend 0
computer-graphics computer-vision deep-learning deep-neural-networks gan generative-adversarial-network image-to-image-translation pix2pix pytorch
7.0k 1.4k +0/wk
GitHub
CT

NVIDIA/Cosmos-Tokenizer

A suite of image and video neural tokenizers

Trend 0
diffusion tokenization transformers
1.7k 87 +0/wk
GitHub
CA

NVIDIA/context-aware-rag

Context-Aware RAG library for Knowledge Graph ingestion and retrieval functions.

Trend 0
agent agentic-ai agentic-rag graphrag large-language-models llm long-video-understanding rag retrieval-augmented-generation video-summarization video-understanding vlm
65 19 +0/wk
GitHub
SD

NVIDIA/sentiment-discovery

Unsupervised Language Modeling at scale for robust sentiment classification

Trend 0
1.1k 205 +0/wk
GitHub