NVIDIA/aicr
Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes
TensorRT, NeMo, Megatron-LM, RAPIDS, cuDF
OpenShell is the safe, private runtime for autonomous AI agents.
Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
LLM KV cache compression made easy
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
A unified library of state-of-the-art (SOTA) model optimization techniques, including quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment frameworks such as TensorRT-LLM, TensorRT, and vLLM to optimize inference speed.
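To illustrate the quantization technique named above, here is a minimal sketch of symmetric per-tensor int8 post-training quantization in plain Python. This is a conceptual illustration only, not the TensorRT Model Optimizer API; every name in it is hypothetical.

```python
# Minimal sketch of symmetric uniform int8 quantization, one of the
# optimization techniques listed above. Illustration only: NOT the
# Model Optimizer API; all names here are hypothetical.

def quantize_int8(weights):
    """Map float weights to int8 values with a single per-tensor scale."""
    scale = max(abs(w) for w in weights) / 127.0 or 1.0  # guard all-zero case
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from the int8 values."""
    return [v * scale for v in q]

weights = [0.5, -1.27, 0.03, 1.0]
q, scale = quantize_int8(weights)
approx = dequantize(q, scale)
# Per-weight reconstruction error is bounded by scale / 2.
```

Real toolkits add per-channel scales, calibration over activation statistics, and quantization-aware fine-tuning on top of this basic scale-and-round step.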
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
NVIDIA DLSS is a new and improved deep learning neural network that boosts frame rates and generates beautiful, sharp images for your games
NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
A framework providing Pythonic APIs, algorithms, and utilities to be used with PhysicsNeMo core to physics-inform model training, as well as higher-level abstractions for domain experts.
PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
A Python framework for GPU-accelerated simulation, robotics, and machine learning.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Optimized primitives for collective multi-GPU communication
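To show what a collective primitive like all-reduce does, here is a single-process Python simulation of the ring algorithm commonly used for such collectives. This is a hypothetical sketch of the communication pattern, not the NCCL API (NCCL is a C/CUDA library and exposes none of these names).

```python
# Pure-Python simulation of ring all-reduce, the communication pattern
# behind collective multi-GPU libraries. Illustration only; all names
# are hypothetical and everything runs in one process.

def ring_allreduce(buffers):
    """Sum-reduce equal-length buffers across simulated ranks so that
    every rank ends with the full elementwise sum.

    Classic two-phase ring algorithm: a reduce-scatter of n-1 steps,
    then an all-gather of n-1 steps, moving one chunk per rank per step."""
    n = len(buffers)                       # number of simulated ranks
    chunks = [list(b) for b in buffers]    # working copy per rank
    size = len(chunks[0])
    assert size % n == 0, "buffer length must divide evenly into n chunks"
    c = size // n                          # elements per chunk

    # Phase 1: reduce-scatter. At step s, rank r sends its chunk
    # (r - s) mod n to rank (r + 1) mod n, which accumulates it.
    for s in range(n - 1):
        sends = []                         # collect all sends first to
        for r in range(n):                 # simulate a simultaneous exchange
            start = ((r - s) % n) * c
            sends.append(((r + 1) % n, start, chunks[r][start:start + c]))
        for dst, start, data in sends:
            for i, v in enumerate(data):
                chunks[dst][start + i] += v
    # Now rank r holds the fully reduced chunk (r + 1) mod n.

    # Phase 2: all-gather. At step s, rank r forwards its completed
    # chunk (r + 1 - s) mod n to rank (r + 1) mod n, which overwrites.
    for s in range(n - 1):
        sends = []
        for r in range(n):
            start = ((r + 1 - s) % n) * c
            sends.append(((r + 1) % n, start, chunks[r][start:start + c]))
        for dst, start, data in sends:
            chunks[dst][start:start + c] = data
    return chunks
```

With four simulated ranks each holding four elements, all four ranks finish with the same elementwise sum. The ring pattern keeps per-rank bandwidth constant as the rank count grows, which is why it is a common choice for multi-GPU collectives.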
A Flow-based Generative Network for Speech Synthesis
Ongoing research training transformer models at scale
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
Transformer-related optimizations, including BERT and GPT
NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.
Style transfer, deep learning, feature transform
The LLM vulnerability scanner
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Deep Learning GPU Training System
A library that uses hardware acceleration to load sequences of video frames to facilitate machine learning training
Deep Learning Experiment Management
NVIDIA Deep Learning Dataset Synthesizer (NDDS)
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
Fast and accurate object detection with end-to-end GPU optimization
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Deep learning for recommender systems
Synthesizing and manipulating 2048x1024 images with conditional GANs
A suite of image and video neural tokenizers
Context-Aware RAG library for Knowledge Graph ingestion and retrieval functions.
Unsupervised Language Modeling at scale for robust sentiment classification