NVIDIA Corporation

All

607 repositories

KAI-Scheduler
Public
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
Go
•
Apache License 2.0
•93•840•28•23•Updated Oct 8, 2025Oct 8, 2025
gpu-operator
Public
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
kubernetes gpu cuda nvidia
Go
•
Apache License 2.0
•390•2.3k•392•70•Updated Oct 8, 2025Oct 8, 2025
Megatron-LM
Public
Ongoing research training transformer models at scale
transformers model-para large-language-models
Python
•
Other
•3.1k•14k•305•110•Updated Oct 8, 2025Oct 8, 2025
k8s-nim-operator
Public
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
Go
•
Apache License 2.0
•32•129•5•28•Updated Oct 8, 2025Oct 8, 2025
JAX-Toolbox
Public
JAX-Toolbox
Python
•
Apache License 2.0
•67•347•80•44•Updated Oct 8, 2025Oct 8, 2025
tinylinux-scripts
Public
Scripts for building minimal Linux distribution for diagnostics
Shell
•
Other
•5•30•2•0•Updated Oct 8, 2025Oct 8, 2025
torch-harmonics
Public
Differentiable signal processing on the sphere for PyTorch
machine-learning signal-processing sphere pytorch
Jupyter Notebook
•
Other
•53•538•4•2•Updated Oct 8, 2025Oct 8, 2025
kvpress
Public
LLM KV cache compression made easy
python transformers inference pytorch kv-cache large-language-models llm long-context kv-cache-compression
Python
•
Apache License 2.0
•65•646•3•4•Updated Oct 8, 2025Oct 8, 2025
TensorRT-LLM
Public
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
cuda pytorch moe blackwell llm-serving
C++
•
Apache License 2.0
•1.8k•12k•719•384•Updated Oct 8, 2025Oct 8, 2025
k8s-driver-manager
Public
The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.
Go
•
Apache License 2.0
•17•38•2•4•Updated Oct 8, 2025Oct 8, 2025
vgpu-device-manager
Public
NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes
Go
•
Apache License 2.0
•22•142•0•12•Updated Oct 8, 2025Oct 8, 2025
tilus
Public
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
tile programming kernel cuda
Python
•
Apache License 2.0
•9•375•6•0•Updated Oct 8, 2025Oct 8, 2025
TensorRT-Model-Optimizer
Public
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
Python
•
Apache License 2.0
•169•1.4k•119•27•Updated Oct 8, 2025Oct 8, 2025
warp
Public
A Python framework for accelerated simulation, data generation and spatial computing.
python gpu cuda nvidia gpu-acceleration differentiable-programming nvidia-warp
Python
•
Apache License 2.0
•367•5.6k•259•9•Updated Oct 8, 2025Oct 8, 2025
grove
Public
Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
kubernetes gpu inference operator auto-scaling role-based grove multinode auto-scaling-group gang-scheduling
Go
•
Apache License 2.0
•16•67•8•12•Updated Oct 8, 2025Oct 8, 2025
spark-rapids-ml
Public
Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs
Jupyter Notebook
•
Apache License 2.0
•32•84•31•2•Updated Oct 8, 2025Oct 8, 2025
cccl
Public
CUDA Core Compute Libraries
cpp hpc gpu modern-cpp parallel-computing cuda nvidia gpu-acceleration cuda-kernels gpu-computing
C++
•
Other
•281•2k•1.1k•176•Updated Oct 8, 2025Oct 8, 2025
TransformerEngine
Public
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
python machine-learning deep-learning gpu cuda pytorch jax fp8
Python
•
Apache License 2.0
•513•2.8k•218•88•Updated Oct 8, 2025Oct 8, 2025
bionemo-framework
Public
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
machine-learning gpu pytorch drug-discovery
Jupyter Notebook
•87•531•54•86•Updated Oct 8, 2025Oct 8, 2025
cuda-quantum
Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
python cpp quantum quantum-computing hacktoberfest quantum-programming-language quantum-algorithms quantum-machine-learning unitaryhack
C++
•
Other
•290•815•404•85•Updated Oct 8, 2025Oct 8, 2025
cudaqx
Public
Accelerated libraries for quantum-classical computing built on CUDA-Q.
C++
•
Other
•32•60•23•11•Updated Oct 8, 2025Oct 8, 2025
cuopt
Public
GPU accelerated decision optimization
gpu optimization cuda linear-programming
Cuda
•
Apache License 2.0
•78•450•84•13•Updated Oct 8, 2025Oct 8, 2025
Fuser
Public
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
C++
•
Other
•69•357•192•192•Updated Oct 8, 2025Oct 8, 2025
NVFlare
Public
NVIDIA Federated Learning Application Runtime Environment
python decentralized pet privacy-protection federated-learning federated-analytics federated-computing
Python
•
Apache License 2.0
•213•803•12•17•Updated Oct 8, 2025Oct 8, 2025
cuda-python
Public
CUDA Python: Performance meets Productivity
Python
•
Other
•212•3k•167•14•Updated Oct 8, 2025Oct 8, 2025
cloud-native-docs
Public
Documentation repository for NVIDIA Cloud Native Technologies
kubernetes containers kubernetes-operator
PowerShell
•
Apache License 2.0
•29•29•5•19•Updated Oct 8, 2025Oct 8, 2025
nvkind
Public
Go
•
Apache License 2.0
•22•171•9•11•Updated Oct 8, 2025Oct 8, 2025
spark-rapids-jni
Public
RAPIDS Accelerator JNI For Apache Spark
Cuda
•
Apache License 2.0
•74•51•79•5•Updated Oct 8, 2025Oct 8, 2025
spark-rapids
Public
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
big-data spark gpu rapids
Scala
•
Apache License 2.0
•259•932•1.7k•20•Updated Oct 8, 2025Oct 8, 2025
NeMo-Agent-Toolkit
Public
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
Python
•
Apache License 2.0
•378•1.4k•57•17•Updated Oct 8, 2025Oct 8, 2025