NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU accelerated decision optimization

    Cuda · 561 stars · 92 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook · 381 stars · 58 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 16.4k stars · 1.5k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.6k stars · 224 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 3.8k stars · 435 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook · 3.6k stars · 911 forks

Repositories

Showing 10 of 627 repositories
  • tilus Public

    Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

    Python · 401 stars · Apache-2.0 · 9 forks · 7 issues · 1 pull request · Updated Nov 18, 2025
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    C++ · 12,169 stars · 1,876 forks · 672 issues · 440 pull requests · Updated Nov 18, 2025
  • NVFlare Public

    NVIDIA Federated Learning Application Runtime Environment

    Python · 832 stars · Apache-2.0 · 222 forks · 14 issues · 17 pull requests · Updated Nov 18, 2025
  • cccl Public

    CUDA Core Compute Libraries

    C++ · 2,033 stars · 294 forks · 1,133 issues (5 need help) · 192 pull requests · Updated Nov 18, 2025
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    C++ · 851 stars · 303 forks · 415 issues (16 need help) · 82 pull requests · Updated Nov 18, 2025
  • numba-cuda Public

    The CUDA target for Numba

    Python · 211 stars · BSD-2-Clause · 44 forks · 96 issues (1 needs help) · 23 pull requests · Updated Nov 18, 2025
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    Python · 14,229 stars · 3,285 forks · 313 issues · 203 pull requests · Updated Nov 18, 2025
  • TensorRT-Incubator Public

    Experimental projects related to TensorRT

    MLIR · 114 stars · 19 forks · 37 issues (1 needs help) · 17 pull requests · Updated Nov 18, 2025
  • TransformerEngine Public

    A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

    Python · 2,924 stars · Apache-2.0 · 546 forks · 237 issues · 96 pull requests · Updated Nov 18, 2025
  • physicsnemo Public

    Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

    Python · 2,059 stars · Apache-2.0 · 485 forks · 38 issues · 35 pull requests · Updated Nov 18, 2025