Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 935 192

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 460 80

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 17.1k 1.7k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.9k 262

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 4.4k 536

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 4.1k 1.1k

Repositories

Showing 10 of 745 repositories
  • infra-controller Public

    NVIDIA Infra Controller - Hardware Lifecycle Management and multitenant networking

    NVIDIA/infra-controller’s past year of commit activity
    Rust 189 Apache-2.0 114 294 (4 issues need help) 56 Updated Jun 8, 2026
  • aicr Public

    Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

    NVIDIA/aicr’s past year of commit activity
    Go 325 Apache-2.0 57 57 3 Updated Jun 8, 2026
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 16,629 4,048 352 491 Updated Jun 8, 2026
  • warp Public

    A Python framework for GPU-accelerated simulation, robotics, and machine learning.

    NVIDIA/warp’s past year of commit activity
    Python 6,736 Apache-2.0 522 213 11 Updated Jun 8, 2026
  • cccl Public

    CUDA Core Compute Libraries

    NVIDIA/cccl’s past year of commit activity
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    NVIDIA/Model-Optimizer’s past year of commit activity
    Python 2,890 Apache-2.0 431 58 175 Updated Jun 8, 2026
  • NeMo-Retriever Public

    NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever Library uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.

    NVIDIA/NeMo-Retriever’s past year of commit activity
    Python 2,936 Apache-2.0 324 119 (1 issue needs help) 78 Updated Jun 8, 2026
  • cuda-python Public

    CUDA Python: Performance meets Productivity

    NVIDIA/cuda-python’s past year of commit activity
    Cython 3,283 296 184 35 Updated Jun 8, 2026
  • srt-slurm-recipes Public

    Official NVIDIA/srt-slurm sweep configs for benchmarking LLMs across NVIDIA GPUs and frameworks, spanning aggregated and disaggregated serving on single- and multi-node setups.

    NVIDIA/srt-slurm-recipes’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Jun 8, 2026
  • NeMo-Agent-Toolkit Public

    The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

    NVIDIA/NeMo-Agent-Toolkit’s past year of commit activity
    Python 2,389 Apache-2.0 670 26 19 Updated Jun 8, 2026