NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU accelerated decision optimization

    Cuda 769 147

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 424 72

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.8k 1.6k

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go 1.8k 243

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 4.2k 494

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.9k 1k

Repositories

Showing 10 of 704 repositories
  • cloudai Public

    CloudAI Benchmark Framework

    Python 89 Apache-2.0 44 7 7 Updated Mar 20, 2026
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    Python 15,743 3,719 323 (1 issue needs help) 328 Updated Mar 20, 2026
  • cccl Public

    CUDA Core Compute Libraries

    C++ 2,227 361 1,281 (6 issues need help) 224 Updated Mar 20, 2026
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    Python 2,212 Apache-2.0 308 70 118 Updated Mar 20, 2026
  • ncx-infra-controller-rest Public

    NCX Infra Controller - Hardware Lifecycle Management (REST API)

    Go 30 Apache-2.0 27 17 15 Updated Mar 20, 2026
  • k8s-test-infra Public

    K8s-test-infra

    Go 13 Apache-2.0 11 16 0 Updated Mar 20, 2026
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    Python 13,152 2,202 547 593 Updated Mar 20, 2026
  • cutile-python Public

    cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

    Python 1,979 126 20 4 Updated Mar 20, 2026
  • DALI Public

    A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

    C++ 5,646 Apache-2.0 660 206 (26 issues need help) 34 Updated Mar 20, 2026
  • OSMO Public

    The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML

    TypeScript 114 Apache-2.0 20 66 5 Updated Mar 20, 2026