NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU accelerated decision optimization

    Cuda · 747 stars · 139 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook · 416 stars · 69 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 16.8k stars · 1.6k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.8k stars · 240 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 4.1k stars · 489 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook · 3.8k stars · 992 forks

Repositories

Showing 10 of 687 repositories
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    Python · 13,066 stars · 2,163 forks · 539 issues · 554 pull requests · Updated Mar 12, 2026
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    Python · 15,600 stars · 3,679 forks · 314 issues (1 needs help) · 316 pull requests · Updated Mar 12, 2026
  • NV-Kernels Public

    Ubuntu kernels which are optimized for NVIDIA server systems

    93 stars · 58 forks · 0 issues · 13 pull requests · Updated Mar 12, 2026
  • tilus Public

    Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

    Python · 447 stars · Apache-2.0 · 15 forks · 8 issues · 1 pull request · Updated Mar 12, 2026
  • numbast Public

    Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.

    Python · 57 stars · Apache-2.0 · 19 forks · 28 issues (3 need help) · 11 pull requests · Updated Mar 12, 2026
  • stdexec Public

    `std::execution`, the proposed C++ framework for asynchronous and parallel programming.

    C++ · 2,267 stars · Apache-2.0 · 231 forks · 127 issues · 15 pull requests · Updated Mar 12, 2026
  • cuopt Public

    GPU accelerated decision optimization

    Cuda · 747 stars · Apache-2.0 · 139 forks · 91 issues (4 need help) · 27 pull requests · Updated Mar 12, 2026
  • recsys-examples Public

    Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

    Python · 225 stars · 49 forks · 44 issues · 13 pull requests · Updated Mar 12, 2026
  • bionemo-framework Public

    BioNeMo Framework: For building and adapting AI models in drug discovery at scale

    Jupyter Notebook · 679 stars · 126 forks · 35 issues · 131 pull requests · Updated Mar 12, 2026
  • cccl Public

    CUDA Core Compute Libraries

    C++ · 2,207 stars · 356 forks · 1,280 issues (6 need help) · 213 pull requests · Updated Mar 12, 2026