@gpustack

GPUStack

GPU cluster manager for optimized AI model deployment

Pinned

  1. gpustack Public

    Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang.

    Python, 4.6k stars, 470 forks

  2. runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    Dockerfile, 9 stars, 9 forks

  3. runtime Public

    Provides a unified interface to detect GPU resources and manages GPU workloads.

    Python, 11 stars, 13 forks

  4. gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go, 249 stars, 24 forks

  5. vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python, 199 stars, 32 forks

Repositories

Showing 10 of 15 repositories
  • runtime Public

    Provides a unified interface to detect GPU resources and manages GPU workloads.

    Python, 11 stars, Apache-2.0, 13 forks, 0 open issues, 3 open pull requests, updated Mar 12, 2026
  • gpustack-ui Public
    TypeScript, 74 stars, Apache-2.0, 54 forks, 2 open issues, 5 open pull requests, updated Mar 12, 2026
  • runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    Dockerfile, 9 stars, Apache-2.0, 9 forks, 0 open issues, 0 open pull requests, updated Mar 12, 2026
  • gpustack Public

    Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang.

    Python, 4,627 stars, Apache-2.0, 470 forks, 480 open issues, 30 open pull requests, updated Mar 11, 2026
  • gpustack.github.io
    HTML, 1 star, 2 forks, 0 open issues, 0 open pull requests, updated Mar 7, 2026
  • benchmark-runner
    Python, 2 stars, Apache-2.0, 2 forks, 1 open issue, 0 open pull requests, updated Mar 6, 2026
  • gpustack-higress-plugin
    Go, 1 star, 2 forks, 0 open issues, 0 open pull requests, updated Feb 28, 2026
  • community-inference-backends Public

    Community Inference Backends for GPUStack V2

    Python, 8 stars, Apache-2.0, 7 forks, 0 open issues, 1 open pull request, updated Feb 13, 2026
  • gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go, 249 stars, MIT, 24 forks, 0 open issues, 0 open pull requests, updated Feb 11, 2026
  • .github Public

    Meta-GitHub repository for all GPUStack repositories.

    1 star, Apache-2.0, 4 forks, 0 open issues, 0 open pull requests, updated Feb 4, 2026
