Skip to content
@sgl-project

sgl-project

Pinned Loading

  1. sglang sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Python 23.7k 4.5k

  2. sgl-learning-materials sgl-learning-materials Public

    Materials for learning SGLang

    751 57

  3. ome ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    Go 379 61

  4. genai-bench genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    Python 271 49

  5. SpecForge SpecForge Public

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    Python 698 157

  6. sglang-jax sglang-jax Public

    JAX backend for SGL

    Python 241 71

Repositories

Showing 10 of 23 repositories
  • sgl-docs Public
    sgl-project/sgl-docs’s past year of commit activity
    MDX 3 Apache-2.0 14 0 2 Updated Feb 22, 2026
  • sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    sgl-project/sglang’s past year of commit activity
    Python 23,653 Apache-2.0 4,517 580 (29 issues need help) 1,640 Updated Feb 22, 2026
  • sgl-project.github.io Public

    This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang

    sgl-project/sgl-project.github.io’s past year of commit activity
    HTML 107 26 10 1 Updated Feb 22, 2026
  • whl Public

    Kernel Library Wheel for SGLang

    sgl-project/whl’s past year of commit activity
    HTML 16 MIT 7 1 1 Updated Feb 22, 2026
  • mini-sglang Public

    A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

    sgl-project/mini-sglang’s past year of commit activity
    Python 3,526 MIT 446 7 12 Updated Feb 22, 2026
  • ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    sgl-project/ome’s past year of commit activity
    Go 379 Apache-2.0 61 33 (2 issues need help) 44 Updated Feb 21, 2026
  • sgl-flash-attn Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    sgl-project/sgl-flash-attn’s past year of commit activity
    Python 18 BSD-3-Clause 2,417 0 0 Updated Feb 20, 2026
  • genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    sgl-project/genai-bench’s past year of commit activity
    Python 271 MIT 49 6 17 Updated Feb 20, 2026
  • sglang-jax Public

    JAX backend for SGL

    sgl-project/sglang-jax’s past year of commit activity
    Python 241 Apache-2.0 71 94 (8 issues need help) 29 Updated Feb 18, 2026
  • rbg Public

    A workload for deploying LLM inference services on Kubernetes

    sgl-project/rbg’s past year of commit activity
    Go 170 Apache-2.0 42 19 13 Updated Feb 18, 2026