Skip to content
View yafengio's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Block or report yafengio

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yafengio/README.md

Hello, I'm yafengio 👋

Pinned Loading

  1. sgl-project/ome sgl-project/ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    Go 404 66

  2. dynamo dynamo Public

    Forked from ai-dynamo/dynamo

    A Datacenter Scale Distributed Inference Serving Framework

    Rust

  3. lws lws Public

    Forked from kubernetes-sigs/lws

    LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

    Go

  4. Mooncake Mooncake Public

    Forked from kvcache-ai/Mooncake

    Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

    C++

  5. sglang sglang Public

    Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python

  6. torchada torchada Public

    Forked from MooreThreads/torchada

    Adapter package for torch_musa to act exactly like PyTorch CUDA

    Python