Skip to content
View liyorozuya's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report liyorozuya

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. duckdb duckdb Public

    Forked from duckdb/duckdb

    DuckDB is an analytical in-process SQL database management system

    C++

  2. llama.cpp llama.cpp Public

    Forked from ggml-org/llama.cpp

    LLM inference in C/C++

    C++

  3. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  4. vllm-mlx vllm-mlx Public

    Forked from waybarrios/vllm-mlx

    OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX …

    Python

  5. petals petals Public

    Forked from bigscience-workshop/petals

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Python