NVMeVirt: A Versatile Software-defined Virtual NVMe Device

C 292 94 Updated Dec 23, 2025

ByteCheckpoint: A Unified Checkpointing Library for LFMs

Python 272 19 Updated Feb 2, 2026

Persist and reuse KV Cache to speed up your LLM.

Python 267 69 Updated Mar 23, 2026

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 3,024 233 Updated Feb 9, 2026

Kubernetes Operator, Helm Charts, Ansible Playbooks, and utility scripts for large-scale AIStore deployments on Kubernetes.

Go 130 29 Updated Mar 20, 2026

Large language model fine-tuning capabilities based on cloud native and distributed computing.

Go 92 18 Updated Feb 22, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,402 557 Updated Oct 19, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,620 370 Updated Feb 27, 2025

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"

Python 3,926 298 Updated Nov 25, 2024

Lustre Monitoring System based on Collectd, Grafana and InfluxDB

Python 46 16 Updated Dec 12, 2023

This repository is established to store personal notes and annotated papers during daily research.

187 16 Updated Mar 20, 2026

Delivers efficient, stable, and secure data distribution and acceleration powered by P2P technology, with an optional content‑addressable filesystem that accelerates OCI container launch.

Go 3,091 382 Updated Mar 20, 2026

Pretrain, finetune and serve LLMs on Intel platforms with Ray

Python 130 36 Updated Sep 23, 2025

🏭 ↔️ 👥 Digital Twin as a Service

TypeScript 152 78 Updated Mar 20, 2026

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,010 102 Updated Jul 29, 2024

Python 168 54 Updated Feb 22, 2024

Really fast sync tool for S3

Go 564 78 Updated Dec 5, 2025

The Fastest Distributed Database for Transactional, Analytical, and AI Workloads.

C++ 10,036 1,884 Updated Mar 23, 2026

Training and serving large-scale neural networks with auto parallelization.

Python 3,188 362 Updated Dec 9, 2023

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,664 608 Updated Jul 25, 2023

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Python 7,596 536 Updated Jul 10, 2024

Piranha: A GPU Platform for Secure Computation

C++ 104 33 Updated Apr 2, 2023

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,962 3,693 Updated Mar 23, 2026

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 37,608 6,184 Updated Nov 10, 2025

UnifyFS: A file system for burst buffers

C 121 33 Updated Sep 29, 2025

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,357 183 Updated Mar 12, 2026

AIStore: scalable storage for AI applications

Go 1,794 243 Updated Mar 23, 2026