Skip to content
View ArrowLuo's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ArrowLuo

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 9,778 723 Updated Mar 27, 2026

[CVPR 2026] DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution

JavaScript 17 Updated Mar 23, 2026

LucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer, ICLR 2026

Python 1,188 102 Updated Mar 25, 2026
Python 20 2 Updated Jan 30, 2026

Repository of AudioX

Python 1,458 137 Updated Mar 10, 2026

Krea Realtime 14B. An open-source realtime AI video model.

Python 516 33 Updated Nov 13, 2025

The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

Python 8,266 934 Updated Mar 27, 2026

The official UniVerse-1 code.

Python 123 11 Updated Oct 13, 2025

Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForcing is the first framework to distill bidirectional audio-visual diffusion mo…

107 Updated Mar 16, 2026

WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild

Python 464 40 Updated Mar 22, 2026

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 8,546 921 Updated Mar 26, 2026
Python 327 22 Updated Mar 25, 2026

[ICLR 2026] This is the official PyTorch implementation of "QVGen: Pushing the Limit of Quantized Video Generative Models".

Python 27 Updated Feb 11, 2026

Open Multi-Agent Interactive Classroom — Get an immersive, multi-agent learning experience in just one click

TypeScript 12,603 1,968 Updated Mar 27, 2026

Helios: Real Real-Time Long Video Generation Model

Python 1,549 116 Updated Mar 26, 2026

run agents that work for you in the background based on what you do

Rust 17,598 1,513 Updated Mar 27, 2026

AI agents running research on single-GPU nanochat training automatically

Python 58,023 8,052 Updated Mar 26, 2026

Public repository for Agent Skills

Python 104,317 11,491 Updated Mar 25, 2026

Official repository for “PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss”

Python 229 12 Updated Feb 3, 2026

FiDeSR: High-Fidelity and Detail-Preserving One-Step Diffusion Super-Resolution

21 Updated Mar 5, 2026

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

Python 33,275 2,670 Updated Mar 26, 2026

首家工业级全流程 AI 影视生产平台。Industry-first professional AI Agent platform for controllable film & video production. From shorts to live-action with Hollywood-standard workflows.

TypeScript 10,513 2,331 Updated Mar 23, 2026

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention

Python 292 18 Updated Feb 24, 2026

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 5,292 798 Updated Mar 11, 2026

Unified automatic quality assessment for speech, music, and sound.

Python 697 50 Updated Jun 5, 2025

Moonshot's most powerful model

1,577 170 Updated Jan 31, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 49,675 5,930 Updated Mar 27, 2026

Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)

Python 116 9 Updated Sep 15, 2025
Python 2,003 234 Updated Feb 26, 2026
Next