Skip to content
View wtjiang98's full-sized avatar
🐟
Touching Fish
🐟
Touching Fish

Block or report wtjiang98

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,111 3,479 Updated Mar 22, 2026

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,385 1,327 Updated Jul 9, 2025

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

1,034 40 Updated Mar 15, 2026

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,498 95 Updated Sep 11, 2025

🌟 Wiki of OI / ICPC for everyone. (某大型游戏线上攻略,内含炫酷算术魔法)

TypeScript 25,727 4,609 Updated Mar 22, 2026

Sora 视频无水印链接提取器

Python 309 90 Updated Nov 14, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,746 165 Updated Mar 22, 2026

[NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Jupyter Notebook 202 7 Updated Mar 8, 2026

A curated list of papers on reinforcement learning for video generation

425 1 Updated Mar 22, 2026
Python 10,609 707 Updated Feb 9, 2026

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 732 56 Updated Mar 6, 2026

Official code for "VideoReward Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning"

Python 45 1 Updated Oct 20, 2025

(arXiv) MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Python 1,113 48 Updated Feb 26, 2026

[ICLR 2026] EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Python 224 6 Updated Mar 20, 2026
Python 1,799 80 Updated Dec 16, 2025

Contexts Optical Compression

Python 22,739 2,091 Updated Jan 27, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,754 363 Updated Mar 10, 2026

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 1,267 44 Updated Jan 1, 2026

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,928 150 Updated Feb 3, 2026

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,126 103 Updated Feb 26, 2026

An official implementation of Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching

Python 69 5 Updated Sep 11, 2025

HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation​

Python 671 54 Updated Oct 14, 2025

Pusa: Thousands Timesteps Video Diffusion Model

Python 675 47 Updated Feb 13, 2026

Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Python 253 11 Updated Feb 10, 2026

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,709 672 Updated Mar 22, 2026
Python 1,749 248 Updated Mar 6, 2026

Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

911 112 Updated Aug 27, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,597 463 Updated Feb 10, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,781 1,791 Updated Mar 17, 2026
Python 2,499 241 Updated Jul 16, 2025
Next