Zihan Wang ZihanWang314

🏠

Working from home

PhD student at Northwestern University. Previously at @deepseek-ai, @uiucnlp, and Renmin University.

328 followers · 19 following

ZihanWang314/README.md

Hi there 👋 I am Zihan Wang.

Pinned

mll-lab-nu/RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python · 2.6k stars · 210 forks

deepseek-ai/ESFT

Expert Specialized Fine-Tuning

Python · 732 stars · 261 forks

CoE

Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models

Python · 227 stars · 27 forks

xingyaoww/mint-bench

Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and …

Python · 133 stars · 8 forks

mll-lab-nu/TStar

TStar is a unified temporal search framework for long-form video question answering

Python · 93 stars · 6 forks

mll-lab-nu/VAGEN

Training VLM agents with multi-turn reinforcement learning

Python · 433 stars · 50 forks