yang-su2000/README.md

Welcome to Yang's GitHub

Hi there! I lead exploratory work on fundamental agentic research at the Qwen Team.

Some topics I am interested in include:

  • code/tool-centric agentic RL (collaborative/adversarial training)
  • context/memory RL (online learning)
  • environment alignment (self-evolving and scaling algorithms)

I am happy to chat and discuss potential collaborations. Feel free to reach out via

Linkedin Twitter Gmail WeChat

🌟 Studying Zone

(2025.06) My thoughts on agentic RL training.

(2024.09-) I joined Qwen Team as a researcher 🥝!

(2024.01-) I am collaborating part-time with Cornell ICPC and Millennium to build LLMs for code and data generation.

  • This work is called ALICE (Aligning Language models for Interactive Code Execution); find out more at alicellm.github.io.
  • ALICE is a meta-agent collaboration system that generates high-quality data through multi-turn interactions and feedback without human intervention.
  • It produces multimodal data with traces from agent strategies like ReAct and Reflexion, which are scarce but offer potential for aligning advanced LLMs.

(2023-2024) I led Voice2Action, the predecessor of ALICE, with Cornell XRC: a Unity package for real-time code execution in VR. I also studied large-scale generation-augmented retrieval systems (as opposed to RAG) at Cornell NLP.

(2021-2022) I interned at AWS AI Lab, working on graph machine learning, and contributed to the open-source Deep Graph Library.

👀 Chilling Zone

I like programming! I lead the "Cornell Tech" Group at Cornell ICPC, and we placed in the top 20% at the 2023 Regional!

I enjoy cooking, listening to music of all forms, playing ping-pong, reading science fiction, and more!

LeetCode CodeForces Visitors

⚡ Developing Zone

Pinned

  1. Voice2Action (Public)

    ALICE and its prior work, Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in Virtual Reality

    C# · 41 stars · 6 forks

  2. meituan-longcat/vitabench (Public)

    VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

    Python · 68 stars · 7 forks

  3. boson-ai/RPBench-Auto (Public)

    An automated pipeline for evaluating LLMs for role-playing.

    Python · 201 stars · 10 forks

  4. luyug/GradCache (Public)

    Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

    Python · 419 stars · 26 forks

  5. Authorship-Identification-with-NLP (Public)

    Large-scale user portrait ranking and generation-augmented retrieval systems.

    Jupyter Notebook · 6 stars · 1 fork

  6. dmlc/dgl (Public)

    Python package built to ease deep learning on graph, on top of existing DL frameworks.

    Python · 14.2k stars · 3.1k forks