  • Southeast University
  • Nanjing, Jiangsu

ForJadeForest/README.md

Hi there, I'm Yingzhe Peng 👋

🎓 About Me

I'm a Master's student at Southeast University, School of Computer Science, with a strong background in AI research. My research focuses on multimodal reasoning models, in-context learning, and human-computer interaction systems.

  • 🔭 I'm currently working as an Algorithm Researcher Intern at Ant Group
  • 🌱 I'm exploring multimodal reasoning models, using techniques such as process reward models (PRM) and rule-based RL
  • 👯 I'm collaborating with OpenRLHF to develop multimodal RL frameworks
  • 📫 How to reach me: yingzhepeng@foxmail.com

🚀 Research & Projects

  • LMM-R1: A high-performance rule-based RL framework for multimodal models (400+ stars)
  • LIVE: Learnable In-Context Vector for Visual Question Answering (NeurIPS 2024)
  • Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models (NeurIPS 2024)
  • Chat-Based Collaborative Interface: For Personalized Exploratory Tasks (IUI 2025)

💼 Experience

  • Algorithm Researcher Intern, Ant Group (2024.12 - Present)
  • Algorithm Researcher Intern, Microsoft (DKI Group) (2024.07 - 2024.12)
  • User Safety Algorithm Engineer Intern, ByteDance (Douyin) (2023.01 - 2023.08)
  • AI Engineer Intern, Intel (2022.05 - 2023.01)

🛠 Skills

  • Programming: Python, PyTorch, TensorFlow
  • AI/ML: Multimodal Learning, Reinforcement Learning, LLMs, VLMs
  • Research Areas: In-Context Learning, Multimodal Reasoning, Human-Computer Interaction

📝 Latest Publications


⭐️ From ForJadeForest

Pinned

  1. TideDra/lmm-r1 (Public)

    Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

    Python · 829 stars · 54 forks

  2. LIVE-Learnable-In-Context-Vector (Public)

    [NeurIPS 2024] The implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185

    Python · 22 stars · 3 forks

  3. Lever-LM (Public)

    The code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models

    Python · 16 stars · 2 forks

  4. ImageSearchLightningCLIP (Public)

    Deploying a distilled CLIP model on Android devices

    Java · 20 stars · 4 forks