Skip to content
View chenllliang's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@pkunlp-icler @StarsfieldAI

Block or report chenllliang

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
chenllliang/README.md

Hi there 👋

  • I am Liang Chen (陈亮), currently a fourth-year PhD student at the school of CS, Peking University. I study with the guidance of Prof. Baobao Chang. Currently, I am interested in building agent in realworld application such as Computer Use, Browser Use and SWE.
  • Google Scholar
  • Homepage

Feel free to drop an email if you are interested in connecting.

Pinned Loading

  1. UniPat-AI/BabyVision UniPat-AI/BabyVision Public

    We introduce BabyVision, a benchmark revealing the infancy of AI vision.

    Python 201 7

  2. StarsfieldAI/R1-V StarsfieldAI/R1-V Public

    Witness the aha moment of VLM with less than $3.

    Python 4k 286

  3. pkunlp-icler/FastV pkunlp-icler/FastV Public

    [ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

    Python 562 27

  4. MoonshotAI/Kimi-VL MoonshotAI/Kimi-VL Public

    Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

    1.2k 74

  5. LMM101/Awesome-Multimodal-Next-Token-Prediction LMM101/Awesome-Multimodal-Next-Token-Prediction Public

    [Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

    477 13

  6. DreamEngine DreamEngine Public

    Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!

    Python 122 4