chenllliang

Follow

🎯

Focusing

Liang Chen chenllliang

🎯

Focusing

Follow

AI is Cool Stuff!

270 followers · 71 following

UniPat AI
Bejing, China
23:50 (UTC -12:00)
https://chenllliang.github.io
@liangchen5518

Achievements

Achievements

Highlights

Pro

Organizations

chenllliang/README.md

Hi there 👋

I am Liang Chen (陈亮), currently a fourth-year PhD student at the school of CS, Peking University. I study with the guidance of Prof. Baobao Chang. Currently, I am interested in building agent in realworld application such as Computer Use, Browser Use and SWE.
Google Scholar
Homepage

Feel free to drop an email if you are interested in connecting.

Pinned Loading

UniPat-AI/BabyVision UniPat-AI/BabyVision Public

We introduce BabyVision, a benchmark revealing the infancy of AI vision.

Python 201 7
StarsfieldAI/R1-V StarsfieldAI/R1-V Public

Witness the aha moment of VLM with less than $3.

Python 4k 286
pkunlp-icler/FastV pkunlp-icler/FastV Public

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 562 27
MoonshotAI/Kimi-VL MoonshotAI/Kimi-VL Public

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1.2k 74
LMM101/Awesome-Multimodal-Next-Token-Prediction LMM101/Awesome-Multimodal-Next-Token-Prediction Public

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

477 13
DreamEngine DreamEngine Public

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!

Python 122 4