Vision & Goals
Win-Win Community in the Agent Era
True AGI is still a decade away, and agents are the essential path toward it. Seize this wave to stay relevant.
Through deep practice, knowledge sharing, and project collaboration, break down technical barriers and transform them into personal advantages and real value.
Empower everyone, win the Agent decade.
What We Offer
- Systematic learning paths and practice manuals
- One-on-one guidance from technical mentors and industry experts
- Paper notes, cutting-edge talks, case co-creation
- Portfolio polishing, resume & interview coaching, referrals
Advanced Play
Deep Co-creation · Papers/Projects/Career Full Chain
If you want to get started with LLM Agents
- Learning paths and open-source repo recommendations
- Intro courses + hands-on projects
- Like-minded exchange community
If you want further collaboration / papers / career
- Paper collaboration and experiment co-building
- Industry landing project cooperation
- Big tech job referrals and interview coaching
If you seek partnerships
- Co-build community brand
- Joint promotion and events
- AI product and training guidance
Learning & Exploration
Carefully curated learning resources to help you quickly master core Agent technologies
Advanced Path
Community Projects / Papers
Idea2Story
An Agent framework that automatically turns ideas into paper narratives at the level of top-tier conferences. It is trained on tens of thousands of top-conference papers and their review data, teaching AI to master "Scientific Storytelling".
Talks & Roundtables · Paper Reading

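From Depth Scaling to Width Scaling! WideSeek-R1: Exploring Width Scaling of Large Models via Multi-Agent RL
The success of DeepSeek-R1 shows that Depth Scaling has enormous potential for complex logical reasoning. But when the task shifts from "deep reasoning" to "broad information", such as aggregating multi-dimensional financial data for the world's leading tech companies, a single large model is often limited by the context interference that multi-turn retrieval introduces and by a serial efficiency bottleneck. To address this, we propose "Width Scaling" …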

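Some of the Thinking Behind the Attention Residual Architecture, from an Inference Architecture Perspective
Author: YyWangCS https://zhuanlan.zhihu.com/p/2017528295286133070 Preface: As a member of the AI Infra team at Moonshot AI (月之暗面), I want to approach this article from the AI Infra side, and inference architecture in particular (on training, a colleague of ours has written a very good answer that I recommend reading, so I will not cover it here), and talk about …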

Vibe Coding & Agent Evolved Meetup
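When top geeks meet AI, code starts to have a "feel". Stop Writing Code. Start Designing Vibe. Stop hammering mechanically at the keyboard and start designing your creative flow. Are you still tearing your hair out over Rust's ownership model? Do you think building an industry-grade Agent requires assembling a team of experts? Do you believe that one weekend is enough to …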

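In-Depth Dialogue! The 2025 "青稞" (Qingke) AI Carnival: Exploring Defining Moments in AI Technology with 20+ Young Scientists
This event is designed for young scientists and aims to host an in-depth conversation on AI technology. More than 20 young scientists from academia and industry will join us to look back on 2025 and look ahead to 2026!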

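TRPO Reborn: Trust Region Policy Optimization in the Era of Large Models
In the reinforcement learning stage of large language models, especially RLHF, we aim for continual policy improvement. This talk takes an in-depth look at applying TRPO in the LLM era.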

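From π_0 to π_RL: A Reinforcement Learning Post-Training Framework for Flow-Matching VLAs
An in-depth look at π_RL, a reinforcement learning post-training framework for flow-matching VLAs, exploring the frontier of embodied intelligence.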

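RLinf: An Open-Source Reinforcement Learning Training Framework for Embodied Intelligence with Integrated Rendering, Training, and Inference
RLinf is an open-source reinforcement learning training framework that integrates rendering, training, and inference to accelerate embodied-intelligence R&D.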

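RLinf-VLA in Practice: Getting Started with VLA (OpenVLA) Reinforcement Learning from Scratch
A step-by-step guide to using the RLinf-VLA framework for OpenVLA reinforcement learning, an entry point to embodied-intelligence development.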

Join Us
Foundation Mastery
LLM/Multimodal fundamentals, code skills enhancement and engineering standards
Agent Architecture
Planning/Memory/Tool calling and evaluation, real business case breakdown
Project Co-creation
Hands-on project teaming, mentor Q&A and code review
Career Leap
Portfolio polishing, interview workshops, mentor recommendations and referrals
Deep Practice + Mentor Q&A + Project Co-creation
Portfolio polishing, code review, weekly retrospectives, referral recommendations. Limited spots per cohort to ensure interaction quality.
Partnership & Consultation
WeChat Official Account: AgentAlpha
For community co-building, promotion partnerships, AI products, or training guidance, or if you need paper, project, or career support, scan the QR code to follow the official account for more information.



