Skip to content
View alphadl's full-sized avatar
🎯
hiring @ alibaba https://liamding.cc/hiring.html
🎯
hiring @ alibaba https://liamding.cc/hiring.html

Highlights

  • Pro

Block or report alphadl

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
alphadl/README.md

Hi there

πŸ™‹β€β™‚οΈ I am building an agentic AI econsystem at Alibaba. I was the chief scientist at a startup (raised more than 50M$), previously worked at JD Explore Academy and Tencent AI Lab, and held an adjunct researcher position at ZJU.

πŸ”­ Working on the whole pipeline of LLM R&D and their human-centric applications, including efficient and sufficient training, alignment, evaluations, compression, multilinguality, multimodality, agentic application, and much more.

πŸ’ͺ I'm keen on bodybuilding (5 years+), marathon (completed first half marathon (126min) in Beijing-2016 and most recent half marathon (86min) in Sydney-2019πŸ˜…. will resume training in 2024πŸ’ͺ🏻).

πŸ₯— I (onceπŸ˜…) enjoy cooking.

🐈 I like to spend Sundays with my cats (two from 2020-2023, one from 2023).

πŸ”₯ Recent open-source projects on agentic AI, together covering data generation, reuse, evaluation, and context efficiency:

  • πŸ”„ AgentHER Hindsight relabeling of failed trajectories for training.
  • 🧬 AgentSynth Synthetic agent data from scratch with execution validation.
  • πŸ“ AdaRubric Dynamic rubric evaluation for trajectory quality.
  • πŸ—œοΈ trajectory_tokenization ReAct with compressed history for long-horizon context.

Pinned Loading

  1. THUNLP-MT/MT-Reading-List THUNLP-MT/MT-Reading-List Public

    A machine translation reading list maintained by Tsinghua Natural Language Processing Group

    TeX 2.4k 440

  2. lookahead.pytorch lookahead.pytorch Public

    lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch

    Python 338 64

  3. AgentHER AgentHER Public

    AgentHER: Hindsight Experience Replay for LLM Agents

    Python 4

  4. AgentSynth AgentSynth Public

    AgentSynth: Industrial-Grade Agent Data Synthesis Pipeline

    Python 2

  5. AdaRubrics AdaRubrics Public

    AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories

    Python 3

  6. darts.pytorch1.1 darts.pytorch1.1 Public

    Implementation with latest PyTorch (v1.1) for multi-gpu differentiable architecture search https://arxiv.org/abs/1806.09055

    Python 84 29