Skip to content
View duoan's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report duoan

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
duoan/README.md

πŸ‘‹ Hi, I'm Duo(Victor) An

Senior Machine Learning Engineer @ Amazon AGI
Scaling multimodal foundation models β€” optimizing how they learn, generalize, and align through data, model and systems co-design.


🧭 About

I work at the intersection of machine learning and distributed systems,
designing large-scale learning pipelines and multimodal data systems that improve how foundation models learn from vast, diverse signals.

My focus areas:

  • 🧠 Training dynamics & optimization β€” improving convergence, stability, and efficiency of large-scale multimodal models
  • 🧩 Learning-centric systems β€” integrating data, architecture, and feedback to enhance representation learning and model alignment
  • βš™οΈ Scalable orchestration β€” leveraging Ray, Spark, and Kubernetes to parallelize multimodal workloads across thousands of GPUs
  • πŸ” Evaluation & feedback loops β€” automating model-driven data refinement and continual quality signals for alignment and adaptation

My work centers on how models learn, not just how they’re trained.


🧰 Core Stack

Machine Learning

PyTorch Transformers DeepSpeed Megtron-LM Ray Spark

Systems & Infra

Amazon AWS Kubernetes CDK Docker

Languages

Python Scala Rust C++


βš–οΈ Principles

1. Models and systems co-evolve.
The best architectures emerge when data, compute, and learning dynamics are designed together.

2. Scale reveals behavior.
Many learning problems only appear β€” and can only be solved β€” at massive scale.

3. Data is part of the model.
Every batch defines what the model becomes.


πŸ“Š Snapshot

duoan's github stats Top Langs


🌐 Connect

Linkedin Badge Gmail Badge


β€œAt scale, learning is a systems problem β€” and every system is a hypothesis about how intelligence forms.”
β€” Duo An

Pinned Loading

  1. mega-data-factory mega-data-factory Public

    🏭 Mega Scale Multimodal DataPipeline for SOTA Foundation Models

    Python 358 44

  2. mega-taxonomy mega-taxonomy Public

    🌳Mega Scale Hierarchical Semantic Indexing for the Multimodal Data

    C++ 5

  3. TorchCode TorchCode Public

    πŸ”₯ LeetCode for PyTorch β€” practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

    Jupyter Notebook 2.3k 171

  4. SiQ_VL SiQ_VL Public

    Seek Vision Language Model based on Siglip and Qwen

    Python 4

  5. ReplicateAI ReplicateAI Public

    Recreating every milestone in Machine Learning and Artificial Intelligence

    Python 6

  6. TinySigLIP TinySigLIP Public

    TinySigLIP: SigLIP Distillation via Affinity Mimicking and Weight Inheritance

    Python 4