Skip to content
View Aaronhuang-778's full-sized avatar

Organizations

@Efficient-Large-Model

Block or report Aaronhuang-778

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. NVlabs/Long-RL NVlabs/Long-RL Public

    Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

    Python 705 28

  2. NVlabs/QeRL NVlabs/QeRL Public

    [ICLR 2026]QeRL enables RL for 32B LLMs on a single H100 GPU.

    Python 495 51

  3. BiLLM BiLLM Public

    [ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

    Python 229 18

  4. Mixture-Compressor-MoE Mixture-Compressor-MoE Public

    [ICLR 2025, IEEE TPAMI 2026] Mixture Compressor & MC#

    Python 69 5

  5. SliM-LLM SliM-LLM Public

    [ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models

    Python 55 5

  6. hshjerry/VideoEspresso hshjerry/VideoEspresso Public

    [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

    Python 138 5