DRSY/README.md

Hi there 👋

😉 I am Siyu Ren.

🎓 I received my Ph.D. from Shanghai Jiao Tong University.

🔎 My current research interests include efficient methods for NLP and large language models, as well as techniques for the mechanistic understanding of LLM pretraining, instruction tuning, and alignment.

📚 For my academic publications, please refer to https://drsy.github.io/.

DRSY's GitHub stats · Most used languages

Pinned

  1. MoTIS (Public)

    [NAACL 2022] Mobile Text-to-Image search powered by multimodal semantic representation models (e.g., OpenAI's CLIP)

    Swift · 127 stars · 10 forks

  2. EMO (Public)

    [ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)

    Python · 126 stars · 13 forks

  3. EasyKV (Public)

    Easy control for Key-Value Constrained Generative LLM Inference (https://arxiv.org/abs/2402.06262)

    Python · 63 stars · 5 forks

  4. DGen (Public)

    [AAAI 2021] Knowledge-Driven Distractor Generation for Cloze-Style Multiple Choice Questions

    Python · 22 stars · 2 forks

  5. KV_Compression (Public)

    [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens

    Python · 25 stars

  6. LAMP (Public)

    [NAACL 2022 Findings] Specializing Pre-trained Language Models for Better Relational Reasoning via Network Pruning

    Python · 11 stars