🎯
building foundational knowledge

xrsrke/README.md

Pinned

  1. huggingface/nanotron Public

    Minimalistic large language model 3D-parallelism training

    Python · 2.6k stars · 290 forks

  2. pipegoose Public

    Large-scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still a work in progress)*

    Python · 87 stars · 19 forks

  3. instructGOOSE Public

    Implementation of Reinforcement Learning from Human Feedback (RLHF)

    Jupyter Notebook · 174 stars · 21 forks

  4. toolformer Public

    Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools

    Jupyter Notebook · 144 stars · 15 forks

  5. reinforcement-learning Public

    Jupyter Notebook · 10 stars · 1 fork