Popular repositories Loading
- AdEMAMix-Optimizer-Pytorch
AdEMAMix-Optimizer-Pytorch PublicForked from nanowell/AdEMAMix-Optimizer-Pytorch
The AdEMAMix Optimizer: Better, Faster, Older.
Python
- recurrentgemma
recurrentgemma PublicForked from google-deepmind/recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
Python
- entropix
entropix PublicForked from xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
Python
- lingua
lingua PublicForked from facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Python
- ecco
ecco PublicForked from jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…
Jupyter Notebook
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

