ALR Transformer

ALR Transformer that replaces the original transformer implementation of an joint encoder + decoder block with a feedforward/alr block with a decoder block

Install

pip install alr-transformer

Usage

import torch from alr_transformer import ALRTransformer x = torch.randint(0, 100000, (1, 2048)) model = ALRTransformer( dim = 512, depth = 6, num_tokens = 100000, dim_head = 64, heads = 8, ff_mult = 4 ) out = model(x) print(out) print(out.shape)

Train

First git clone the repo then download and then run the following

python3 train.py

Citation

@misc{bozic2023rethinking, title={Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers}, author={Vukasin Bozic and Danilo Dordervic and Daniele Coppola and Joseph Thommes}, year={2023}, eprint={2311.10642}, archivePrefix={arXiv}, primaryClass={cs.CL} }

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github		.github
alr_transformer		alr_transformer
data		data
.DS_Store		.DS_Store
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
agorabanner.png		agorabanner.png
example.py		example.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
tests.py		tests.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ALR Transformer

Install

Usage

Train

Citation

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

ALR Transformer

Install

Usage

Train

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages