aria-amt

Efficient and robust implementation of seq-to-seq automatic piano transcription.

Install

Requires Python 3.11

git clone https://github.com/EleutherAI/aria-amt.git cd aria-amt pip install -e .

Download the preliminary model weights:

Piano (v1)

wget "https://huggingface.co/datasets/loubb/aria-midi/resolve/main/piano-medium-double-1.0.safetensors?download=true"

Usage

You can download mp3s from youtube using yt-dlp:

yt-dlp --audio-format mp3 --extract-audio --no-playlist --audio-quality 0 <youtube-link> -o <save-path>

You can then transcribe using the cli:

aria-amt transcribe \ medium-double \ <path-to-checkpoint> \ -load_path <path-to-audio> \ -save_dir <path-to-save-dir> \ -bs 1 \ -compile

If you want to do batch transcription, use the -load_dir flag and adjust -bs accordingly. Compiling and may take some time, but provides a significant speedup. Quantizing (-q8 flag) further speeds up inference when the -compile flag is also used.

NOTE: Int8 quantization is only supported on GPUs that support BF16.

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
.github/workflows		.github/workflows
amt		amt
config		config
scripts		scripts
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

aria-amt

Install

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

aria-amt

Install

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages