Official repository of Evolutionary Optimization of Model Merging Recipes

Python 1,409 116 Updated Nov 29, 2024

PyTorch Implementation of DSB for Score Based Generative Modeling. Experiments managed using Hydra.

Python 201 28 Updated Nov 23, 2021

An Open-source Streaming High-fidelity Neural Audio Codec

Python 501 28 Updated Mar 4, 2025

Python parser and tools for MUSDB18 Music Separation Dataset

Python 194 41 Updated May 28, 2025

Spotify scraper to extract all the information from Spotify and download MP3s with the song's cover art

Makefile 242 24 Updated Mar 23, 2026

music generation with masked transformers!

Max 351 42 Updated May 16, 2025

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook 867 184 Updated Jul 22, 2023

Simple text to phones converter for multiple languages

Python 1,526 199 Updated Sep 26, 2024

This is the official repository for M2UGen

Jupyter Notebook 515 39 Updated Jan 2, 2025

Single channel speech source separation by diffusion process (ICASSP 2023)

Python 126 14 Updated Mar 15, 2024

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python 498 121 Updated Jul 1, 2021

AI-based tool that converts vocals, lyrics, and pitch from music to auto-generate UltraStar Deluxe, MIDI, and notes. It automates tapping, adding text, and pitching vocals, and creates karaoke files.

Python 492 59 Updated Mar 9, 2026

Refactored version of https://github.com/ming024/FastSpeech2

Python 14 2 Updated Oct 6, 2021

The source code for the paper XiaoiceSing2 (interspeech2023)

Python 49 3 Updated Jan 15, 2024

Jupyter Notebook 19 3 Updated Feb 2, 2023

text to speech using autoregressive transformer and VITS

Python 248 15 Updated Apr 3, 2024

Neural network-based singing voice synthesis library for research

Python 743 83 Updated Oct 9, 2023

vits2 backbone with bert

Python 83 8 Updated Jan 8, 2024

Billboard charts data

Jupyter Notebook 70 5 Updated Mar 24, 2026

Official implementation of the paper "Unifying GANs and Score-Based Diffusion as Generative Particle Models", NeurIPS 2023

Python 25 1 Updated Jan 5, 2025

Code for Pre-training Protein Encoder via Siamese Sequence-Structure Diffusion Trajectory Prediction (https://arxiv.org/abs/2301.12068)

Python 42 6 Updated Jul 9, 2023

Music pitch detection using the TensorFlow SPICE model.

Kotlin 73 20 Updated Sep 6, 2020

Official implementation of "Equivariant Self-Supervision for Musical Tempo Estimation (ISMIR 2022)"

Python 26 2 Updated Feb 6, 2023

Official PyTorch implementation of Contrastive Learning of Musical Representations

Python 335 52 Updated Jul 25, 2024

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Python 1,362 180 Updated Aug 19, 2024

Self-supervised learning for real-time pitch estimation

Python 283 25 Updated Oct 15, 2025

a co-creative looper that uses generative modeling to **not** repeat itself.

Max 287 10 Updated Apr 17, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,726 798 Updated Mar 25, 2026

Singing Voice Synthesis based on VITS, different from VISinger

Python 196 36 Updated Nov 13, 2023