💮
love life. live life.
Speech, Alignments, Robust LMs
- NVIDIA Research
- 03:53
(UTC -07:00) - huckiyang.github.io/
- @huckiyang
- channel/UCSj3hCBIds5BpyO7A4F3l7A
Highlights
- Pro
Pinned Loading
- NVIDIA-NeMo/NeMo
NVIDIA-NeMo/NeMo PublicA scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
- NVlabs/OmniVinci
NVlabs/OmniVinci PublicOmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
- Srijith-rkr/Whispering-LLaMA
Srijith-rkr/Whispering-LLaMA PublicEMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
- Voice2Series-Reprogramming
Voice2Series-Reprogramming PublicICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification
- YUCHEN005/GenTranslate
YUCHEN005/GenTranslate PublicCode for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"
- QuantumSpeech-QCNN
QuantumSpeech-QCNN PublicIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
