Skip to content
View huckiyang's full-sized avatar
💮
love life. live life.
💮
love life. live life.

Highlights

  • Pro

Block or report huckiyang

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. NVIDIA-NeMo/NeMo NVIDIA-NeMo/NeMo Public

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    Python 17k 3.4k

  2. NVlabs/OmniVinci NVlabs/OmniVinci Public

    OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

    Python 646 51

  3. Srijith-rkr/Whispering-LLaMA Srijith-rkr/Whispering-LLaMA Public

    EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

    Jupyter Notebook 270 16

  4. Voice2Series-Reprogramming Voice2Series-Reprogramming Public

    ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification

    TypeScript 73 12

  5. YUCHEN005/GenTranslate YUCHEN005/GenTranslate Public

    Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"

    Python 199 9

  6. QuantumSpeech-QCNN QuantumSpeech-QCNN Public

    IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition

    Jupyter Notebook 107 20