HydroRoll-Team / HydroRoll Star 8 Code Issues Pull requests 跨平台、多任务、高度自定义的骰系开发框架。 nlp dice text-to-speech framework ai cross-platform model artificial-intelligence tts webui dice-roller roll ner asr re dice-roller-library nature-language-processing hydroroll audio-speech-recognition Updated Nov 21, 2025 Python
hari-huynh / viVQA-voice-assistant Star 4 Code Issues Pull requests Voice assistant using Multimodal LLMs - LLaVA-NeXT (Mistral 7B) finetuned & PhoWhisper text-to-speech lora visual-question-answering llava multimodal-large-language-models audio-speech-recognition mistral-7b Updated May 15, 2024 Python
DevExpert0101 / SpeechDoctor Star 3 Code Issues Pull requests Analyze an audio file and count words, sentences and timestamps, filler words openai speech-to-text spectral-analysis voice-activity-detection google-colab vosk audio-speech-recognition Updated Jun 23, 2023 Jupyter Notebook
merakleee / Audio-Speech-Emotion-Recognition Star 1 Code Issues Pull requests A machine learning solution for classifying emotions in speech audio using hybrid deep learning (CNN-LSTM) and gradient boosting (XGBoost). ml kaggle-competition xgboost cnn-lstm cnn-lstm-models classifying-emotions audio-speech-recognition hybrid-architecture Updated Nov 24, 2025 Jupyter Notebook