#

audio-speech-recognition

Here are 4 public repositories matching this topic...

HydroRoll-Team / HydroRoll

跨平台、多任务、高度自定义的骰系开发框架。

nlp dice text-to-speech framework ai cross-platform model artificial-intelligence tts webui dice-roller roll ner asr re dice-roller-library nature-language-processing hydroroll audio-speech-recognition

Updated Nov 21, 2025
Python

hari-huynh / viVQA-voice-assistant

Voice assistant using Multimodal LLMs - LLaVA-NeXT (Mistral 7B) finetuned & PhoWhisper

text-to-speech lora visual-question-answering llava multimodal-large-language-models audio-speech-recognition mistral-7b

Updated May 15, 2024
Python

DevExpert0101 / SpeechDoctor

Analyze an audio file and count words, sentences and timestamps, filler words

openai speech-to-text spectral-analysis voice-activity-detection google-colab vosk audio-speech-recognition

Updated Jun 23, 2023
Jupyter Notebook

merakleee / Audio-Speech-Emotion-Recognition

A machine learning solution for classifying emotions in speech audio using hybrid deep learning (CNN-LSTM) and gradient boosting (XGBoost).

ml kaggle-competition xgboost cnn-lstm cnn-lstm-models classifying-emotions audio-speech-recognition hybrid-architecture

Updated Nov 24, 2025
Jupyter Notebook

Improve this page

Add a description, image, and links to the audio-speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the audio-speech-recognition topic, visit your repo's landing page and select "manage topics."