Private-ASR

This project is modded from FunClip project, built with ASR (Automatic Speech Recognition), speaker identification, SRT editing, and LLM-based summarization capabilities. It integrates Gradio as the user interface, providing an interactive and easy-to-use platform.

简体中文 / English

本项目基于开源项目 FunClip 进行修改，集成了自动语音识别 (ASR)、说话人分离、SRT 字幕编辑以及基于 LLM 的总结功能。项目使用 Gradio 提供了一个直观易用的用户界面。

Update: Added support for GPU inference for both Docker/local deployment. Docker-GPU deployment Check This

📜 Credits

This project builds upon the open-source FunClip by Alibaba DAMO Academy. I modded the functionality to include:

ASR Summarization using LLMs (OpenAI GPT, custom API).
Dynamic SRT Replacement with speaker mapping.
Deployment Ready using Docker for production environments.

🎯 Features

Automatic Speech Recognition (ASR):
- Supports video and audio inputs.
- Outputs text and SRT subtitles.
Speaker Identification (SD):
- Identifies and differentiates speakers in multi-speaker audio/video.
SRT Subtitle Editing:
- Replace speaker identifiers with user-defined names.
LLM Summarization:
- Summarize ASR results using GPT-based models.
- Allows custom API configurations.
Deployment Options:
- Lightweight Docker container for production.
- Python environment for development/testing.

🛠 Requirements

System(2 Ways to Deploy)

Docker (for containerized deployment)
Python 3.9+ (for manual deployment)

Dependencies

See the requirements.txt file

🚀 Deployment

1. Docker Deployment

Build the Docker Image

Run the following command to build the Docker image:

docker build -t audio-processor:latest .

Deploy with Docker Compose

Use the following docker-compose.yml file for deployment:

version: '3.8' services: audio-processor: image: audio-processor:latest # The image you built container_name: audio-processor ports: - "7860:7860" volumes: - ./.env:/app/.env # Map the .env file working_dir: /app restart: unless-stopped

Run the deployment:

docker-compose up -d

The Gradio interface will be available at:
http://localhost:7860

2. Python Deployment

Setup Environment

Clone the repository:

git clone https://github.com/MotorBottle/Audio-Processor.git cd audio-processor

Install dependencies:

python3 -m venv .venv source .venv/bin/activate pip install --no-cache-dir -r requirements.txt

Ensure FFmpeg is installed(for Mac use brew):

sudo apt-get update sudo apt-get install -y ffmpeg

Run the Application

Use the following command:

python funclip/launch.py --listen

The Gradio interface will be available at:
http://localhost:7860

Default user name: motor

Default passwd: admin

⚙️ Environment Configuration

All credentials and API configurations can be stored in a .env file.

Example .env file:

USERNAME=motor PASSWORD=admin OPENAI_API_KEY=your_openai_key OPENAI_API_BASE=https://your-custom-api.com

🎥 Usage

Upload audio or video files.
Perform ASR Recognition or Speaker Differentiation.
Edit speaker names in the generated SRT subtitles.
Use the LLM Summarization feature to analyze and summarize the ASR text.

🔗 Contributions & License

This project is released under the MIT License. Contributions are welcome!

For the original FunClip repository, visit:
FunClip on GitHub

Name		Name	Last commit message	Last commit date
Latest commit History 236 Commits
docs/images		docs/images
font		font
funclip		funclip
.env.example		.env.example
.gitignore		.gitignore
Docker_GPU.md		Docker_GPU.md
Dockerfile		Dockerfile
ExtractCode.py		ExtractCode.py
IMG_3044.JPG		IMG_3044.JPG
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
docker-compose.yml		docker-compose.yml
image.png		image.png
requirements.txt		requirements.txt
requirements_cuda_docker.txt		requirements_cuda_docker.txt
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Private-ASR

📜 Credits

🎯 Features

🛠 Requirements

System(2 Ways to Deploy)

Dependencies

🚀 Deployment

1. Docker Deployment

Build the Docker Image

Deploy with Docker Compose

2. Python Deployment

Setup Environment

Run the Application

⚙️ Environment Configuration

🎥 Usage

🔗 Contributions & License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Private-ASR

📜 Credits

🎯 Features

🛠 Requirements

System(2 Ways to Deploy)

Dependencies

🚀 Deployment

1. Docker Deployment

Build the Docker Image

Deploy with Docker Compose

2. Python Deployment

Setup Environment

Run the Application

⚙️ Environment Configuration

🎥 Usage

🔗 Contributions & License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages