Stars
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Translate the video from one language to another and embed dubbing & subtitles.
UniGetUI: The Graphical Interface for your package managers. Could be terribly described as a package manager manager to manage your package managers
Official implementations for paper: Anydoor: zero-shot object-level image customization
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere
Python bindings for FFmpeg - with complex filtering support
Stable Diffusion web UI
Removes backgrounds from pictures. Extension for webui.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Rembg is a tool to remove images background
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Labeling extension for Automatic1111's Web UI
Simplified Chinese translation extension for AUTOMATIC1111's stable diffusion webui
SD-WebUI 简体中文翻译扩展 Simplified Chinese Localization for AUTOMATIC1111 Stable Diffusion WebUI