Imiloin (Imiloin) / Starred · GitHub

Stars

QwenLM / Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 10,060 1,271 Updated Mar 17, 2026

Tencent-Hunyuan / HY-Motion-1.0

HY-Motion model for 3D human motion or 3D character animation generation.

Python 2,222 178 Updated Jan 29, 2026

badges / shields

Concise, consistent, and legible badges in SVG and raster format

JavaScript 26,330 5,588 Updated Mar 27, 2026

microsoft / TRELLIS.2

Native and Compact Structured Latents for 3D Generation

Python 4,588 519 Updated Jan 10, 2026

FunAudioLLM / Fun-ASR

Fun-ASR is a large end-to-end speech recognition model launched by Tongyi Lab.

Python 975 86 Updated Feb 25, 2026

microsoft / VibeVoice

Open-Source Frontier Voice AI

Python 25,316 2,759 Updated Mar 28, 2026

huggingface / evaluation-guidebook

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,083 122 Updated Dec 3, 2025

Tongyi-MAI / Z-Image

Python 10,766 720 Updated Feb 9, 2026

boyu-ai / Hands-on-RL

https://hrl.boyuai.com/

Jupyter Notebook 4,625 802 Updated Nov 22, 2022

stepfun-ai / Step-Audio-EditX

A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech.

Python 888 62 Updated Mar 16, 2026

1357310795 / TboxWebdav

C# 70 5 Updated Jul 4, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 22,764 2,092 Updated Jan 27, 2026

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 73,239 10,044 Updated Mar 26, 2026

nunchaku-ai / ComfyUI-nunchaku

ComfyUI plugin for Nunchaku

Python 2,821 153 Updated Feb 19, 2026

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 57,444 4,755 Updated Mar 28, 2026

rednote-hilab / dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 8,131 726 Updated Mar 24, 2026

github / spec-kit

💫 Toolkit to help you get started with Spec-Driven Development

Python 83,149 7,117 Updated Mar 27, 2026

bytedance / USO

[CVPR 2026] 🔥🔥 Official Repo of USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning

Python 1,216 76 Updated Sep 12, 2025

RikkaApps / Shizuku

Using system APIs directly with adb/root privileges from normal apps through a Java process started with app_process.

Kotlin 23,398 2,184 Updated Jun 18, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,948 789 Updated Mar 11, 2026

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,952 2,064 Updated Mar 27, 2026

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,639 465 Updated Feb 10, 2026

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 7,996 615 Updated Jan 18, 2026

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,415 13,648 Updated Mar 26, 2026

microsoft / vscode-copilot-chat

Copilot Chat extension for VS Code

TypeScript 9,711 1,779 Updated Mar 28, 2026

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 99,348 12,694 Updated Mar 28, 2026

tencent-ailab / SongGeneration

The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment

Python 1,526 184 Updated Mar 12, 2026

anthropics / courses

Anthropic's educational courses

Jupyter Notebook 19,984 1,988 Updated Nov 13, 2025

modelscope / modelscope-classroom

Jupyter Notebook 1,345 163 Updated Mar 24, 2026

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 19,649 2,419 Updated Mar 16, 2026