Skip to content
View rymalia's full-sized avatar

Block or report rymalia

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Claude Code skill: Turn any lecture into a structured, searchable knowledge base with live Whisper transcription

Python 20 2 Updated Mar 21, 2026

Benchmarking STT service TTFB and semantic WER for real-time AI applications

Python 49 9 Updated Mar 20, 2026

Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK

Jupyter Notebook 1,517 235 Updated Mar 20, 2026

To bring back voice to those who lost it

TypeScript 79 7 Updated Mar 3, 2026

A terminal dashboard for Pipecat

Python 42 5 Updated Mar 16, 2026

A meeting note-taker that talks back.

Swift 1,538 143 Updated Mar 22, 2026

A multi-LLM Python pipeline that fetches your X (Twitter) bookmarks, classifies them by category, extracts structured data from chart images, plans trading strategies or indicators for finance book…

Rust 88 7 Updated Mar 20, 2026

Specification andΒ documentation for the Model Context Protocol

TypeScript 7,585 1,397 Updated Mar 22, 2026

A self hosted Web publishing platform on Rails.

Less 1,853 3,645 Updated Mar 16, 2026

A terminal app for macOS with a side-panel UI, workspaces, and native notifications β€” built for engineers juggling parallel sessions.

Swift 1 Updated Mar 17, 2026

GenMedia Creative Studio is a Vertex AI generative media user experience highlighting the use of Imagen, Veo, Gemini 🍌, Gemini TTS, Chirp 3, Lyria and other generative media APIs on Google Cloud.

Jupyter Notebook 988 318 Updated Mar 20, 2026

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 8,264 561 Updated Mar 20, 2026

OSSSpeechKit offers a native iOS Speech wrapper for AVFoundation and Apple's Speech.

Swift 181 42 Updated Apr 22, 2024

Native Swift MLX/Metal implementation of Moonshine V2 speech-to-text for Apple Silicon

Swift 2 Updated Mar 17, 2026

Spark-TTS Inference Code

Python 10,957 1,170 Updated Apr 9, 2025

Cross-platform voice agent pipeline engine (C++)

C++ 12 Updated Mar 16, 2026
JavaScript 11,499 1,013 Updated Mar 21, 2026

Autonomous experiment loop skill for Claude Code β€” port of pi-autoresearch

Shell 189 18 Updated Mar 22, 2026

A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.

Python 3,079 361 Updated Mar 12, 2026

https://hf.co/hexgrad/Kokoro-82M

JavaScript 6,070 686 Updated Aug 6, 2025

AI speech toolkit for Apple Silicon β€” ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML

Swift 443 50 Updated Mar 22, 2026

Smart IP camera stream finder. Tests 102K+ URL patterns in 30 seconds. Supports 67K camera models. Generates ready Frigate/go2rtc configs.

Go 534 18 Updated Mar 22, 2026

SOTA Open Source TTS

Python 28,649 2,401 Updated Mar 21, 2026

TTS with kokoro and onnx runtime

Python 2,425 253 Updated Jan 30, 2026

Demo code for the AWS/Deepgram Workshop

Python 14 3 Updated Sep 30, 2025

The markdown editor that's just a textarea https://overtype.dev

JavaScript 3,610 90 Updated Mar 14, 2026

SwiftUI agent skill for Claude Code, Codex, and other AI tools.

3,000 97 Updated Mar 11, 2026
Next