Stars
Claude Code skill: Turn any lecture into a structured, searchable knowledge base with live Whisper transcription
Benchmarking STT service TTFB and semantic WER for real-time AI applications
Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK
To bring back voice to those who lost it
A multi-LLM Python pipeline that fetches your X (Twitter) bookmarks, classifies them by category, extracts structured data from chart images, plans trading strategies or indicators for finance book…
Specification and documentation for the Model Context Protocol
A self-hosted web publishing platform on Rails.
A terminal app for macOS with a side-panel UI, workspaces, and native notifications, built for engineers juggling parallel sessions.
GenMedia Creative Studio is a Vertex AI generative media user experience highlighting the use of Imagen, Veo, Gemini, Gemini TTS, Chirp 3, Lyria and other generative media APIs on Google Cloud.
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
OSSSpeechKit offers a native iOS Speech wrapper for AVFoundation and Apple's Speech.
Native Swift MLX/Metal implementation of Moonshine V2 speech-to-text for Apple Silicon
Autonomous experiment loop skill for Claude Code, port of pi-autoresearch
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
AI speech toolkit for Apple Silicon: ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML
Smart IP camera stream finder. Tests 102K+ URL patterns in 30 seconds. Supports 67K camera models. Generates ready Frigate/go2rtc configs.
Demo code for the AWS/Deepgram Workshop
The markdown editor that's just a textarea https://overtype.dev
SwiftUI agent skill for Claude Code, Codex, and other AI tools.



