Skip to content
View Pomilon's full-sized avatar

Block or report Pomilon

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Utilities

Useful tools I frequently use.
14 repositories

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 165,794 15,096 Updated Mar 20, 2026

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, dif…

Go 44,167 3,768 Updated Mar 21, 2026

LLM inference in C/C++

C++ 98,865 15,691 Updated Mar 21, 2026

Python bindings for llama.cpp

Python 10,072 1,334 Updated Aug 15, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 98,591 12,494 Updated Mar 21, 2026

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 180,364 56,014 Updated Mar 21, 2026

A responsive, self-hosted manga scraper and reader built with Python and Node.js, featuring automated updates and a modern UI.

JavaScript 1 Updated Nov 27, 2025

[PENDING REWRITE] Pome's official package manager.

C++ 1 Updated Dec 12, 2025

Pome is a powerful scripting language that pairs a lightweight, Lua-style syntax with advanced features like classes and modules.

C++ 1 Updated Feb 15, 2026

Plexir is a modular, keyboard-centric AI terminal workspace that combines multi-provider LLM orchestration, persistent Docker sandboxing, and advanced agentic tools into a professional TUI. Designe…

Python 2 Updated Feb 3, 2026

An AI technical biographer that transforms your GitHub footprint into a professional Persona Audit PDF. Features agentic repository deconstruction and a savage roast mode.

Python 3 Updated Jan 21, 2026

High-performance daemon for real-time codebase indexing. Generates semantic embeddings locally to provide AI agents with tools for instant search interface.

C++ 1 Updated Jan 21, 2026

LEMA (Layer-wise Efficient Memory Abstraction): A hardware-aware framework for fine-tuning LLMs in VRAM-constrained environments using asynchronous binary pre-fetching and triple-tier memory orches…

Python 1 Updated Feb 17, 2026

A semantic meta-search engine and synthesis workstation featuring self-correcting vector memory and local AI insights via Ollama.

HTML 1 Updated Mar 19, 2026