Skip to content
View NormXU's full-sized avatar
🎯
Pixels Do Think Like Text.
🎯
Pixels Do Think Like Text.

Block or report NormXU

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Fast and memory-efficient exact kmeans

Python 436 22 Updated Mar 17, 2026

A light-weight and powerful meta-prompting, context engineering and spec-driven development system for Claude Code by TÂCHES.

JavaScript 37,357 3,029 Updated Mar 21, 2026

OpenClaw-RL: Train any agent simply by talking

Python 3,866 377 Updated Mar 21, 2026

AI agents running research on single-GPU nanochat training automatically

Python 46,484 6,449 Updated Mar 21, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 91,393 11,998 Updated Mar 21, 2026

An AI SKILL that provide design intelligence for building professional UI/UX multiple platforms

Python 46,936 4,550 Updated Mar 10, 2026

Lightweight GUI Automation Agent with Grid-Based Visual Grounding

Python 8 1 Updated Feb 1, 2026

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 419 25 Updated Feb 17, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 327,513 63,426 Updated Mar 21, 2026

Compose multimodal datasets 🎹

Python 551 25 Updated Jan 5, 2026

omo; the best agent harness - previously oh-my-opencode

TypeScript 41,992 3,129 Updated Mar 21, 2026

Agili 的 AIGC 周刊 - 一个由 Agentic AI Agent 驱动的 AIGC(人工智能生成内容)精选周刊。

TypeScript 514 61 Updated Mar 8, 2026

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Python 188 7 Updated Dec 29, 2025

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…

Python 1,249 108 Updated Dec 3, 2024

[ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning

Python 20 2 Updated Jan 14, 2026

Make your JSON data collaborative and version-controlled with CRDTs

Rust 5,446 135 Updated Mar 21, 2026

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,203 151 Updated Dec 8, 2025

Python curses command line CSV and tabular data viewer

Python 473 49 Updated Dec 22, 2022

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 479 31 Updated Mar 3, 2026

Contexts Optical Compression

Python 22,729 2,090 Updated Jan 27, 2026

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,813 71 Updated Feb 25, 2026

「来剪」轻量级视频编辑器。网页版、桌面版等均可免费使用,功能灵感源自 CapCut 等编辑器。A Lightweight Video Editor. Free for the web, desktop, and more, with features inspired by editors like CapCut.

Batchfile 484 57 Updated Oct 25, 2025

The best ChatGPT that $100 can buy.

Python 49,751 6,524 Updated Mar 17, 2026

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,542 236 Updated Jan 8, 2026

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 8,093 727 Updated Mar 19, 2026

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

21,580 2,205 Updated Dec 12, 2025

Read and write tensorboard data using Rust

Rust 24 1 Updated Feb 4, 2024

Open-Source Frontier Voice AI

Python 23,916 2,639 Updated Mar 6, 2026
Next