NormXU

🎯

Pixels Do Think Like Text.

Norm Inui NormXU

🎯

Pixels Do Think Like Text.

AI Engineer @meituan | Johns Hopkins University

89 followers · 155 following

https://normxu.github.io/

Achievements

Lists (8)

Sort

Starred repositories

svg-project / flash-kmeans

Fast and memory-efficient exact kmeans

Python 436 22 Updated Mar 17, 2026

gsd-build / get-shit-done

A light-weight and powerful meta-prompting, context engineering and spec-driven development system for Claude Code by TÂCHES.

JavaScript 37,357 3,029 Updated Mar 21, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 3,866 377 Updated Mar 21, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 46,484 6,449 Updated Mar 21, 2026

affaan-m / everything-claude-code

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 91,393 11,998 Updated Mar 21, 2026

nextlevelbuilder / ui-ux-pro-max-skill

An AI SKILL that provide design intelligence for building professional UI/UX multiple platforms

Python 46,936 4,550 Updated Mar 10, 2026

ReScienceLab / LightGUIAgent

Lightweight GUI Automation Agent with Grid-Based Visual Grounding

Python 8 1 Updated Feb 1, 2026

NVlabs / GDPO

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 419 25 Updated Feb 17, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 327,513 63,426 Updated Mar 21, 2026

remyxai / VQASynth

Compose multimodal datasets 🎹

Python 551 25 Updated Jan 5, 2026

code-yeongyu / oh-my-openagent

omo; the best agent harness - previously oh-my-opencode

TypeScript 41,992 3,129 Updated Mar 21, 2026

miantiao-me / aigc-weekly

Agili 的 AIGC 周刊 - 一个由 Agentic AI Agent 驱动的 AIGC（人工智能生成内容）精选周刊。

TypeScript 514 61 Updated Mar 8, 2026

KlingAIResearch / MemFlow

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Python 188 7 Updated Dec 29, 2025

facebookresearch / llm-transparency-tool

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…

Python 1,249 108 Updated Dec 3, 2024

yellow-binary-tree / MMDuet2

[ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning

Python 20 2 Updated Jan 14, 2026

loro-dev / loro

Make your JSON data collaborative and version-controlled with CRDTs

Rust 5,446 135 Updated Mar 21, 2026

LTH14 / JiT

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,203 151 Updated Dec 8, 2025

TabViewer / tabview

Python curses command line CSV and tabular data viewer

Python 473 49 Updated Dec 22, 2022

meituan-longcat / LongCat-Flash-Omni

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 479 31 Updated Mar 3, 2026

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 22,729 2,090 Updated Jan 27, 2026

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,813 71 Updated Feb 25, 2026

juntaosun / ComeCut

「来剪」轻量级视频编辑器。网页版、桌面版等均可免费使用，功能灵感源自 CapCut 等编辑器。A Lightweight Video Editor. Free for the web, desktop, and more, with features inspired by editors like CapCut.

Batchfile 484 57 Updated Oct 25, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 49,751 6,524 Updated Mar 17, 2026

SamsungSAILMontreal / TinyRecursiveModels

Python 6,419 1,003 Updated Dec 2, 2025

QwenLM / Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,542 236 Updated Jan 8, 2026

rednote-hilab / dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 8,093 727 Updated Mar 19, 2026

PicoTrex / Awesome-Nano-Banana-images

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

21,580 2,205 Updated Dec 12, 2025

meituan-longcat / LongCat-Flash-Chat

1,319 66 Updated Mar 3, 2026

LaurentMazare / tboard-rs

Read and write tensorboard data using Rust

Rust 24 1 Updated Feb 4, 2024

microsoft / VibeVoice

Open-Source Frontier Voice AI

Python 23,916 2,639 Updated Mar 6, 2026

Norm Inui NormXU

Lists (8)

awesome libraries

chat-gpt usecases

Image / Video Gen

(M)LLM

open dataset

Prompt Enginnering

reinforcement learning

transformers in Everything

Starred repositories

jekyll-plugin

Google