MINT-SJTU


RoboClaw is an Embodied AI Assistant.

Python 218 15 Updated Mar 24, 2026

We release Evo-RL, open-source real-world offline RL on So-101 and AgileX PiPER for easier reproduction.

Python 325 28 Updated Mar 19, 2026

ConLA: Contrastive Latent Action Learning from Human Videos for Robotic Manipulation

7 Updated Feb 8, 2026

The official implementation of VLA-Pruner: Temporal-Aware Dual-Level Visual Token Pruning for Efficient Vision-Language-Action Inference.

Python 28 2 Updated Feb 11, 2026

This website is for the collection of VLA SOTA results.

TypeScript 136 3 Updated Mar 17, 2026

Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment

Python 234 25 Updated Mar 8, 2026

U-Arm: Lerobot-Everything-Cross-Embodiment-Teleoperation

Python 212 20 Updated Mar 17, 2026

Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.

54 2 Updated Nov 24, 2025

The open-source CapCut alternative

TypeScript 47,289 4,914 Updated Mar 24, 2026

🔥🔥 First-ever hour-scale video understanding models

Python 617 41 Updated Jul 14, 2025

RoboFAC: A Comprehensive Framework for Robotic Failure Analysis and Correction

Python 28 1 Updated Dec 10, 2025

STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?

Python 38 1 Updated Jan 12, 2026

🔥🔥 MLVU: Multi-task Long Video Understanding Benchmark

Python 242 5 Updated Aug 21, 2025