Open-source vision stack with stereo camera hardware, GPU processing, and AI agent for training video classifiers.
- Updated
Oct 19, 2025 - Python
Open-source vision stack with stereo camera hardware, GPU processing, and AI agent for training video classifiers.
Masked Multi-Component Gated Decomposition Architecture
🎥 Discover similar motion dynamics in videos with MotionMatch, a physics-based search engine leveraging Meta's V-JEPA 2 for efficient retrieval.
vjepa / vjepa2 / vjepa2.1 PCA visualization utility for dense features and world model inspection.
A physics-based video search engine using Meta's V-JEPA 2 world model to find videos with similar motion dynamics.
Can the V-JEPA2 model be used as a world model?
Locally-Hosted Media Gallery App with AI Similarity Search
Patch-level predictive surprise from video foundation model embeddings. The embedding delta is the attention signal.
🎥 Enhance video–text alignment using V-DeClip's advanced MCGD architecture for precise, semantically decomposed video embeddings.
Assess Data Quality Before Annotation or Labelled Data Quality after Annotation (Txt files/Yolo Format). Visualise the patterns covered by each class/activity.
Add a description, image, and links to the vjepa topic page so that developers can more easily learn about it.
To associate your repository with the vjepa topic, visit your repo's landing page and select "manage topics."