Skip to content
View mnjm's full-sized avatar
🗿
🗿

Block or report mnjm

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Experiments for understanding disentanglement in VAE latent representations

Python 841 148 Updated Feb 2, 2023

[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.

Python 5,943 725 Updated Mar 22, 2026

Fast and accurate automatic speech recognition (ASR) for edge devices

C 7,455 378 Updated Mar 19, 2026

Pure C inference of Mistral Voxtral Realtime 4B speech to text model

C 1,539 99 Updated Feb 15, 2026

Object Detection & Tracking with Object Velocity - Hobby Project

Python 1 Updated Jan 6, 2026

Low-latency AI engine for mobile devices & wearables

C 4,505 335 Updated Mar 22, 2026

End-to-End Object Detection with Transformers

Python 15,172 2,662 Updated Mar 12, 2024

TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data

Python 3,931 525 Updated Mar 21, 2026

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,913 185 Updated Feb 24, 2024

Profile PyTorch models for FLOPs and parameters, helping to evaluate computational efficiency and memory usage.

Python 130 8 Updated Mar 11, 2026

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,779 2,216 Updated Jul 24, 2024

Modified Swin Transformer model in PyTorch on CIFAR-10 for image classification

Python 10 2 Updated May 5, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,503 1,119 Updated Mar 20, 2026

tiny vision language model

Python 9,443 742 Updated Nov 14, 2025

LLM Council works together to answer your hardest questions

Python 16,016 3,205 Updated Nov 22, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 81,722 2,824 Updated Mar 22, 2026

Puzzles for learning Triton

Jupyter Notebook 2,343 207 Updated Mar 18, 2026

Open Hardware Monitor

C# 6,365 1,311 Updated Jul 13, 2024

The best ChatGPT that $100 can buy.

Python 49,901 6,536 Updated Mar 17, 2026

🐍 Geometric Computer Vision Library for Spatial AI

Python 11,125 1,168 Updated Mar 22, 2026

Find why PyTorch training is slow while it’s still running

Python 124 10 Updated Mar 21, 2026

On-device Speech Recognition for Android

Kotlin 205 28 Updated Jan 24, 2026

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 3,005 218 Updated Jan 18, 2026

fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing d…

Python 1,836 87 Updated Feb 18, 2026

Kimi K2 is the large language model series developed by Moonshot AI team

10,535 795 Updated Jan 21, 2026

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 77,640 7,497 Updated Mar 22, 2026

Open Source Computer Vision Library

C++ 86,728 56,564 Updated Mar 20, 2026

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,524 158 Updated Dec 17, 2025

It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced research…

4,821 322 Updated Aug 22, 2025

Sync entire obsidian library to notion database with images included. Minimal user intervention after initial setup, can be run multiple times without duplicating values to provide a backup like ex…

JavaScript 22 5 Updated Jul 21, 2025
Next