basujindal

Basu Jindal basujindal

The difference you'll see When you drop your penny: The river has splashes, The sky hasn't any.

57 followers · 0 following

Apple
Seattle
14:19 (UTC -07:00)
basujindal.me

Achievements

Stars

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 49,955 6,542 Updated Mar 17, 2026

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 22,895 2,541 Updated Mar 22, 2026

lharries / whatsapp-mcp

WhatsApp MCP server

Go 5,441 941 Updated Jul 13, 2025

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,478 1,743 Updated Mar 18, 2026

pranjalssh / fast.cu

Fastest kernels written from scratch

Cuda 561 69 Updated Sep 18, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,273 840 Updated Mar 22, 2026

MekkCyber / CutlassAcademy

A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS

255 13 Updated May 6, 2025

mit-han-lab / distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 726 34 Updated Dec 2, 2024

augustwester / searchthearxiv

The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv.

Python 171 15 Updated Apr 21, 2025

kitsunyan / intel-undervolt

Intel CPU undervolting and throttling configuration tool

C 1,057 71 Updated Aug 24, 2023

mihic / linux-intel-undervolt

Guide to linux undervolting for Haswell and never Intel CPUs

394 13 Updated Apr 4, 2018

kuleshov-group / mdlm

[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model

Python 660 92 Updated Sep 29, 2025

wilsonzlin / hackerverse

Exploring Hacker News by mapping and analyzing 40 million posts and comments for fun

TypeScript 211 9 Updated May 14, 2025

google-deepmind / penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,872 70 Updated Jun 22, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 29,234 3,441 Updated Jun 26, 2025

Guangxuan-Xiao / torch-int

This repository contains integer operators on GPUs for PyTorch.

Python 237 56 Updated Sep 29, 2023

Lightning-AI / lightning-thunder

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,449 109 Updated Mar 17, 2026

IST-DASLab / marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 1,041 86 Updated Sep 4, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 28,734 2,917 Updated Apr 30, 2025

xai-org / grok-1

Grok open release

Python 51,529 8,475 Updated Aug 30, 2024

tspeterkim / flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 1,098 110 Updated Dec 30, 2024

basujindal / stable-diffusion

Forked from CompVis/stable-diffusion

Optimized Stable Diffusion modified to run on lower GPU VRAM

Jupyter Notebook 3,099 455 Updated Sep 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Basu Jindal basujindal

Achievements

Achievements

Block or report basujindal

Stars

karpathy / nanochat

Dao-AILab / flash-attention

lharries / whatsapp-mcp

NVIDIA / cutlass

pranjalssh / fast.cu

deepseek-ai / DeepGEMM

MekkCyber / CutlassAcademy

mit-han-lab / distrifuser

augustwester / searchthearxiv

kitsunyan / intel-undervolt

mihic / linux-intel-undervolt

kuleshov-group / mdlm

wilsonzlin / hackerverse

google-deepmind / penzai

karpathy / llm.c

Guangxuan-Xiao / torch-int

Lightning-AI / lightning-thunder

IST-DASLab / marlin

hpcaitech / Open-Sora

xai-org / grok-1

tspeterkim / flash-attention-minimal

basujindal / stable-diffusion

enricoros / big-AGI

EGjoni / DRUGS

Stirling-Tools / Stirling-PDF

chriskiehl / Gooey

ZenPrivacy / zen-desktop

mozilla-ai / llamafile

ekzhang / sshx

Genymobile / scrcpy