Skip to content
View kholam's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report kholam

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of papers in 100 lines of code.

Python 2,643 243 Updated Jan 22, 2026

Contexts Optical Compression

Python 22,732 2,090 Updated Jan 27, 2026

Detail code implementation and experimental setting for our paper: Federated Learning on Multilabel Evolving Data Streams

Python 1 Updated Oct 22, 2025

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

TypeScript 18,845 1,433 Updated Sep 21, 2025

AIs for nature

Rust 151 7 Updated Mar 20, 2026

PyTorch Wildlife: a Collaborative Deep Learning Framework for Conservation.

Python 995 291 Updated Mar 17, 2026

Self-hosted AI coding assistant

Rust 33,034 1,691 Updated Mar 2, 2026

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

14,012 785 Updated Jul 30, 2025

An open-source framework for machine learning and other computations on decentralized data.

Python 2,430 606 Updated Mar 20, 2026

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,901 307 Updated Jan 16, 2024

DeepSeek LLM: Let there be answers

Makefile 6,778 1,061 Updated Feb 4, 2024

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,526 1,005 Updated Feb 6, 2026

Analyze computation-communication overlap in V3/R1.

1,150 145 Updated Mar 21, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,940 443 Updated Mar 5, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,272 840 Updated Feb 27, 2026

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,934 318 Updated Jan 14, 2026

An elegant PyTorch deep reinforcement learning library.

Python 10,394 1,285 Updated Dec 1, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,773 1,013 Updated Mar 9, 2026

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,248 1,815 Updated Feb 26, 2025

Integrate the DeepSeek API into popular software

35,989 3,998 Updated Feb 23, 2026

s1: Simple test-time scaling

Python 6,646 766 Updated Jun 25, 2025

This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Report

Jupyter Notebook 56 111 Updated Dec 2, 2025

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,420 85 Updated Apr 21, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 5,125 455 Updated Dec 13, 2025
1 Updated Feb 14, 2025

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

Python 738 58 Updated Jun 2, 2025

Machine Learning Engineering Open Book

Python 17,474 1,108 Updated Mar 16, 2026
Next