Skip to content
View LetheSec's full-sized avatar
  • University of Science and Technology of China
  • Hefei, Anhui, P.R.China
  • 23:12 (UTC +08:00)

Block or report LetheSec

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 339,210 66,767 Updated Mar 28, 2026

Reinforcement Learning via Self-Distillation (SDPO)

Python 704 72 Updated Feb 18, 2026

Machine Learning Engineering Open Book

Python 17,562 1,116 Updated Mar 16, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 83,818 7,084 Updated Mar 27, 2026

Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.

TypeScript 1,434 54 Updated Mar 21, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,932 437 Updated Mar 28, 2026

RLLaVA is a user-friendly framework for multi-modal RL research and optimized for resource-constrained teams.

Python 58 6 Updated Mar 18, 2026
Python 36 2 Updated Jan 19, 2026

LaTeX template for USTC thesis

TeX 2,036 445 Updated Mar 26, 2026

Step-DeepResearch

Python 536 21 Updated Mar 24, 2026

A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require e…

Python 14,418 1,124 Updated Mar 22, 2026

Nano vLLM

Python 12,478 1,800 Updated Nov 3, 2025
Python 1,246 128 Updated Feb 28, 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 335 14 Updated Feb 5, 2026

My learning notes for ML SYS.

Python 5,787 376 Updated Mar 19, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 383 40 Updated Mar 25, 2026

Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau

Python 173 29 Updated Dec 23, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,554 1,430 Updated Feb 27, 2026

Evergreen, contamination-free, real-world, domain-specific AI evaluation framework

Python 130 7 Updated Jan 11, 2026

About Awesome things towards foundation agents. Papers / Repos / Blogs / ...

2,004 195 Updated Jul 28, 2025

A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithms with minimal intrusion.

Python 102 3 Updated Aug 25, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 6,542 861 Updated Dec 22, 2025

Implementation for FP8/INT8 Rollout for RL training without performence drop.

Python 300 22 Updated Nov 7, 2025

Awesome List for Agentic RL

HTML 896 39 Updated Mar 24, 2026

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 361 23 Updated Aug 24, 2025

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,731 748 Updated Nov 19, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,558 247 Updated Dec 21, 2025
Next