LetheSec (Xiaojian Yuan) / Starred

Lists (2)

Sort

Basic

15 repositories

Paper Code

5 repositories

Stars

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 339,210 66,767 Updated Mar 28, 2026

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 704 72 Updated Feb 18, 2026

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 17,562 1,116 Updated Mar 16, 2026

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 83,818 7,084 Updated Mar 27, 2026

overleaf-workshop / Overleaf-Workshop

Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.

TypeScript 1,434 54 Updated Mar 21, 2026

inclusionAI / AReaL

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,932 437 Updated Mar 28, 2026

TinyLoopX / RLLaVA

RLLaVA is a user-friendly framework for multi-modal RL research and optimized for resource-constrained teams.

Python 58 6 Updated Mar 18, 2026

RUC-NLPIR / SmartSearch

Python 36 2 Updated Jan 19, 2026

ustctug / ustcthesis

LaTeX template for USTC thesis

TeX 2,036 445 Updated Mar 26, 2026

stepfun-ai / StepDeepResearch

Step-DeepResearch

Python 536 21 Updated Mar 24, 2026

muratcankoylan / Agent-Skills-for-Context-Engineering

A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require e…

Python 14,418 1,124 Updated Mar 22, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 12,478 1,800 Updated Nov 3, 2025

ChenmienTan / RL2

Python 1,246 128 Updated Feb 28, 2026

stepfun-ai / PaCoRe

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 335 14 Updated Feb 5, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 5,787 376 Updated Mar 19, 2026

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 383 40 Updated Mar 25, 2026

lucidrains / tiny-recursive-model

Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau

Python 173 29 Updated Dec 23, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,554 1,430 Updated Feb 27, 2026

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,660 74 Updated Jan 20, 2026

xbench-ai / xbench-evals

Evergreen, contamination-free, real-world, domain-specific AI evaluation framework

Python 130 7 Updated Jan 11, 2026

FoundationAgents / awesome-foundation-agents

About Awesome things towards foundation agents. Papers / Repos / Blogs / ...

2,004 195 Updated Jul 28, 2025

meituan-longcat / LongCat-Flash-Chat

1,322 67 Updated Mar 22, 2026

rlite-project / RLite

A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithms with minimal intrusion.

Python 102 3 Updated Aug 25, 2025

Infrasys-AI / AIInfra

AIInfra（AI 基础设施）指AI系统从底层芯片等硬件，到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 6,542 861 Updated Dec 22, 2025

yaof20 / Flash-RL

Implementation for FP8/INT8 Rollout for RL training without performence drop.

Python 300 22 Updated Nov 7, 2025

thinkwee / AgentsMeetRL

Awesome List for Agentic RL

HTML 896 39 Updated Mar 24, 2026

langchain-ai / open_deep_research

Python 10,969 1,574 Updated Mar 27, 2026

Danau5tin / terminal-bench-rl

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 361 23 Updated Aug 24, 2025

zilliztech / deep-searcher

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,731 748 Updated Nov 19, 2025

Paper2Poster / Paper2Poster

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,558 247 Updated Dec 21, 2025

Xiaojian Yuan LetheSec

Lists (2)

Basic

Paper Code

Stars