open-r1

Here are 5 public repositories matching this topic...

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...) (AAAI 2025).

moe llama lora embedding liger peft multimodal reranker sft megatron llm internvl deepseek-r1 grpo open-r1 qwen3 llama4 qwen3-vl qwen3-next qwen3-omni

Updated Mar 25, 2026
Python

IAAR-Shanghai / xVerify

Star

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

benchmark regex reliability evaluation llm reliability-tools chatgpt cc-by-nc-nd-4 open-compass llm-as-a-judge deepseek-math judge-model reasoning-models open-r1 xverify math-verify

Updated Nov 13, 2025
Jupyter Notebook

Emo-gml / PsyLLM

Star

Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling

emotion dataset psychology mental-health open-r1

Updated Jan 24, 2026
Python

Exgc / R1V-Free

Star

R1V, trained with AI feedback, answers open-ended visual questions.

vlm open-r1 r1v vision-r1 video-r1

Updated Apr 12, 2025
Python

HappyXY / deepscaler

Star

Democratizing Reinforcement Learning for LLMs

open-r1

Updated Feb 16, 2025
Python

Improve this page

Add a description, image, and links to the open-r1 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the open-r1 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

open-r1

Here are 5 public repositories matching this topic...

modelscope / ms-swift

IAAR-Shanghai / xVerify

Emo-gml / PsyLLM

Exgc / R1V-Free

HappyXY / deepscaler

Improve this page

Add this topic to your repo