multi-step-reasoning

TreeThinkerAgent is a lightweight orchestration layer that turns any LLM into an autonomous multi-step reasoning agent. It supports multi-step planning, tool execution, and final synthesis while exposing the entire reasoning process as a tree you can explore.

lightweight openai automated-research multi-step-reasoning llm mistralai research-agent agentic-ai deep-research

Updated Feb 11, 2026
JavaScript

IBM / OpenDsStar

Star

OpenDsStar is an open-source implementation of the DS-Star agent that replaces file-based workflows with a flexible, tool-centric architecture. It supports incremental execution, reuses intermediate results, and makes complex multi-step agents more modular, efficient, and extensible.

benchmarking data-science agents reasoning agent-framework multi-step-reasoning tool-calling ds-star incremental-execution reusable-execution

Updated Mar 26, 2026
Python

Strong-AI-Lab / A-Neural-Symbolic-Paradigm

Star

From Symbolic Logic Reasoning to Soft Reasoning: A Neural-Symbolic Paradigm

natural-language-processing deep-learning transformer deductive-reasoning soft-reasoning symbolic-logic-reasoning neural-symbolic-paradigm multi-step-reasoning gate-attention

Updated Jul 18, 2022
Python

HarshTrivedi / DecomP-ODQA

Star

Official repository for ODQA experiments from Decomposed Prompting: A Modular Approach for Solving Complex Tasks, ICLR23

question-answering multi-step-reasoning large-language-models chain-of-thought retrieval-augmented-qa

Updated Jul 28, 2023
Jsonnet

Strong-AI-Lab / Multi-Step-Deductive-Reasoning-Over-Natural-Language

Star

Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation

deductive-reasoning multi-step-reasoning gate-attention out-of-distribution-generalisation

Updated Sep 22, 2023
Python

wzy6642 / PRP

Star

Official implementation for "Get an A in Math: Progressive Rectification Prompting" (AAAI 2024)

verification rectification iterative multi-step-reasoning gpt-35-turbo math-word-problem-solving zero-shot-prompting

Updated Mar 18, 2024
Python

Strong-AI-Lab / PARARULE-Plus

Star

PARARULE Plus: A Larger Deep Multi-Step Reasoning Dataset over Natural Language

natural-language-generation reasoning natural-language-understanding symbolic-logic soft-reasoning multi-step-reasoning

Updated Sep 22, 2023
Python

LakshitaS / Agentic-RAG-implementation

Star

Implementation of "Building Agentic RAG with LlamaIndex" offered by DeepLearning.AI focusing on developing intelligent research agents using the Retrieval-Augmented Generation (RAG) framework.

rag multi-step-reasoning agentic-workflow tool-calling router-query-engine

Updated Jun 25, 2024
Jupyter Notebook

pritamqu / VCRBench

Star

VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models

benchmark video reasoning multi-step-reasoning causal-reasoning multimodal-large-language-models large-multimodal-models large-video-language-models

Updated May 14, 2025
Python

ksm26 / Reinforcement-Fine-Tuning-LLMs-with-GRPO

Star

The course teaches how to fine-tune LLMs using Group Relative Policy Optimization (GRPO)—a reinforcement learning method that improves model reasoning with minimal data. Learn RFT concepts, reward design, LLM-as-a-judge evaluation, and deploy jobs on the Predibase platform.

reinforcement-learning machine-learning-algorithms language-model reward-design rft ai-training deeplearning-ai-courses ai-optimization multi-step-reasoning ai-evaluation rlhf llm-fine-tuning opensource-ai llm-as-judge predibase grpo llm-development token-level-control

Updated Jun 13, 2025
Jupyter Notebook

Haaaiawd / Sequential-thinking-skills

Star

Sequential thinking for AI agents: a reusable skill and CLI runtime for stepwise reasoning, revision, replay, and convergence — no extra MCP server required.

cursor reasoning multi-step-reasoning agent-skills sequential-thinking claude-code copilot-coding-agent claude-skills cli-runtime

Updated Mar 23, 2026
TypeScript

Draichi / fusion-center

Star

MCP-based OSINT intelligence platform with LangGraph AI agent. Implements multi-step reasoning (task decomposition, hypothesis testing, self-reflection, verification) for autonomous geopolitical research. Supports Gemini, Grok, Ollama, and Docker Model Runner

osint mcp ai-agents multi-step-reasoning langgraph

Updated Dec 31, 2025
Python

OjasD07 / scaler-openenv-hackathon

Star

A realistic OpenEnv environment for training AI agents to perform enterprise email triage across multi-email inbox workflows, with structured actions, tool usage, and reward shaping, built for the Scaler x Meta PyTorch Hackathon.

Updated Mar 25, 2026
Python

aman-tiwari001 / xray-decision-observability

Star

A general-purpose X-Ray library and dashboard that provides visibility into multi-step decision pipelines by capturing and visualizing why each decision was made.

visualization nodejs decision typescript sdk dashboard pipeline visibility transparency xray oberservability xray-core multi-step-reasoning

Updated Dec 30, 2025
TypeScript

Improve this page

Add a description, image, and links to the multi-step-reasoning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-step-reasoning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multi-step-reasoning

Here are 24 public repositories matching this topic...

StonyBrookNLP / ircot

ngl567 / KGR-Survey

mukhal / GRACE

versionHQ / multi-agent-system

OpenGVLab / VRBench

TianduoWang / MsAT

Bessouat40 / TreeThinkerAgent

IBM / OpenDsStar

Strong-AI-Lab / A-Neural-Symbolic-Paradigm

HarshTrivedi / DecomP-ODQA

Strong-AI-Lab / Multi-Step-Deductive-Reasoning-Over-Natural-Language

wzy6642 / PRP

Strong-AI-Lab / PARARULE-Plus

LakshitaS / Agentic-RAG-implementation

pritamqu / VCRBench

ksm26 / Reinforcement-Fine-Tuning-LLMs-with-GRPO

Haaaiawd / Sequential-thinking-skills

Draichi / fusion-center

OjasD07 / scaler-openenv-hackathon

aman-tiwari001 / xray-decision-observability

Improve this page

Add this topic to your repo