Skip to content

w4ester/SWE-smith

 
 

Repository files navigation

Kawhi the SWE-smith



SWE-smith is a toolkit for training software engineering (SWE) agents. With SWE-smith, you can:

  • Create an unlimited number of SWE-bench style task instances for any Python repository.
  • Generate trajectories of SWE-agent solving those task instances.
  • Train local LMs on these trajectories to improve their software engineering capabilities (SWE-agent-LM-32B).

🚀 Get Started

Check out the documentation for a complete guide on how to use SWE-smith, including how to

🏎️ Quick Start

Install the repo:

git clone https://github.com/SWE-bench/SWE-smith cd SWE-smith conda create -n smith python=3.10; conda activate smith; pip install -e .

Then, check out scripts/cheatsheet.sh for scripts to (1) create execution environments, (2) create task instances, and (3) train SWE-agents.

Tip

SWE-smith requires Docker to create execution environments. SWE-smith was developed and tested on Ubuntu 22.04.4 LTS. We do not plan on supporting Windows or MacOS.

💿 Resources

In addition to this toolkit, we've also provided several artifacts on the SWE-bench HuggingFace, including:

And there's more coming!

💫 Contributions

Excited about SWE-smith? We're actively working on several follow ups, and love meaningful collaborations! What we're thinking about...

  • Make SWE-smith work for non-Python languages
  • New bug generation techniques
  • Train SWE-agents with more trajectories and new methods

Check out the Contributing Guide for more.

Contact Person: John Yang, Kilian Lieret (Email: johnby@stanford.edu)

🪪 License

MIT. Check LICENSE for more information.

✍️ Citation

@misc{yang2025swesmith, title={SWE-smith: Scaling Data for Software Engineering Agents}, author={John Yang and Kilian Leret and Carlos E. Jimenez and Alexander Wettig and Kabir Khandpur and Yanzhe Zhang and Binyuan Hui and Ofir Press and Ludwig Schmidt and Diyi Yang}, year={2025}, eprint={2504.21798}, archivePrefix={arXiv}, primaryClass={cs.SE}, url={https://arxiv.org/abs/2504.21798}, }

📕 Related Works

SWE-bench    SWE-agent    SWE-ReX    sb-cli

About

Scaling Data for SWE-agents

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages

  • Python 97.9%
  • Shell 2.1%