Loki Mode

Build the future, faster.

Describe what you want. Get production-ready code.

Website | Documentation | Installation | Changelog | Purple Lab Web UI

How it works: You provide a PRD. Loki Mode classifies complexity, assembles an agent team from 41 specialized types across 8 swarms, and runs autonomous RARV cycles (Reason - Act - Reflect - Verify) with 9 quality gates. Code is not "done" until it passes automated verification. Output is a Git repo with source, tests, configs, and audit logs.

Why Loki Mode?

Truly autonomous -- Describe what you want, walk away, come back to working code with tests
Production quality built in -- 9 quality gates, blind 3-reviewer code review, anti-sycophancy checks
Self-hosted and private -- Your keys, your infrastructure, no data leaves your network
5 AI providers -- Claude, Codex, Gemini, Cline, Aider with automatic failover
Full-stack output -- Source code, tests, Docker configs, CI/CD pipelines, audit logs
Open source -- Free for personal, internal, and academic use. No vendor lock-in.

Get Started in 30 Seconds

npm install -g loki-mode loki doctor # verify environment loki init my-app --template simple-todo-app cd my-app loki start prd.md # autonomous build starts

Or skip scaffolding and go straight to a quick task:

loki quick "build a landing page with a signup form"

Other install methods

Method	Command
Homebrew	`brew tap asklokesh/tap && brew install loki-mode`
Docker	`docker pull asklokesh/loki-mode`
Inside Claude Code	`claude --dangerously-skip-permissions` then type "Loki Mode"
Git clone	`git clone https://github.com/asklokesh/loki-mode.git`

See the full Installation Guide.

What You Can Build

Project	Build Time	Complexity
Landing page with signup form	~10 min	Simple
REST API with JWT auth	~20 min	Simple
Portfolio with animations	~15 min	Simple
SaaS dashboard with analytics	~25 min	Standard
E-commerce store with Stripe	~45 min	Standard
Task manager with kanban board	~25 min	Standard
Chat app with WebSocket	~30 min	Standard
Blog platform with MDX	~30 min	Standard
Microservice architecture	~2 hours	Complex
ML pipeline with monitoring	~3 hours	Complex

What It Looks Like

block-beta columns 5 block:header:5 columns 5 h1["Purple Lab -- Powered by Loki"]:5 end block:sidebar:1 columns 1 s1["Home"] s2["Projects"] s3["Templates"] s4["Teams"] s5["Showcase"] s6["Compare"] s7["Settings"] s8["Docs"] end block:main:4 columns 4 prompt["Build a SaaS dashboard with user analytics..."]:4 start["Start Building"]:4 p1["Phase Viewer"] p2["Agent Dashboard"] p3["Terminal"] p4["Quality Gates"] p5["File Browser"]:2 p6["Memory Viewer"]:2 end style header fill:#553DE9,color:#fff style prompt fill:#f5f5f5,stroke:#ccc,color:#333 style start fill:#553DE9,color:#fff style sidebar fill:#fafafa,stroke:#ddd

block-beta columns 6 block:wsheader:6 columns 6 wh["Project Workspace"]:6 end block:filetree:1 columns 1 ft1["src/"] ft2["App.tsx"] ft3["index.ts"] ft4["public/"] ft5["tests/"] ft6["package.json"] end block:editor:3 columns 1 code["Monaco Editor"]:1 preview["Live Preview -- localhost:3000"]:1 end block:chat:2 columns 1 ai["AI Chat Panel"] status["Agent Status / Quality Gates"] phase["Phase: Development"] end style wsheader fill:#553DE9,color:#fff style code fill:#1e1e1e,color:#d4d4d4 style preview fill:#fff,stroke:#553DE9,color:#333 style ai fill:#f5f5f5,stroke:#ccc,color:#333 style phase fill:#1FC5A8,color:#fff

What To Expect

	Simple	Standard	Complex
Examples	Landing page, todo app, single API	CRUD + auth, REST API + React	Microservices, real-time, ML pipelines
Duration	5-30 min	30-90 min	2+ hours
Autonomy	Completes independently	May need guidance on complex parts	Use as accelerator with human review

Architecture

RARV Cycle

Every iteration: Reason (read state) - Act (execute, commit) - Reflect (update context) - Verify (run tests, check spec). Failures trigger self-correction.

Core Workflow

41 Agent Types

8 swarms: engineering, operations, business, data, product, growth, review, orchestration. Auto-composed by PRD complexity.

Agent Types

9 Quality Gates

Blind review, anti-sycophancy, severity blocking, mock/mutation detection. Code does not ship until all gates pass.

Quality Gates

Memory System

3-tier architecture: episodic (interaction traces), semantic (generalized patterns), procedural (learned skills). Vector search optional.

Memory Architecture

Dashboard

Real-time monitoring, agent status, task queue, WebSocket streaming. Auto-starts at localhost:57374.

Dashboard Guide

Enterprise Layer

TLS, OIDC/SSO, RBAC, OTEL tracing, policy engine, audit trails. Activated via env vars.

Enterprise Guide

Purple Lab

The hosted development platform. A Replit-like web UI for visual PRD-to-code workflow with AI chat for iterative development.

loki web # launches at http://localhost:57375

Platform Pages

Home -- One-line prompt to start building instantly
Projects -- Browse, search, filter past builds
Templates -- 20+ starter PRDs by category
Showcase -- Gallery of example projects to build
Compare -- Feature comparison vs competitors

IDE Workspace

Monaco editor with tabs, Cmd+P quick open
AI chat panel for iterative development
Activity panel: build log, agents, quality gates
Live preview with URL bar navigation
Right-click context menu: Review, Test, Explain

Loki Mode vs. Alternatives

Feature	Loki Mode	bolt.new	Replit	Lovable
Self-hosted / your keys	Yes	No	No	No
5 AI provider failover	Yes	No	No	No
9 quality gates	Yes	No	No	No
Blind code review	Yes	No	No	No
Enterprise auth (SSO/RBAC)	Yes	No	Yes	No
Air-gapped deployment	Yes	No	No	No
Docker + CI/CD generation	Yes	No	Yes	No
Open source	Yes	No	No	No
Free tier	Open source	Yes	Yes	Yes

Loki Mode is the only platform that is fully self-hosted, open source, and includes automated quality verification. Your code, your keys, your infrastructure.

Multi-Provider Support

Provider	Autonomous Flag	Parallel Agents	Install
Claude Code	`--dangerously-skip-permissions`	Yes (10+)	`npm i -g @anthropic-ai/claude-code`
Codex CLI	`--full-auto`	Sequential	`npm i -g @openai/codex`
Gemini CLI	`--approval-mode=yolo`	Sequential	`npm i -g @google/gemini-cli`
Cline CLI	`--auto-approve`	Sequential	`npm i -g @anthropic-ai/cline`
Aider	`--yes-always`	Sequential	`pip install aider-chat`

Claude gets full features (subagents, parallelization, MCP, Task tool). Other providers run sequentially. Auto-failover switches providers when rate-limited. See Provider Guide.

CLI Reference

All commands

Command	Description
`loki start [PRD]`	Start with optional PRD file
`loki stop`	Stop execution
`loki pause` / `resume`	Pause/resume after current session
`loki status`	Show current status
`loki dashboard`	Open web dashboard
`loki web`	Launch Purple Lab web UI
`loki doctor`	Check environment and dependencies
`loki plan [PRD]`	Pre-execution analysis: complexity, cost, iterations
`loki review [--staged\|--diff]`	AI-powered code review with severity filtering
`loki test [--file\|--dir\|--changed]`	AI test generation (8 languages, 9 frameworks)
`loki onboard [path]`	Project analysis and CLAUDE.md generation
`loki import`	Import GitHub issues as tasks
`loki ci`	CI/CD quality gate integration
`loki failover`	Cross-provider auto-failover management
`loki memory <cmd>`	Memory system: index, timeline, search, consolidate
`loki enterprise`	Enterprise feature management
`loki version`	Show version

Run loki --help for all options. Full reference: CLI Reference | Config: config.example.yaml

BMAD Method Integration

Loki Mode integrates with the BMAD Method, a structured AI-driven agile methodology. If your project uses BMAD for requirements elicitation, Loki Mode can consume those artifacts directly:

loki start --bmad-project ./my-project

The adapter handles BMAD's frontmatter conventions, FR-format functional requirements, Given/When/Then acceptance criteria, and artifact chain validation. Non-BMAD projects are unaffected -- the integration is opt-in via --bmad-project.

See BMAD Integration Validation.

Enterprise Features

Enterprise features are included but require env var activation. Self-audit: 35/45 capabilities working, 0 broken, 1,314 tests passing.

export LOKI_TLS_ENABLED=true export LOKI_OIDC_PROVIDER=google export LOKI_AUDIT_ENABLED=true loki enterprise status

Benchmarks

Self-reported results from the included test harness. Verification scripts included for reproduction.

Benchmark	Result	Notes
HumanEval	162/164 (98.78%)	Max 3 retries, RARV self-verification
SWE-bench	299/300 patches	Patch generation -- evaluator not yet run

See benchmarks/ for methodology.

Presentation

9 slides: Problem, Solution, 41 Agents, RARV Cycle, Benchmarks, Multi-Provider, Full Lifecycle

Download PPTX

Limitations

Area	What Works	What Doesn't (Yet)
Code Gen	Full-stack apps from PRDs	Complex domain logic may need human review
Deploy	Generates configs, Dockerfiles, CI/CD	Does not deploy -- human runs deploy commands
Testing	9 automated quality gates	Test quality depends on AI assertions
Providers	5 providers with auto-failover	Non-Claude providers lack parallel agents
Dashboard	Real-time single-machine monitoring	No multi-node clustering

What "autonomous" means: The system runs RARV cycles without prompting. It does NOT access your cloud accounts, payment systems, or external services unless you provide credentials. Human oversight is expected for deployment, API keys, and critical decisions.

Research Foundation

Papers and sources

Source	What We Use
Anthropic: Building Effective Agents	Evaluator-optimizer, parallelization
Anthropic: Constitutional AI	Self-critique against quality principles
DeepMind: Scalable Oversight via Debate	Debate-based verification in council review
DeepMind: SIMA 2	Self-improvement loop design
OpenAI: Agents SDK	Guardrails, tripwires, tracing
NVIDIA ToolOrchestra	Efficiency metrics, reward signals
CONSENSAGENT (ACL 2025)	Anti-sycophancy in blind review
GoalAct	Hierarchical planning for complex PRDs

Practitioner insights: Boris Cherny, Simon Willison, HN Community

Full Acknowledgements -- 50+ papers and resources

Contributing

git clone https://github.com/asklokesh/loki-mode.git && cd loki-mode npm install && npm test # 683 tests python3 -m pytest # 631 tests

See CONTRIBUTING.md for guidelines.

License

Business Source License 1.1 -- Free for personal, internal, academic, and non-commercial use. Converts to Apache 2.0 on March 19, 2030. Contact founder@autonomi.dev for commercial licensing.

Autonomi | Documentation | Changelog | Comparisons

Name		Name	Last commit message	Last commit date
Latest commit History 848 Commits
.claude		.claude
.github		.github
.loki-test-tmp		.loki-test-tmp
.loki		.loki
agent-skills		agent-skills
agents		agents
api-examples		api-examples
api		api
assets		assets
autonomy		autonomy
benchmarks		benchmarks
bin		bin
blog		blog
collab		collab
completions		completions
coverage		coverage
dashboard-ui		dashboard-ui
dashboard		dashboard
demo		demo
deploy		deploy
docs		docs
events		events
examples		examples
integrations		integrations
learning		learning
mcp		mcp
memory		memory
providers		providers
references		references
schemas		schemas
scripts		scripts
sdk		sdk
signatures/version1		signatures/version1
skills		skills
src		src
state		state
swarm		swarm
templates		templates
tests		tests
tools		tools
vscode-extension		vscode-extension
web-app		web-app
website		website
wiki		wiki
.dockerignore		.dockerignore
.env		.env
.gitignore		.gitignore
.mcp.json		.mcp.json
.npmignore		.npmignore
404.html		404.html
CHANGELOG.md		CHANGELOG.md
CLA.md		CLA.md
CLAUDE.md		CLAUDE.md
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
DOCKER_README.md		DOCKER_README.md
Dockerfile		Dockerfile
Dockerfile.purplelab-test		Dockerfile.purplelab-test
Dockerfile.sandbox		Dockerfile.sandbox
Dockerfile.test-runner		Dockerfile.test-runner
LICENSE		LICENSE
LICENSE-CHANGE-NOTICE.md		LICENSE-CHANGE-NOTICE.md
README.md		README.md
SKILL.md		SKILL.md
VERSION		VERSION
action.yml		action.yml
conftest.py		conftest.py
docker-compose.test.yml		docker-compose.test.yml
docker-compose.yml		docker-compose.yml
index.html		index.html
package.json		package.json
pytest.ini		pytest.ini
requirements-test.txt		requirements-test.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Loki Mode

Build the future, faster.

Why Loki Mode?

Get Started in 30 Seconds

What You Can Build

What It Looks Like

What To Expect

Architecture

RARV Cycle

41 Agent Types

9 Quality Gates

Memory System

Dashboard

Enterprise Layer

Purple Lab

Loki Mode vs. Alternatives

Multi-Provider Support

CLI Reference

Limitations

Research Foundation

Contributing

License

About

Licenses found

Uh oh!

Releases 343

Packages

Uh oh!

Uh oh!

Contributors 5

Languages

Folders and files

Latest commit

History

Repository files navigation

Loki Mode

Build the future, faster.

Why Loki Mode?

Get Started in 30 Seconds

What You Can Build

What It Looks Like

What To Expect

Architecture

RARV Cycle

41 Agent Types

9 Quality Gates

Memory System

Dashboard

Enterprise Layer

Purple Lab

Loki Mode vs. Alternatives

Multi-Provider Support

CLI Reference

Limitations

Research Foundation

Contributing

License

About

Topics

Resources

License

Licenses found

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 343

Packages 0

Uh oh!

Uh oh!

Contributors 5

Languages

Packages