LieGraph – An AI Agent-Driven "Who Is Spy" Game

LieGraph is a multi-agent implementation of the popular social deduction game "Who Is spy," built with LangGraph. It features AI agents that can reason, strategize, and interact in natural language to find the spy among them.

✨ Features

Autonomous AI Agents: AI players with unique personalities and strategic thinking capabilities
Dynamic Identity Inference: Agents continuously analyze conversation history and voting patterns to infer their own and others' identities
Natural Language Interaction: Agents communicate and reason in natural language throughout the game
Probabilistic Belief System: Sophisticated belief tracking with self-belief confidence and suspicions matrix
Strategic Reasoning: Advanced bluff detection, alliance formation, and long-term planning
LLM-driven Strategy: Structured tools for speech planning and voting decisions
Built-in Metrics: Automatic quality tracking for win balance, identification accuracy, and speech diversity with JSON reports for prompt evaluation workflows
Historical Analysis: CLI tools for aggregating and analyzing game metrics summaries

🚀 Quick Start

Prerequisites

Python 3.12+
Node.js 16+
uv (recommended for Python package management)

Environment Variables

Create a .env file in the root directory with your LLM configuration:

touch .env

Example for OpenAI:

LLM_PROVIDER=openai OPENAI_API_KEY="your_openai_api_key_here" OPENAI_MODEL="gpt-4o-mini"

Example for DeepSeek:

LLM_PROVIDER=deepseek DEEPSEEK_API_KEY="your_deepseek_api_key_here" DEEPSEEK_MODEL="deepseek-chat"

Installation & Running

Clone and setup:

git clone https://github.com/leslieo2/LieGraph.git cd LieGraph

Install dependencies:

# Install uv if needed curl -LsSf https://astral.sh/uv/install.sh | sh uv sync # Install UI dependencies cd ui-web/frontend npm install

Start services:

# Backend (from project root) langgraph dev --config langgraph.json --port 8124 --allow-blocking # Frontend (from ui-web/frontend) npm start

Open http://localhost:3000 to play the game.

🎮 How It Works

Game Flow

The game is orchestrated by a StateGraph from LangGraph that manages the complete game lifecycle:

Setup: Host agent assigns roles (Civilian/Spy) and corresponding words
Speaking Phase: Players take turns describing their words using LLM-based reasoning
Identity Inference: Agents analyze conversation patterns to deduce roles
Voting Phase: All players vote simultaneously based on accumulated evidence
Result: Player with most votes is eliminated
Win Condition: Game ends when spy is voted out (Civilians win) or spies outnumber civilians (Spies win)

AI Agent Architecture

Each AI player maintains an evolving "mindset" with sophisticated reasoning capabilities:

Dynamic Identity Inference:
- Self-identity analysis through word descriptions and voting patterns
- Other-player analysis tracking speech patterns and strategic behavior
- Real-time conversation history processing for inconsistency detection
Probabilistic Belief System:
- Self-belief confidence based on accumulated evidence
- Suspicion matrix tracking probabilistic beliefs about other players
- Systematic evidence recording of suspicious behaviors
Strategic Reasoning:
- Bluff detection and counter-bluff strategies
- Alliance formation and betrayal prevention
- Long-term planning based on evolving identity beliefs

graph TD START[START] --> HS[host_setup] HS --> HSS[host_stage_switch] HSS -->|speaking| SpeechNodes subgraph SpeechNodes [Speaking Phase] direction LR PS[player_speech_N] end SpeechNodes --> HSS HSS -->|voting| VoteNodes subgraph VoteNodes [Voting Phase - Concurrent] direction LR PV[player_vote_N] end VoteNodes --> CVT[check_votes_and_transition] CVT -->|votes ready| HR[host_result] CVT -->|waiting| __continue__ HR -->|continue| HSS HR -->|end| END[END] classDef hostNode fill:#e1f5fe classDef playerNode fill:#f3e5f5 classDef transitionNode fill:#e8f5e8 class HS,HSS,HR hostNode class PS,PV playerNode class CVT transitionNode

⚙️ Configuration

Customize the game by editing config.yaml:

game: player_count: 6 vocabulary: - ["Shakespeare", "Dumas"] - ["太阳", "月亮"] player_names: - "Alice" - "Bob" # ...

📊 Metrics & Evaluation

LieGraph ships with a lightweight metrics collector (src/game/metrics.py) that records quality indicators as games unfold:

Win balance: Civilian vs. spy win rates and a fairness score targeting 50/50 outcomes.
Identification accuracy: Tracks how confidently players identify their own roles and others over time.
Speech diversity: Measures lexical variety per speech turn to surface repetitive phrasing.

Metrics are streamed to memory during play and automatically persisted when a game ends:

Per-game summaries: logs/metrics/{game_id}.json
Rolling aggregate + functional quality score: logs/metrics/overall.json

You can also access the live collector from code by building a dependency bundle for each game instance:

from src.game.dependencies import build_dependencies deps = build_dependencies() collector = deps.metrics audit = collector.get_overall_metrics() score = collector.compute_quality_score() # deterministic # collector.compute_quality_score(method="llm", llm=client) for LLM-based review

These outputs are ready to feed into downstream prompt-evaluation or offline analysis pipelines.

🔗 Related Projects

LangGraph Mastery Playbook: A six-stage LangGraph curriculum of runnable Python modules that takes you from foundational graph patterns to production-ready retrieval systems.

Metrics Progress

Maintain a running ledger in docs/metrics-history.md so prompt and strategy changes can be tracked against key metrics.
After each batch, record the latest logs/metrics/overall.json summary there and archive the raw JSON if you need long-term snapshots.

🛠️ Development

Project Structure

LieGraph/ ├── src/ │ ├── game/ │ │ ├── graph.py # Main LangGraph workflow │ │ ├── state.py # Game state definitions │ │ ├── nodes/ # Graph node implementations │ │ ├── rules.py # Game logic and win conditions │ │ ├── strategy/ # AI strategy coordination and builders │ │ ├── agent_tools/ # Structured tools for speech and voting │ │ └── metrics.py # Game metrics and quality scoring ├── tests/ # Pytest test suite ├── ui-web/frontend/ # React web interface └── config.yaml # Game configuration

System Architecture

For detailed architecture information, component design, and integration patterns, see ARCHITECTURE.md.

Running Tests

python -m pytest tests/ -v

🗺️ Roadmap

Enhanced AI strategy and long-term memory
Game replay and analysis features
Support for more complex game modes
LLM benchmark capabilities for evaluating different models

🤝 Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch
Make changes and add tests
Submit a pull request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.github/workflows		.github/workflows
docs		docs
src		src
tests		tests
ui-web/frontend		ui-web/frontend
.env.template		.env.template
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
config.yaml		config.yaml
langgraph.json		langgraph.json
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LieGraph – An AI Agent-Driven "Who Is Spy" Game

✨ Features

🚀 Quick Start

Prerequisites

Environment Variables

Installation & Running

🎮 How It Works

Game Flow

AI Agent Architecture

⚙️ Configuration

📊 Metrics & Evaluation

🔗 Related Projects

Metrics Progress

🛠️ Development

Project Structure

System Architecture

Running Tests

🗺️ Roadmap

🤝 Contributing

📄 License

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

leslieo2/LieGraph

Folders and files

Latest commit

History

Repository files navigation

LieGraph – An AI Agent-Driven "Who Is Spy" Game

✨ Features

🚀 Quick Start

Prerequisites

Environment Variables

Installation & Running

🎮 How It Works

Game Flow

AI Agent Architecture

⚙️ Configuration

📊 Metrics & Evaluation

🔗 Related Projects

Metrics Progress

🛠️ Development

Project Structure

System Architecture

Running Tests

🗺️ Roadmap

🤝 Contributing

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages