MygramDB

19-720x faster than MySQL FULLTEXT. In-memory full-text search engine with MySQL replication.

Why MygramDB?

MySQL FULLTEXT is painfully slow: it scans B-tree pages on disk, does not compress postings, and struggles with common terms.

MygramDB solves this with an in-memory search replica that syncs from MySQL via GTID-based binlog replication, keeping queries under 100ms even when a search term matches 75% of your rows.

Performance

Tested on 1.7M rows of real production data, with the MygramDB cache disabled (cache.enabled: false):

| Query type | MySQL | MygramDB | Speedup |
|---|---|---|---|
| SORT id LIMIT 100 (typical) | 908-2,980ms | 22-92ms | 19-32x |
| COUNT queries | 124-2,891ms | 0.3-6.7ms | 413-431x |
| 10 concurrent | 90% failed, QPS 0.4 | 100% success, QPS 288 | 720x |
| 100 concurrent | Cannot execute | QPS 372 | - |

Key advantages:

No cache warmup needed - Always fast, even on cold starts
SORT optimization - Uses primary key index (no external sort)
Consistent performance - MySQL varies 900ms-3s, MygramDB stays under 100ms
COUNT performance - Up to 431x faster
Parallel performance - MySQL fails 90% at 10 concurrent; MygramDB achieves QPS 372 at 100 concurrent

Real-world impact: Under heavy concurrent load, MySQL FULLTEXT fails 90% of connections. MygramDB handles 100 concurrent queries with 100% success rate and QPS 372.
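
As a sanity check, the speedup figures for the COUNT and 10-concurrent rows follow directly from the raw numbers in the table. The short Python sketch below recomputes them. The `speedup` helper is purely illustrative and is not part of MygramDB.

```python
def speedup(larger: float, smaller: float) -> float:
    """Return how many times `larger` exceeds `smaller`."""
    return larger / smaller


# COUNT queries: latencies in milliseconds (lower is better),
# so the ratio is MySQL latency / MygramDB latency.
count_fast_case = speedup(124, 0.3)
count_slow_case = speedup(2891, 6.7)

# 10 concurrent clients: throughput in QPS (higher is better),
# so the ratio is MygramDB QPS / MySQL QPS.
qps_gain = speedup(288, 0.4)

print(f"COUNT speedup: {count_fast_case:.0f}x to {count_slow_case:.0f}x")
print(f"QPS gain at 10 concurrent: {qps_gain:.0f}x")
```

Latency ratios and throughput ratios are inverted relative to each other, which is why the helper takes the larger value first in both cases.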

See Performance Guide for detailed benchmarks.

Quick Start

Docker (Production Ready)

Prerequisites: Ensure MySQL has GTID mode enabled:

    -- Check GTID mode (should be ON)
    SHOW VARIABLES LIKE 'gtid_mode';

    -- If OFF, enable GTID mode (MySQL 8.0+)
    SET GLOBAL enforce_gtid_consistency = ON;
    SET GLOBAL gtid_mode = OFF_PERMISSIVE;
    SET GLOBAL gtid_mode = ON_PERMISSIVE;
    SET GLOBAL gtid_mode = ON;
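
MySQL only lets gtid_mode move one step at a time (OFF, OFF_PERMISSIVE, ON_PERMISSIVE, ON), which is why the statements above come in that order. If the server is already partway through, only the remaining steps are needed. The Python sketch below derives them from the current value. `gtid_enable_statements` is a hypothetical helper, not something MygramDB ships, and it does not wait for in-flight anonymous transactions to drain between steps, which a busy server may require.

```python
from typing import List

# gtid_mode may only move to an adjacent value in this sequence.
GTID_MODES = ["OFF", "OFF_PERMISSIVE", "ON_PERMISSIVE", "ON"]


def gtid_enable_statements(current_mode: str) -> List[str]:
    """Return the SQL statements still needed to reach gtid_mode = ON."""
    mode = current_mode.upper()
    if mode not in GTID_MODES:
        raise ValueError(f"unknown gtid_mode: {current_mode!r}")
    if mode == "ON":
        return []  # nothing to do

    # enforce_gtid_consistency must be ON before gtid_mode can reach ON.
    statements = ["SET GLOBAL enforce_gtid_consistency = ON;"]
    for next_mode in GTID_MODES[GTID_MODES.index(mode) + 1:]:
        statements.append(f"SET GLOBAL gtid_mode = {next_mode};")
    return statements


for statement in gtid_enable_statements("OFF"):
    print(statement)
```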

Start MygramDB:

    docker run -d --name mygramdb \
      -p 11016:11016 \
      -e MYSQL_HOST=your-mysql-host \
      -e MYSQL_USER=repl_user \
      -e MYSQL_PASSWORD=your_password \
      -e MYSQL_DATABASE=mydb \
      -e TABLE_NAME=articles \
      -e TABLE_PRIMARY_KEY=id \
      -e TABLE_TEXT_COLUMN=content \
      -e TABLE_NGRAM_SIZE=2 \
      -e REPLICATION_SERVER_ID=12345 \
      -e NETWORK_ALLOW_CIDRS=0.0.0.0/0 \
      ghcr.io/libraz/mygram-db:latest

    # Check logs
    docker logs -f mygramdb

    # Trigger initial data sync (required on first start)
    docker exec mygramdb mygram-cli -p 11016 SYNC articles

    # Try a search
    docker exec mygramdb mygram-cli -p 11016 SEARCH articles "hello world"

Security Note: NETWORK_ALLOW_CIDRS=0.0.0.0/0 allows connections from any IP address. For production, restrict to specific IP ranges:

    # Production example: Allow only from application servers
    -e NETWORK_ALLOW_CIDRS=10.0.0.0/8,172.16.0.0/12
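
To make the effect of a CIDR allow-list concrete, the Python sketch below checks client addresses against a comma-separated list like the one above, using only the standard `ipaddress` module. It illustrates CIDR matching in general. `is_allowed` is a hypothetical helper and is not how MygramDB itself implements the check.

```python
import ipaddress


def is_allowed(client_ip: str, allow_cidrs: str) -> bool:
    """Return True if client_ip falls inside any CIDR in the comma-separated list."""
    address = ipaddress.ip_address(client_ip)
    networks = [
        ipaddress.ip_network(cidr.strip())
        for cidr in allow_cidrs.split(",")
        if cidr.strip()
    ]
    return any(address in network for network in networks)


production = "10.0.0.0/8,172.16.0.0/12"
for ip in ("10.1.2.3", "172.16.5.4", "192.168.1.10"):
    print(ip, "allowed" if is_allowed(ip, production) else "rejected")
```

With the production list, an application server at 10.1.2.3 is accepted while a host at 192.168.1.10 is rejected. With 0.0.0.0/0, every IPv4 address is accepted.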

Docker Compose (with Test MySQL)

    git clone https://github.com/libraz/mygram-db.git
    cd mygram-db
    docker-compose up -d

    # Wait for MySQL to be ready (check with docker-compose logs -f)

    # Trigger initial data sync
    docker-compose exec mygramdb mygram-cli -p 11016 SYNC articles

    # Try searching
    docker-compose exec mygramdb mygram-cli -p 11016 SEARCH articles "hello"

Includes MySQL 8.4 with sample data for instant testing.

Basic Usage

    # Search with pagination
    SEARCH articles "hello world" SORT id LIMIT 100

    # Sort by custom column
    SEARCH articles "hello" SORT created_at DESC LIMIT 50

    # LIMIT with offset (MySQL-style)
    SEARCH articles "tech" LIMIT 10,100  # offset=10, count=100

    # Count matches
    COUNT articles "hello world"

    # Multi-term AND search
    SEARCH articles hello AND world

    # With filters
    SEARCH articles tech FILTER status=1 LIMIT 100

    # Get by primary key
    GET articles 12345

See Protocol Reference for all commands.
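
Applications usually assemble these command strings in code. The Python sketch below reproduces the SEARCH examples above from structured arguments. `build_search` and `quote` are hypothetical helpers, not part of any MygramDB client library. Placing FILTER before SORT when both are present is an assumption, and quotes inside a phrase are not escaped, so check the Protocol Reference before relying on either.

```python
from typing import Optional, Tuple


def quote(phrase: str) -> str:
    """Wrap a phrase in double quotes, as in the examples above (no escaping)."""
    return f'"{phrase}"'


def build_search(
    table: str,
    query: str,
    *,
    filter_by: Optional[Tuple[str, object]] = None,
    sort: Optional[str] = None,
    descending: bool = False,
    limit: Optional[int] = None,
    offset: Optional[int] = None,
) -> str:
    """Assemble a SEARCH command string from its parts."""
    parts = ["SEARCH", table, query]
    if filter_by is not None:
        column, value = filter_by
        parts.append(f"FILTER {column}={value}")  # assumed to precede SORT
    if sort:
        parts.append(f"SORT {sort}" + (" DESC" if descending else ""))
    if limit is not None:
        # MySQL-style "LIMIT offset,count" when an offset is given.
        parts.append(f"LIMIT {offset},{limit}" if offset is not None else f"LIMIT {limit}")
    return " ".join(parts)


print(build_search("articles", quote("hello world"), sort="id", limit=100))
print(build_search("articles", "tech", filter_by=("status", 1), limit=100))
```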

Features

Fast: 19-720x faster than MySQL FULLTEXT (QPS 372 at 100 concurrent)
MySQL Replication: Real-time GTID-based binlog streaming
Runtime Variables: MySQL-style SET/SHOW VARIABLES for zero-downtime config changes
MySQL Failover: Switch MySQL servers at runtime with GTID position preservation
Multiple Tables: Index multiple tables in one instance
Dual Protocol: TCP (memcached-style) and HTTP/REST API
High Concurrency: Thread pool supporting 10,000+ connections
Unicode: ICU-based normalization for CJK/multilingual text
Compression: Hybrid Delta encoding + Roaring bitmaps
Easy Deploy: Single binary or Docker container
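
The compression feature mentions delta encoding. As a rough illustration of the idea, the Python sketch below stores a sorted posting list (the document IDs containing one n-gram) as gaps between consecutive IDs, which are small numbers that pack tightly. This is a generic textbook sketch, not MygramDB's actual on-memory format, and it leaves out the Roaring bitmap side entirely. How MygramDB chooses between the two encodings is not described here.

```python
from typing import List


def delta_encode(doc_ids: List[int]) -> List[int]:
    """Encode a strictly increasing list of IDs as gaps from the previous ID."""
    deltas = []
    previous = 0
    for index, doc_id in enumerate(doc_ids):
        if index > 0 and doc_id <= previous:
            raise ValueError("doc_ids must be strictly increasing")
        deltas.append(doc_id - previous)
        previous = doc_id
    return deltas


def delta_decode(deltas: List[int]) -> List[int]:
    """Rebuild the original IDs by accumulating the gaps."""
    doc_ids = []
    running = 0
    for delta in deltas:
        running += delta
        doc_ids.append(running)
    return doc_ids


postings = [1000, 1003, 1004, 1010]
encoded = delta_encode(postings)
print(encoded)                # [1000, 3, 1, 6]
print(delta_decode(encoded))  # [1000, 1003, 1004, 1010]
```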

Architecture

    graph LR
        MySQL[MySQL Primary] -->|binlog GTID| MygramDB1[MygramDB #1]
        MySQL -->|binlog GTID| MygramDB2[MygramDB #2]
        MygramDB1 -->|Search| App[Application]
        MygramDB2 -->|Search| App
        App -->|Write| MySQL

MygramDB acts as a specialized read replica for full-text search, while MySQL handles writes and normal queries.
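
To give a feel for what such a search replica does, here is a minimal Python sketch of an in-memory n-gram index that applies row changes (as a binlog consumer would) and answers substring searches. It uses bigrams, matching TABLE_NGRAM_SIZE=2 from the Quick Start. `TinyNgramIndex` and its methods are hypothetical names for illustration. The real engine adds Unicode normalization, compressed postings, sorting, and filtering, none of which appear here.

```python
from collections import defaultdict
from typing import Dict, List, Set


def ngrams(text: str, n: int = 2) -> Set[str]:
    """Return the set of all character n-grams in text."""
    return {text[i:i + n] for i in range(len(text) - n + 1)}


class TinyNgramIndex:
    def __init__(self, n: int = 2) -> None:
        self.n = n
        self.docs: Dict[int, str] = {}
        self.postings: Dict[str, Set[int]] = defaultdict(set)

    def upsert(self, doc_id: int, text: str) -> None:
        """Apply an INSERT or UPDATE row event."""
        self.delete(doc_id)
        self.docs[doc_id] = text
        for gram in ngrams(text, self.n):
            self.postings[gram].add(doc_id)

    def delete(self, doc_id: int) -> None:
        """Apply a DELETE row event (no-op if the row is unknown)."""
        old_text = self.docs.pop(doc_id, None)
        if old_text is None:
            return
        for gram in ngrams(old_text, self.n):
            ids = self.postings[gram]
            ids.discard(doc_id)
            if not ids:
                del self.postings[gram]

    def search(self, query: str) -> List[int]:
        """Return sorted IDs of documents containing query as a substring."""
        grams = ngrams(query, self.n)
        if not grams:
            # Query shorter than n characters: fall back to a full scan.
            return sorted(d for d, text in self.docs.items() if query in text)
        # Intersect postings to get candidates, then verify the actual substring.
        candidates = set.intersection(*(self.postings.get(g, set()) for g in grams))
        return sorted(d for d in candidates if query in self.docs[d])


index = TinyNgramIndex()
index.upsert(1, "hello world")
index.upsert(2, "hello there")
index.upsert(3, "東京都の天気")
print(index.search("hello"))  # [1, 2]
print(index.search("京都"))    # [3]
```

Because n-grams do not depend on word boundaries, the same code handles Japanese text without a tokenizer, which is the reason n-gram indexes suit CJK content.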

When to Use MygramDB

✅ Good fit:

Search-heavy workloads (read >> write)
Millions of documents with full-text search
Need sub-100ms search latency
Simple deployment requirements
Japanese/CJK text with ngrams

❌ Not recommended:

Write-heavy workloads
Dataset doesn't fit in RAM (~1-2GB per million docs)
Need distributed search across nodes
Complex aggregations/analytics

Documentation

CHANGELOG - Version history and release notes
Docker Deployment Guide - Production Docker setup
Configuration Guide - All configuration options
Protocol Reference - Complete command reference
HTTP API Reference - REST API documentation
Performance Guide - Benchmarks and optimization
Replication Guide - MySQL replication setup
Operations Guide - Runtime variables and MySQL failover
Installation Guide - Build from source
Development Guide - Contributing guidelines
Client Library - C/C++ client library

Release Notes

Latest Release - Download binaries
Detailed Release Notes - Version-specific migration guides

Requirements

System:

RAM: ~1-2GB per million documents
OS: Linux or macOS
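
For capacity planning, the RAM rule of thumb above turns into simple arithmetic. The Python sketch below applies it to the 1.7M-row dataset from the benchmarks. `estimate_ram_gb` is a hypothetical helper, and real usage depends on document length and n-gram size, so treat the result as a rough range.

```python
from typing import Tuple


def estimate_ram_gb(
    doc_count: int,
    gb_per_million: Tuple[float, float] = (1.0, 2.0),
) -> Tuple[float, float]:
    """Return a (low, high) RAM estimate in GB using the ~1-2GB per million docs rule."""
    millions = doc_count / 1_000_000
    low, high = gb_per_million
    return millions * low, millions * high


low, high = estimate_ram_gb(1_700_000)
print(f"1.7M documents: roughly {low:.1f}-{high:.1f} GB of RAM")
```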

MySQL:

Version: 5.7.6+ or 8.0+
GTID mode enabled (gtid_mode=ON)
Binary log format: ROW (binlog_format=ROW)
Replication privileges: REPLICATION SLAVE, REPLICATION CLIENT
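
The version, GTID, and binlog requirements above can be checked mechanically from the output of SHOW VARIABLES. The Python sketch below takes those values as a plain dictionary and lists anything that does not meet the requirements. `check_mysql_requirements` is a hypothetical helper. It treats "5.7.6+ or 8.0+" as "5.7.6 or newer", does not check replication privileges, and does not distinguish MySQL from forks that use other version numbering.

```python
import re
from typing import Dict, List


def check_mysql_requirements(variables: Dict[str, str]) -> List[str]:
    """Return a list of unmet requirements (empty if everything looks fine)."""
    problems = []

    # Version strings may carry suffixes such as "5.7.44-log".
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", variables.get("version", ""))
    if not match:
        problems.append("cannot parse MySQL version")
    else:
        version = tuple(int(part or 0) for part in match.groups())
        if version < (5, 7, 6):
            problems.append("MySQL 5.7.6 or newer is required")

    if variables.get("gtid_mode", "").upper() != "ON":
        problems.append("gtid_mode must be ON")
    if variables.get("binlog_format", "").upper() != "ROW":
        problems.append("binlog_format must be ROW")
    return problems


server = {"version": "5.7.5-log", "gtid_mode": "OFF", "binlog_format": "MIXED"}
for problem in check_mysql_requirements(server):
    print("-", problem)
```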

See Installation Guide for details.

License

MIT License

Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines.

For development environment setup, see Development Guide.

Authors

libraz <libraz@libraz.net>

Acknowledgments

Roaring Bitmaps for compressed bitmaps
ICU for Unicode support
spdlog for logging
yaml-cpp for configuration

