Version 1.0.0 · MIT License · Open Standard
📖 Read the article on Medium — AI Is Now the Primary Reader of Your Docs. Nobody Told Your Docs.
📚 Full documentation — specification, writing guide, examples, ecosystem
📄 Position paper — DOI: 10.5281/zenodo.19019719
👤 Author: Nikša Barlović — ORCID: 0009-0004-7421-0913
HADS is a lightweight convention for writing technical documentation that works equally well for humans and AI language models — with AI as the primary consumer.
It is not a new file format. It is a set of tagging and structural rules applied to standard Markdown. Any editor renders it. Any AI reads it. No tooling required.
Technical documentation is increasingly read by AI models before humans ever see it. Models are used to look up APIs, understand file formats, debug errors, and generate code. But documentation is written for humans — verbose, contextual, narrative.
This creates waste:
- AI consumes thousands of tokens to extract a handful of facts
- Smaller / local models lose context and hallucinate
- Humans still need the context that AI wants to skip
HADS solves this by separating signal from context — in the same document, without duplication.
Every block in a HADS document is tagged with its intended reader:
| Tag | Meaning | Primary reader |
|---|---|---|
| [SPEC] | Machine-readable fact. Terse. Authoritative. | AI |
| [NOTE] | Human context — why, history, caveats, examples | Human |
| [BUG] | Verified failure mode + fix. Always authoritative. | Both |
| [?] | Unverified / inferred. Treat as hypothesis. | Both |
An AI instruction at the top of every HADS document tells the model what to read and what to skip. This is the key innovation: the document teaches the model how to read it.
- Small / local models — can extract facts without reading everything
- Large models — spend fewer tokens, reduce hallucination risk
- Developers — write once, serves both audiences
- Teams — single source of truth, no separate "AI context" files
- Open source projects — documentation that AI coding assistants actually use correctly
Best measured example from the current benchmark corpus. AI loads only what it needs.
| Metric | Full document | [SPEC] only |
|---|---|---|
| Tokens loaded | 1,704 | 1,079 |
| Reduction | — | 36.68% |
| Information lost | — | none (facts intact) |
The rest stays in the file for humans. One source. No duplication.
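The reduction row follows directly from the two token counts in the table above (a quick arithmetic check):

```python
full_doc, spec_only = 1_704, 1_079    # token counts from the table above

reduction = 1 - spec_only / full_doc  # fraction of tokens the AI never loads
print(f"{reduction:.2%}")             # → 36.68%
```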
## Authentication

**[SPEC]**
- Method: Bearer token
- Header: `Authorization: Bearer <token>`
- Token expiry: 3600 seconds
- Refresh endpoint: `POST /auth/refresh`

**[NOTE]** Tokens were originally session-based (pre-v2.0). If you see legacy docs mentioning cookie auth, ignore them. The switch happened in 2022 when the API went public. Refresh tokens do not expire but can be revoked.

**[BUG] Token silently rejected after password change**
Cause: All tokens invalidated on password change, no error returned.
Symptom: 401 with body `{"error": "invalid_token"}` — identical to expired token.
Fix: Re-authenticate and store new token. Check for 401 after any account operation.

An AI reading this extracts: method, header format, expiry, refresh endpoint — in one [SPEC] block. A human reads the [NOTE] and understands history and context. Both read the [BUG].
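The SPEC-only loading described above can be sketched in a few lines of Python. This is a simplified illustration, not the official validator: it assumes each block begins with a bold tag marker like `**[SPEC]**` and runs until the next marker.

```python
import re

def spec_only(doc: str, keep=("[SPEC]", "[BUG]")) -> str:
    """Return only the blocks tagged for machine consumption.

    Simplification of the full HADS spec: a block is everything
    from one bold tag marker to the next.
    """
    # Split the document at every bold tag marker, keeping the marker
    # at the start of each part (zero-width lookahead split).
    parts = re.split(r"(?=\*\*\[(?:SPEC|NOTE|BUG|\?)\]\*\*)", doc)
    kept = []
    for part in parts:
        m = re.match(r"\*\*(\[[A-Z?]+\])\*\*", part)
        if m and m.group(1) in keep:
            kept.append(part.strip())
    return "\n\n".join(kept)

doc = """\
## Authentication

**[SPEC]**
- Method: Bearer token
- Token expiry: 3600 seconds

**[NOTE]** Tokens were originally session-based (pre-v2.0).
"""

print(spec_only(doc))  # prints only the [SPEC] block
```

An agent (or a retrieval pipeline) would run a filter like this before handing the document to the model, loading the [NOTE] blocks only when a human asks for context.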
    hads/
    ├── README.md — this file
    ├── SPEC.md — full formal specification
    ├── case-studies/
    │   └── polymarket-bot.md — production AI agent codebase (48% token reduction)
    ├── examples/
    │   ├── api-reference.md — REST API documentation example
    │   ├── file-format.md — binary file format example
    │   └── configuration.md — configuration system example
    ├── validator/
    │   └── validate.py — validates a Markdown file against HADS spec
    └── LICENSE

Real-world measurements of HADS applied to production codebases:
- Polymarket trading bot — 705-line `CLAUDE.md` for an autonomous trading agent. 48% of context was version history loaded on every session. Tagging saves ~39% per typical session.
- Read SPEC.md — the full standard (15 min)
- Look at examples/ — real documents in HADS format
- Run the validator on your own docs:
  `python validator/validate.py your-doc.md`
A Fermi estimate of what HADS adoption would mean globally:
Assumptions:
- ~1 billion AI queries/day touching technical docs (50M developers × 10 queries/day + enterprise agents, CI/CD pipelines, support bots)
- HADS reduces tokens read per query by 7.91–36.68% across the current example corpus — 24.34% average SPEC-only reduction, with the best current measured example at 36.68%
- Energy cost: ~0.003 Wh per 1,000 tokens (GPT-4 class inference)
- Time saved: ~2 min/query from more precise first answers and fewer follow-ups
Annual savings at full adoption:
| Metric | Value |
|---|---|
| Tokens saved/day | 1.22 trillion |
| Electricity saved/year | ~1.33 GWh |
| Developer time saved/year | ~12 billion hours |
| Economic value (@ $50/hr) | ~$600 billion/year |
At 10% adoption these numbers drop proportionally — still ~$60B/year in recovered productivity.
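The derived rows in the table can be reproduced from the stated assumptions (a sanity check of the Fermi arithmetic, not new data; small differences from the table's ~1.33 GWh and ~$600B figures are rounding):

```python
# Stated assumptions from the list above
queries_per_day = 1e9              # AI queries/day touching technical docs
tokens_saved_per_day = 1.22e12     # from the table above
wh_per_1000_tokens = 0.003         # GPT-4 class inference energy
minutes_saved_per_query = 2        # more precise first answers, fewer follow-ups

# Derived annual figures
energy_gwh_per_year = tokens_saved_per_day / 1000 * wh_per_1000_tokens * 365 / 1e9
hours_saved_per_year = queries_per_day * minutes_saved_per_query / 60 * 365
value_per_year = hours_saved_per_year * 50          # @ $50/hr

print(f"~{energy_gwh_per_year:.2f} GWh/year")       # → ~1.34 GWh/year
print(f"~{hours_saved_per_year / 1e9:.1f}B hours")  # → ~12.2B hours
print(f"~${value_per_year / 1e9:.0f}B/year")        # → ~$608B/year
```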
The point: structuring documentation for AI consumption is economically material at global scale, even before accounting for better code quality from more accurate AI answers.
hads-skills — AI skills for converting documents to HADS format. Works with Claude Code, any model via system prompt, or zero-setup paste-and-run.
HADS is an open standard. Contributions welcome:
- New block types (open an issue first)
- Additional examples
- Validator improvements
- Ports of the validator to other languages
Please read SPEC.md before contributing — especially the design principles section.
MIT. Use freely, commercially or otherwise. Attribution appreciated but not required.
HADS exists because documentation should serve everyone who reads it — human or machine.