How is llms.txt different from robots.txt?

robots.txt is a crawler access-control file: it tells search engine bots which URLs they may and may not crawl. llms.txt does the opposite job: it curates content for AI agents to load at inference time when answering user questions. robots.txt is about restricting access; llms.txt is about directing attention.

What does an llms.txt audit check?

Our llms.txt checker verifies that the file exists at /llms.txt; has an H1 title, a blockquote summary, and H2 sections with markdown links; is served as text/markdown or text/plain; stays at a healthy size (under 50 KB is ideal); and contains no duplicate, malformed, or auth-walled links. It also confirms that linked URLs return 200, that they are not blocked by robots.txt, and that your robots.txt allows AI bots. Bonus credit goes to a /llms-full.txt companion and an HTML discovery link tag.
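As a rough illustration of the structural part of such an audit, the sketch below tests a file body for an H1, a blockquote summary, and at least one H2 section containing a markdown link. The function name `check_structure` and the exact matching rules are assumptions made for this example; it is not the checker's actual implementation.

```python
import re

# Matches a markdown link such as [Title](https://example.com/page).
LINK_RE = re.compile(r"\[([^\]]+)\]\(([^)\s]+)\)")


def check_structure(text: str) -> dict[str, bool]:
    """Run three structural checks on the body of an llms.txt file (hypothetical helper)."""
    non_empty = [ln.rstrip() for ln in text.splitlines() if ln.strip()]

    # H1: the first non-empty line should look like '# Title'.
    has_h1 = bool(non_empty) and non_empty[0].startswith("# ")

    # Blockquote summary: a '>' line that appears before the first H2.
    has_blockquote = False
    for ln in non_empty[1:]:
        if ln.startswith("## "):
            break
        if ln.startswith(">"):
            has_blockquote = True
            break

    # H2 + links: a '## ' heading followed by a list item with a markdown link.
    has_h2_links = False
    in_section = False
    for ln in non_empty:
        if ln.startswith("## "):
            in_section = True
        elif in_section and ln.lstrip().startswith("- ") and LINK_RE.search(ln):
            has_h2_links = True
            break

    return {"h1": has_h1, "blockquote": has_blockquote, "h2_with_links": has_h2_links}
```

Calling `check_structure(open("llms.txt").read())` returns one boolean per check, which is enough to build a simple pass/fail report.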

Where should llms.txt be placed?

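Place the file at the root of your domain and serve it over HTTPS, for example https://example.com/llms.txt, with a Content-Type of text/markdown or text/plain. AI agents look at the root path first. The llms.txt proposal mentions subpaths only as an optional extra, so a file that exists solely at a deeper path (e.g., /docs/llms.txt) is unlikely to be found, and our checker looks only at the root.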
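The two rules in this answer, root placement and an accepted content type, can be sketched with the standard library alone. The helper names `llms_txt_url` and `content_type_ok` are hypothetical and exist only for this example.

```python
from urllib.parse import urlsplit, urlunsplit

# Media types this page treats as acceptable for llms.txt.
ACCEPTED_TYPES = {"text/markdown", "text/plain"}


def llms_txt_url(site_url: str) -> str:
    """Return the root-level /llms.txt URL for any page on a site (hypothetical helper)."""
    # Assumption: input without a scheme is treated as an HTTPS host.
    parts = urlsplit(site_url if "://" in site_url else "https://" + site_url)
    # Force HTTPS and discard any path, query, or fragment.
    return urlunsplit(("https", parts.netloc, "/llms.txt", "", ""))


def content_type_ok(header_value: str) -> bool:
    """True if a Content-Type header names an accepted media type."""
    # Drop parameters such as '; charset=utf-8' before comparing.
    media_type = header_value.split(";", 1)[0].strip().lower()
    return media_type in ACCEPTED_TYPES
```

For example, `llms_txt_url("example.com/docs/page?x=1")` yields the root URL regardless of which page the user pasted in.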

What is llms-full.txt and do I need both?

llms.txt is a curated index (under 50KB ideally) with markdown links. llms-full.txt inlines the actual content — every page, every word — in one large markdown file. AI agents pick whichever fits their context window. The pattern adopted by Anthropic and Mintlify ships both: llms.txt for navigation, llms-full.txt for depth.
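How an agent might choose between the two files can be pictured as a simple size comparison. This is only a sketch of the idea: the `choose_file` name, the byte-based budget, and the prefer-depth-first order are assumptions for illustration, not documented behavior of any AI product.

```python
from typing import Optional


def choose_file(sizes: dict[str, int], budget_bytes: int) -> Optional[str]:
    """Prefer llms-full.txt when it fits the budget, else fall back to llms.txt.

    `sizes` maps a file name to its size in bytes; a missing key means the
    site does not publish that file. Returns None if nothing fits.
    """
    for name in ("llms-full.txt", "llms.txt"):
        size = sizes.get(name)
        if size is not None and size <= budget_bytes:
            return name
    return None
```

With a 900 KB llms-full.txt and a 20 KB llms.txt, a budget of 100,000 bytes selects llms.txt, while a budget of 1,000,000 bytes selects llms-full.txt.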

Can I generate an llms.txt automatically?

Yes. Our generator crawls your site, picks the most important pages, and writes a spec-conformant llms.txt with curated H2 sections, blockquote summary, and proper markdown links. Output is yours to edit before publishing. Costs 50 credits per generation; refunded if generation fails.

Should I block AI bots in robots.txt while having llms.txt?

No — that is self-defeating. Having an llms.txt file says "AI agents, please use this." Blocking those same agents in robots.txt (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, anthropic-ai, CCBot, Bytespider) tells them to go away. Our checker flags this contradiction so you can fix it.
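A minimal way to spot this contradiction yourself is to parse your robots.txt with Python's standard `urllib.robotparser` and ask whether each AI user agent may fetch /llms.txt. The bot list comes from the answer above; the function name `blocked_ai_bots` is hypothetical, and real crawlers may interpret edge cases differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# User agents named in the answer above.
AI_BOTS = ["GPTBot", "ClaudeBot", "PerplexityBot", "Google-Extended",
           "anthropic-ai", "CCBot", "Bytespider"]


def blocked_ai_bots(robots_txt: str, path: str = "/llms.txt") -> list[str]:
    """Return the AI bots that robots.txt forbids from fetching `path`."""
    parser = RobotFileParser()
    # parse() accepts the file as a list of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return [bot for bot in AI_BOTS if not parser.can_fetch(bot, path)]
```

An empty result means none of the listed bots is shut out; any names returned are the rules to revisit before publishing llms.txt.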

How often should I update llms.txt?

Update llms.txt whenever you add major content sections, rename URLs, change navigation, or release new product areas. Treat it like an SEO sitemap — refresh quarterly at minimum so AI agents stay aligned with your current site structure.

Free AI Optimization Tool

Free LLMs.txt Checker — Validate Your /llms.txt File

Test your /llms.txt against the full llms.txt spec — H1 + blockquote, markdown link integrity, robots.txt cross-checks, AI bot accessibility (GPTBot, ClaudeBot, PerplexityBot, Google-Extended). One-click generator included.

First check can take ~20s to spin up — every one after starts instantly.

Spec-Conformant

Instant Results

100% Free

llms.txt is a welcome mat for AI agents: it points them to the pages that matter most when they answer questions about you.

How It Works

Enter Your URL

Type your website address above. We locate /llms.txt automatically at the root of your domain.

We Analyze

Our checker runs every LLMs.txt-native check — H1 + blockquote, link integrity, robots.txt cross-checks, AI bot accessibility, file structure.

Generate or Fix

Get your score, a prioritized issue list, and a one-click AI generator if you don't have an llms.txt yet.

What is llms.txt?

llms.txt is a markdown file placed at the root of your website (example.com/llms.txt) that gives AI agents — ChatGPT, Claude, Perplexity, Gemini — a curated guide to your most valuable content. Where robots.txt says "don't go here," llms.txt says "here's what matters most."

The standard was proposed in September 2024 by Jeremy Howard of Answer.AI. It's now adopted by Anthropic, Vercel, Cloudflare, Mintlify, Hugging Face and thousands of smaller sites. The format is intentionally minimal — pure markdown — so any LLM can parse it without specialized libraries.

The companion file llms-full.txt takes a different approach: it inlines the entire content of your site so AI agents that need depth can grab everything in one request. Most large adopters ship both.

The llms.txt spec at a glance

# Your Site Name

> One-sentence summary of the project — what it is, who it's for.

## Documentation

- [Getting Started](https://example.com/docs/start): The five-minute setup
- [API Reference](https://example.com/api): Full endpoint listing
- [Examples](https://example.com/examples): Working code samples

## Guides

- [Tutorial 1](https://example.com/tutorial-1)
- [Tutorial 2](https://example.com/tutorial-2)

## Optional

- [Changelog](https://example.com/changelog): Skippable for shorter context
- [Press Coverage](https://example.com/press)

Required: H1 with site/project name.
Recommended: Blockquote summary directly after H1.
Recommended: Each H2 section contains a list of markdown links.
Optional special section: A heading literally called ## Optional for content AI agents can skip if context budget is tight.
Best practice: Serve the file as text/markdown or text/plain, never text/html.
Size: Under 50 KB is ideal and under 150 KB is acceptable; at 500 KB or more the file defeats the curation purpose, so move bulk content to llms-full.txt.
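To make the section rules concrete, here is a small parsing sketch that groups links by H2 heading and skips the section named Optional when context is tight. The function names and the regular expression are assumptions for this example; a production parser would need to handle more markdown variations.

```python
import re

# Matches '- [Title](url)' with an optional ': description' suffix.
LINK_RE = re.compile(r"^\s*-\s*\[([^\]]+)\]\(([^)\s]+)\)(?::\s*(.*))?$")


def parse_sections(text: str) -> dict[str, list[tuple[str, str, str]]]:
    """Map each H2 heading to its (title, url, description) links."""
    sections: dict[str, list[tuple[str, str, str]]] = {}
    current = None
    for line in text.splitlines():
        if line.startswith("## "):
            current = line[3:].strip()
            sections[current] = []
        elif current is not None:
            match = LINK_RE.match(line)
            if match:
                title, url, desc = match.groups()
                sections[current].append((title, url, desc or ""))
    return sections


def links_to_load(text: str, tight_context: bool) -> list[str]:
    """Return URLs to load, dropping the 'Optional' section when context is tight."""
    urls: list[str] = []
    for heading, links in parse_sections(text).items():
        if tight_context and heading == "Optional":
            continue
        urls.extend(url for _, url, _ in links)
    return urls
```

Run against the sample file above with `tight_context=True`, this returns the Documentation and Guides links and leaves out the Changelog and Press Coverage entries.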

Everything we check

File exists at /llms.txt: HTTP 200 at the root
Has H1 title: the spec's only required element
Has H2 + links: the actual curation payload
Linked URLs resolve: up to 25 sampled, each must return 200 OK
Blockquote summary: the spec-recommended second element
Correct Content-Type: text/markdown or text/plain
No duplicate links: duplicates waste the AI context window
Markdown link format: no bare URLs, HTML anchors, or "click here"
No auth-walled links: no 401/403 responses or login redirects
Healthy file size: <50KB ideal, <150KB OK
Not robots-blocked: linked URLs crawlable per robots.txt
AI bots allowed: GPTBot, ClaudeBot, PerplexityBot, Google-Extended
llms-full.txt companion: bonus, a fallback for depth
Discovery link tag: bonus, <link rel="llms"> in the HTML head
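Two of the checks listed above, duplicate links and markdown link format, lend themselves to a short sketch. Everything here is illustrative: the `link_issues` name, the regular expressions, and the set of vague link titles are assumptions, not the checker's real rules.

```python
import re
from collections import Counter

MD_LINK_RE = re.compile(r"\[([^\]]*)\]\(([^)\s]+)\)")
HTML_ANCHOR_RE = re.compile(r"<a\b[^>]*>", re.IGNORECASE)
BARE_URL_RE = re.compile(r"https?://[^\s<>)]+")
# Assumption: link titles considered too vague to help an AI agent.
VAGUE_TITLES = {"click here", "here", "link", "read more"}


def link_issues(text: str) -> dict[str, list[str]]:
    """Report duplicate, vaguely titled, HTML-anchor, and bare-URL links."""
    links = MD_LINK_RE.findall(text)
    counts = Counter(url for _, url in links)
    duplicates = sorted(url for url, n in counts.items() if n > 1)
    vague = [url for title, url in links if title.strip().lower() in VAGUE_TITLES]
    html_anchors = HTML_ANCHOR_RE.findall(text)
    # Strip well-formed links and anchors so only bare URLs remain to match.
    remainder = HTML_ANCHOR_RE.sub("", MD_LINK_RE.sub("", text))
    bare_urls = BARE_URL_RE.findall(remainder)
    return {
        "duplicates": duplicates,
        "vague_titles": vague,
        "html_anchors": html_anchors,
        "bare_urls": bare_urls,
    }
```

Each key in the result holds the offending URLs or tags, so an empty list for every key means the file passes these particular checks.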

Best practices

✓ Curate, don't dump. An llms.txt with 30 high-quality links beats one with 300 noisy ones, because AI agents work within a fixed context budget.
✓ Group by intent. Use H2 sections that match how a user would ask: "Documentation," "Pricing," "Examples," "API Reference."
✓ Use absolute URLs. The spec allows relative URLs, but absolute ones survive non-root deployments and partial fetches.
✓ Add descriptions after a colon. [Quickstart](url): 5-minute setup beats a bare title because it tells the AI what each link is for.
✓ Don't block AI bots in robots.txt. Having an llms.txt while blocking GPTBot / ClaudeBot / Google-Extended is self-defeating.
✓ Ship llms-full.txt too. For documentation-heavy sites, the full-content fallback unlocks better AI answers when context allows.
✓ Update quarterly. Treat it like a sitemap: refresh when navigation changes or major content launches.
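The absolute-URL practice is easy to automate. The sketch below rewrites every markdown link target against a base URL using the standard `urllib.parse.urljoin`; the function name `absolutize_links` and the base URL are placeholders chosen for this example.

```python
import re
from urllib.parse import urljoin

# Matches a markdown link such as [Title](/docs/start).
LINK_RE = re.compile(r"\[([^\]]+)\]\(([^)\s]+)\)")


def absolutize_links(text: str, base_url: str) -> str:
    """Rewrite relative markdown link targets as absolute URLs."""
    def fix(match: re.Match) -> str:
        title, url = match.groups()
        # urljoin leaves already-absolute URLs unchanged.
        return f"[{title}]({urljoin(base_url, url)})"

    return LINK_RE.sub(fix, text)
```

For instance, `absolutize_links("- [Start](/docs/start): Setup", "https://example.com/")` produces a line that points at https://example.com/docs/start while keeping the description intact.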

Common llms.txt mistakes to avoid

Linking to pages that no longer exist

An llms.txt is only useful if every link resolves. Dead or redirected URLs tell AI assistants your file is stale and untrustworthy. Validate every link against your live pages before you publish, and re-check whenever you restructure your site.
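One way to validate links before publishing is a small script that requests each linked URL and reports anything other than 200. This is a sketch under assumptions: the names `http_status` and `broken_links` are made up here, HEAD requests are not honored by every server, and `urlopen` follows redirects silently, so redirected URLs are not flagged by this version.

```python
import re
import urllib.error
import urllib.request
from typing import Callable

# Captures the URL part of a markdown link.
LINK_RE = re.compile(r"\[[^\]]+\]\(([^)\s]+)\)")


def http_status(url: str, timeout: float = 10.0) -> int:
    """Return the final HTTP status for a URL, or 0 if the request fails outright."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # e.g. 404 or 403
    except (OSError, ValueError):
        return 0  # DNS failure, timeout, or a malformed URL


def broken_links(text: str, fetch: Callable[[str], int] = http_status) -> dict[str, int]:
    """Map each linked URL that does not return 200 to its status code."""
    urls = dict.fromkeys(LINK_RE.findall(text))  # de-duplicate, keep order
    results = {url: fetch(url) for url in urls}
    return {url: code for url, code in results.items() if code != 200}
```

The `fetch` parameter lets you swap in a stub for testing, or a client that reports redirects if you want those flagged as well.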

Skipping the H1 and summary blockquote

The spec opens with a single H1 — your site or product name — followed by a short blockquote that summarises what you do. AI models read that first to establish context. Omitting it makes the rest of your file harder to interpret correctly.

Blocking the same content in robots.txt

If your robots.txt disallows the very URLs you list in llms.txt, AI crawlers cannot fetch them. The two files should work together: robots.txt controls what is crawlable, llms.txt curates the best of what is allowed. Cross-check them so they do not contradict each other.
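The cross-check described here can be sketched with Python's standard `urllib.robotparser`: extract every URL from llms.txt and ask whether a given user agent may fetch it. The function name `robots_conflicts` and the default agent are assumptions for this example, and the sketch assumes all linked URLs live on the host that the robots.txt belongs to.

```python
import re
from urllib.robotparser import RobotFileParser

# Captures the URL part of a markdown link.
LINK_RE = re.compile(r"\[[^\]]+\]\(([^)\s]+)\)")


def robots_conflicts(llms_txt: str, robots_txt: str, user_agent: str = "GPTBot") -> list[str]:
    """Return llms.txt URLs that robots.txt disallows for `user_agent`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    urls = LINK_RE.findall(llms_txt)
    # can_fetch() compares only the path of each URL against the parsed rules.
    return [url for url in urls if not parser.can_fetch(user_agent, url)]
```

Any URL in the result should either be removed from llms.txt or unblocked in robots.txt so the two files agree.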

Dumping every URL instead of curating

llms.txt is a curated index, not a sitemap. List the pages that best explain your product, docs, and key resources — not every URL on the site. A focused file gives AI assistants a cleaner, more accurate picture of what matters.

Want a complete AI + SEO audit?

LLMs.txt is one of 20 checks we run. Get the full picture — robots.txt, sitemaps, meta tags, page speed, content quality, internal links, E-E-A-T, AI visibility — in one report.

Run Full Website Audit

Frequently asked questions

Will AI agents actually read my llms.txt?

Major AI tools — ChatGPT browse mode, Claude, Perplexity, Gemini — check /llms.txt at inference time when answering site-specific queries. Adoption is growing fast. Even where adoption is partial, the cost is zero and the upside is direct control over how AI describes your product.

Should I block AI bots OR have an llms.txt?

They serve opposite purposes. If you publish an llms.txt, allow AI bots (GPTBot, ClaudeBot, PerplexityBot, Google-Extended) in robots.txt. If you actively want to keep AI agents out, block them in robots.txt and don't publish an llms.txt.

Can I autogenerate it?

Yes — our generator crawls your site, picks the top ~50 pages, then writes a spec-conformant llms.txt with curated sections. Edit before publishing. 50 credits per generation; refunded if generation fails.

Where exactly does it live?

HTTPS root: https://yourdomain.com/llms.txt, the same convention as robots.txt and sitemap.xml. The llms.txt proposal treats subpaths such as /docs/llms.txt as an optional extra, so don't rely on one in place of the root file.

What's the difference from sitemap.xml?

sitemap.xml is for traditional search-engine crawlers and lists every indexable URL. llms.txt is for AI agents at inference time and lists the curated subset you want them to load when answering questions about your site.
