Hotel llms.txt Adoption Study
We scanned 105,002 hotel websites for llms.txt files. Only 6.3% have one β and 7.3% of those misuse it as a robots.txt clone.
TL;DR
We fetched /llms.txt and /llms-full.txt from 105,002 hotel websites across 7 countries. Only 6.3% have a llms.txt file (6,590 hotels) and just 0.3% serve a llms-full.txt (265 hotels). The US leads adoption at 12.4%, while France trails at 3.8%. WordPress SEO plugins (AIOSEO, Yoast, Rank Math) drive 33.4% of all files, but 7.3% of llms.txt files misuse the format as robots.txt-style access control rules. Hotels with llms.txt score 62% higher on schema.org quality β suggesting it's a marker of technical SEO maturity.
Executive Summary
The llms.txt file is an emerging standard for websites to communicate their content structure to AI models. Unlike robots.txt (which tells bots what not to crawl), llms.txt tells AI models what a site is about β a curated index of pages with descriptions that helps LLMs understand and accurately represent a property.
Our analysis of 105,002 hotel websites reveals extremely early adoption. At 6.3%, llms.txt is where robots.txt was in the early 2000s β a technical signal adopted by forward-thinking properties and platforms, but unknown to the vast majority. The companion file llms-full.txt (designed for detailed content) is even rarer at 0.3%, and every hotel with llms-full.txt also has llms.txt β the two-file approach has not gained traction.
Most adoption is plugin-driven, not strategic. WordPress SEO plugins (AIOSEO, Yoast, Rank Math) account for 33.4% of all llms.txt files, often auto-generated with little curation. The best files β like rich hotel descriptions with room types, amenities, and policies β represent just 2.9% of the total. Meanwhile, 7.3% of files are outright misconfigurations: robots.txt-style access control rules served as llms.txt, providing zero value to AI models.
Adoption Overview
How does llms.txt adoption compare to robots.txt? (n=105,002 hotels)
llms.txt vs robots.txt adoption rates
llms.txt adoption overview
| Metric | Count | % of Reachable |
|---|---|---|
| Has llms.txt | 6,590 | 6.3% |
| Has llms-full.txt | 265 | 0.3% |
| Has either | 6,590 | 6.3% |
| Has both | 265 | 0.3% |
Who Generates These Files?
WordPress SEO plugins drive 33.4% of hotel llms.txt files. The majority (57.4%) are custom.
Generator breakdown of hotel llms.txt files
Generator breakdown
| Generator | Count | % of llms.txt Files |
|---|---|---|
| Custom | 3,784 | 57.4% |
| AIOSEO | 1,197 | 18.2% |
| Yoast SEO | 675 | 10.2% |
| Backhotelite | 280 | 4.2% |
| Rank Math | 238 | 3.6% |
| robots-style | 179 | 2.7% |
Industry-specific platforms
Backhotelite (4.2%) β A hotel-specific CMS platform primarily used by Spanish properties. Accounts for 18.1% of Spanish llms.txt files.
ComboCMS (2.1%) β An Italian hotel CMS that auto-generates llms.txt. Accounts for 10.5% of Italian llms.txt files.
robots-style (2.7%) β 179 files contain User-agent/Allow/Disallow directives instead of the intended llms.txt format. These provide zero value to LLMs.
What's Inside These Files?
Only 42.9% follow the intended spec. 7.3% misunderstand the format entirely.
Content type distribution of hotel llms.txt files
Content type breakdown
| Content Type | Count | % | Description |
|---|---|---|---|
| Site Index | 2,826 | 42.9% | Page listings with URLs and descriptions |
| Other | 2,013 | 30.5% | Mixed or unstructured content |
| Sitemap Only | 933 | 14.2% | Just a sitemap reference, no page details |
| Access Control | 484 | 7.3% | User-agent allow/disallow rules (misuse) |
| Hotel Description | 194 | 2.9% | Detailed property info, amenities, policies |
| Summary | 91 | 1.4% | Brief business summary with contact info |
The 2.9% doing it right: rich hotel descriptions
194 hotels serve llms.txt files with detailed property information β room types, cancellation policies, amenities, and contact details. These are exactly what an AI concierge needs to recommend a property.
The sweet spot for content is 11-50 pages listed (39.1% of files), covering a typical hotel site's key pages: rooms, amenities, location, dining, events, contact, and blog posts. The median file lists 15 pages.
Adoption by Country
The US leads at 12.4%. France trails at 3.8% β consistent with its resistance to AI integration.
llms.txt adoption rate by country
Full country-level adoption data
| Country | Reachable Hotels | Has llms.txt | % Adoption | llms-full.txt % | Top Generator |
|---|---|---|---|---|---|
| USA | 7,445 | 924 | 12.4% | 0.2% | Custom |
| Spain | 16,411 | 1,418 | 8.6% | 0.3% | Custom |
| Netherlands | 2,891 | 237 | 8.2% | 0.4% | Custom |
| United Kingdom | 10,547 | 729 | 6.9% | 0.3% | Custom |
| Germany | 22,268 | 1,256 | 5.6% | 0.2% | Custom |
| Italy | 27,319 | 1,309 | 4.8% | 0.3% | Custom |
The France Paradox: Blocks Most, Adopts Least
France has the highest AI blocking rate (7.5% in our robots.txt study) AND the lowest llms.txt adoption (3.8%) among major markets. This is a consistent signal: the French hospitality industry is the most resistant to AI integration.
For comparison, the US has the opposite pattern: lowest blocking rate (2.1%) and highest llms.txt adoption (12.4%). The divergence suggests fundamentally different attitudes toward AI in the hospitality industry β with France leaning toward restriction and the US toward visibility.
Adoption by Star Classification
5-star hotels adopt at 2.5x the rate of 1-star properties.
llms.txt adoption rate by star classification
Adoption rates by star classification
| Stars | Hotels | Has llms.txt | % Adoption | llms-full.txt % | Avg File Size |
|---|---|---|---|---|---|
| 5-star | 2,062 | 225 | 10.9% | 0.3% | 11.0 KB |
| 4-star | 16,548 | 1,459 | 8.8% | 0.2% | 17.9 KB |
| 3-star | 30,199 | 1,812 | 6% | 0.3% | 19.6 KB |
| 2-star | 10,222 | 539 | 5.3% | 0.3% | 10.9 KB |
| 1-star | 2,699 | 118 | 4.4% | 0.3% | 12.9 KB |
| Unclassified | 43,272 | 2,437 | 5.6% | 0.3% | 10.9 KB |
Schema.org Correlation
Hotels with llms.txt have 62% higher schema.org scores β it's a proxy for technical SEO maturity.
Average schema.org score: with vs without llms.txt
With llms.txt
Without llms.txt
File Size Distribution
Median: 3.3 KB (~15 pages). The long tail reaches 1 MB.
llms.txt file size distribution
File size distribution
| Size Range | Count | % of Files |
|---|---|---|
| 1-100 B | 171 | 2.6% |
| 101-500 B | 282 | 4.3% |
| 501 B-1 KB | 820 | 12.4% |
| 1-5 KB | 2,620 | 39.8% |
| 5-10 KB | 1,208 | 18.3% |
| 10-50 KB | 1,110 | 16.8% |
Pages listed per llms.txt file
| Pages Listed | Count | % of Files |
|---|---|---|
| 0 pages | 1,398 | 21.2% |
| 1-5 pages | 376 | 5.7% |
| 6-10 pages | 994 | 15.1% |
| 11-20 pages | 1,100 | 16.7% |
| 21-50 pages | 1,473 | 22.4% |
| 51-100 pages | 737 | 11.2% |
Frequently Asked Questions
Methodology
Data Collection
- Source: HotelRank global hotel index β 121,425 hotels from Google Maps
- Reachable websites: 105,002 (84.8% of total)
- Files checked:
/llms.txtand/llms-full.txtper hotel - 7 countries: US (7.4K), ES (16.4K), NL (2.9K), UK (10.5K), DE (22.3K), IT (27.3K), FR (17.6K)
- Files fetched during March 2026 crawl window
Content Analysis
- Generator detection: Heuristic identification from file header comments (AIOSEO, Yoast, Rank Math, etc.)
- Content classification: Rule-based categorization into site-index, access-control, hotel-description, summary, sitemap-only, minimal
- Language detection: Keyword-frequency heuristics for English, German, Spanish, Italian, French
- Structural analysis: Section header extraction, URL counting, page listing enumeration
Continue Reading
Explore more Hotelrank research on AI hotel search.