๐Ÿ“„ llms.txt Generator & Validator โ€” AI Search Visibility avatar

๐Ÿ“„ llms.txt Generator & Validator โ€” AI Search Visibility

Pricing

Pay per usage

Go to Apify Store
๐Ÿ“„ llms.txt Generator & Validator โ€” AI Search Visibility

๐Ÿ“„ llms.txt Generator & Validator โ€” AI Search Visibility

Generate llms.txt files for AI search visibility. Fetches sitemap, validates structure, checks AI bot access (OAI-SearchBot, PerplexityBot, ClaudeBot, Google-Extended). 2 blue ocean competitors only.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Alexander Maksimchuk

Alexander Maksimchuk

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

9 hours ago

Last modified

Share

๐Ÿ“„ llms.txt Generator & Validator

Generate llms.txt files from your sitemap and validate AI search engine visibility. Checks robots.txt for AI bot access (OAI-SearchBot, PerplexityBot, ClaudeBot, Google-Extended), scores your AI discoverability 0โ€“100, and produces a ready-to-deploy llms.txt file.

๐Ÿ“‹ Table of Contents

๐Ÿค” What is this?

llms.txt is a new standard (llmstxt.org) that tells AI search engines โ€” ChatGPT Search, Perplexity, Claude, Gemini โ€” what your website is about and which pages matter most. It's like robots.txt but for AI citations.

This Actor:

  1. Generates llms.txt from your sitemap โ€” fetches all URLs, extracts page titles/descriptions, categorizes into sections, writes spec-compliant file
  2. Validates existing llms.txt โ€” checks structure, H1/blockquote presence, line count, link depth
  3. Checks AI bot access in robots.txt โ€” verifies OAI-SearchBot, PerplexityBot, ClaudeBot, Google-Extended are allowed (not blocked)
  4. Scores AI discoverability 0โ€“100 across 8 categories

๐Ÿš€ How to use

Step 1: Enter your URL

Provide your website root URL (e.g., https://example.com). The Actor auto-discovers your sitemap.

Step 2: Provide site name + description

Enter a human-readable site name and one-sentence description. These become the H1 header and blockquote in your llms.txt.

Step 3: Run

Click Start. The Actor:

  • Fetches + parses your sitemap
  • Extracts page metadata via headless browser
  • Generates llms.txt with categorized sections
  • Checks robots.txt for AI bot access
  • Checks for AI discovery endpoints (/.well-known/ai.txt, /ai/summary.json)
  • Scores your AI visibility 0โ€“100

Step 4: Download

  • Key-value store: llms.txt โ€” Ready-to-deploy file
  • Key-value store: OUTPUT โ€” Full analysis JSON with score + recommendations
  • Dataset โ€” Summary row with score, grade, issues

๐Ÿ“ฅ Input

FieldTypeDefaultDescription
urlstringrequiredRoot URL (e.g., https://example.com)
siteNamestringhostnameHuman-readable site name
siteDescriptionstring"Website at..."One-sentence description
maxUrlsinteger50Max URLs in llms.txt (5โ€“500, keep under 200)
validateOnlybooleanfalseOnly check existing llms.txt + AI bot access. Don't generate.

๐Ÿ“ค Output

FieldTypeDescription
llmsTxtstringGenerated llms.txt content
llmsFullTxtstringSame as llmsTxt (downloadable)
robotsAnalysisobjectAI bot access: which bots allowed/blocked
existingLlmsTxtobjectAnalysis of existing llms.txt (if found)
aiDiscoveryobjectAI discovery endpoints check
sitemapEntriesarrayURLs with title, description, section
scoreintegerAI visibility score 0โ€“100
gradestringCritical / Foundation / Good / Excellent
recommendationsarrayPrioritized action items

๐Ÿ“„ JSON output example

{
"websiteUrl": "https://example.com",
"generatedAt": "2026-07-04T04:30:00.000Z",
"llmsTxt": "# Example Site\n\n> A website about...\n\n## Home\n\n- [Home](https://example.com): Main page\n\n## Blog\n\n- [Latest Posts](https://example.com/blog): Blog index\n\n## Links\n\n- [XML Sitemap](https://example.com/sitemap.xml)\n- [robots.txt](https://example.com/robots.txt)",
"sitemapUrlsFound": 47,
"sitemapUrlsIncluded": 47,
"robotsAnalysis": {
"aiBots": {
"OAI-SearchBot": true,
"PerplexityBot": true,
"ClaudeBot": true,
"Google-Extended": true,
"GPTBot": false,
"anthropic-ai": false
},
"blockingTraining": ["GPTBot", "anthropic-ai"],
"allowingCitation": ["OAI-SearchBot", "PerplexityBot", "ClaudeBot", "Google-Extended"]
},
"existingLlmsTxt": {
"exists": false,
"issues": ["llms.txt not found"]
},
"aiDiscovery": {
"hasAiTxt": false,
"hasSummaryJson": false,
"hasFaqJson": false
},
"score": 50,
"grade": "Foundation",
"recommendations": [
"Create llms.txt file โ€” this Actor can generate one for you",
"Add /.well-known/ai.txt for AI agent discovery",
"Add /ai/summary.json for AI summary access"
]
}

๐Ÿ’ฐ How much does it cost?

$2 per website. Includes sitemap fetch, page metadata extraction, llms.txt generation, robots.txt analysis, AI discovery check, and scoring. No subscription.

Cost examples

Use caseSitesCost
Single site llms.txt1$2
Agency (10 clients)10$20
SEO audit portfolio (50 sites)50$100

๐ŸŽฏ Use cases

  • AI SEO preparation โ€” Get your site ready for ChatGPT Search, Perplexity, and Google AI Overviews citations
  • Periodic audits โ€” Track your AI visibility score over time
  • Agency deliverable โ€” Generate llms.txt for clients as part of SEO package
  • Competitive analysis โ€” Run on competitor sites to see their AI visibility setup
  • Pre-launch check โ€” Verify AI bot access before deploying a new site

๐Ÿ”— Integrations

AI agents (MCP)

Works with Apify MCP server (https://mcp.apify.com). AI agents can run this Actor to check any website's AI visibility.

Chain with ADA Scanner

Run alongside our ADA Compliance Checker for a full AI+accessibility audit.

Automation

  • Zapier โ€” Generate llms.txt on new site deployment โ†’ push to GitHub
  • Make โ€” Schedule monthly AI visibility audits โ†’ email report
  • Custom โ€” Use Apify webhooks to trigger CI/CD llms.txt deployment

Yes. This Actor fetches public web pages (sitemap.xml, robots.txt) and generates a static text file. It does not modify, hack, or breach any website. llms.txt is an open standard (llmstxt.org).

๐Ÿ”ง Troubleshooting

ProblemCauseFix
0 URLs foundSitemap not found or emptyCheck for sitemap at /sitemap.xml or /sitemap_index.xml. Ensure robots.txt has Sitemap directive
Score is 0No AI bot access + no llms.txtAllow AI citation bots in robots.txt. Generate llms.txt
Slow generationMany URLs to extractReduce maxUrls (default 50). Each URL requires browser page load for metadata
Empty llms.txtSitemap found but no URLsCheck sitemap XML is valid. Try validateOnly: true first

โ“ FAQ

๐Ÿ› ๏ธ Tech stack

๐Ÿ’ฌ Feedback & issues

  • Apify Console โ€” Issues tab on this Actor's page
  • Review โ€” Rate on Store page