Robots.txt Validator - Crawl Rules Analyzer
Analyze robots.txt files for any domain. Extract crawl rules, sitemaps, blocked paths, and crawl-delay settings. Validate configuration and identify SEO issues in bulk.
Developer: Ava Torres
robots.txt Validator & Analyzer
Fetch, parse, and analyze robots.txt files for any domain in bulk. Built for SEO professionals, developers, and crawler operators who need to audit site access rules at scale.
What It Does
For each domain you supply, the actor:
- Fetches `/robots.txt` from the domain root over HTTPS (falls back gracefully on 404 or network errors)
- Parses all `User-agent`, `Allow`, `Disallow`, `Crawl-delay`, and `Sitemap` directives
- Reports structured rules grouped by user-agent
- Optionally checks whether specific paths are allowed or blocked for your chosen user-agent
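The parse-and-check steps above can be sketched with Python's standard-library `urllib.robotparser`. This is an illustrative simplification, not the actor's published implementation; the `analyze_robots` helper and its field names are assumptions chosen to mirror the output schema:

```python
# Sketch of the parse/check flow using only the standard library.
# analyze_robots is a hypothetical helper; field names mirror the
# actor's output schema, but the actor's internals may differ.
import urllib.robotparser

def analyze_robots(robots_txt, domain, user_agent="*", check_paths=None):
    """Parse raw robots.txt text and evaluate paths for one user-agent."""
    rp = urllib.robotparser.RobotFileParser()
    rp.parse(robots_txt.splitlines())
    return {
        "domain": domain,
        "sitemapUrls": rp.site_maps() or [],          # all Sitemap: lines
        "crawlDelay": rp.crawl_delay(user_agent),     # None if not set
        "analyzedPaths": [
            {"path": p, "allowed": rp.can_fetch(user_agent, p)}
            for p in (check_paths or [])
        ],
    }

sample = """User-agent: *
Disallow: /admin
Crawl-delay: 5
Sitemap: https://example.com/sitemap.xml
"""
result = analyze_robots(sample, "example.com", "*", ["/admin", "/blog"])
```

Here `/admin` comes back blocked and `/blog` allowed, with a crawl delay of 5 seconds and one declared sitemap.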
Input
| Field | Type | Required | Description |
|---|---|---|---|
| `urls` | string[] | Yes | Domains or full URLs (e.g. `google.com`, `https://openai.com/blog`) |
| `userAgent` | string | No | User-agent to evaluate rules for. Defaults to `*` |
| `checkPaths` | string[] | No | Specific paths to test for allow/disallow (e.g. `/admin`, `/api/`) |
| `maxResults` | integer | No | Cap on the number of domains to process. Defaults to 100 |
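A complete input object, assembled from the example values in the table above:

```json
{
  "urls": ["google.com", "https://openai.com/blog"],
  "userAgent": "*",
  "checkPaths": ["/admin", "/api/"],
  "maxResults": 100
}
```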
Output
One record per domain:
| Field | Description |
|---|---|
| `domain` | Domain name |
| `robotsTxtUrl` | Full URL of the fetched robots.txt |
| `robotsTxtFound` | `true` if HTTP 200 was returned |
| `robotsTxtContent` | Raw robots.txt text |
| `userAgentRules` | Parsed rule blocks, each with a `userAgent` and a `rules` array of `{directive, path}` objects |
| `sitemapUrls` | All `Sitemap` URLs declared in the file |
| `crawlDelay` | `Crawl-delay` in seconds for the requested user-agent (`null` if not set) |
| `analyzedPaths` | Per-path results: `{path, allowed}` for each path in `checkPaths` |
| `fetchError` | Error message if the file could not be fetched |
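An illustrative record, with hypothetical values for a domain that disallows `/admin` and declares one sitemap:

```json
{
  "domain": "example.com",
  "robotsTxtUrl": "https://example.com/robots.txt",
  "robotsTxtFound": true,
  "robotsTxtContent": "User-agent: *\nDisallow: /admin\nSitemap: https://example.com/sitemap.xml\n",
  "userAgentRules": [
    {
      "userAgent": "*",
      "rules": [{ "directive": "Disallow", "path": "/admin" }]
    }
  ],
  "sitemapUrls": ["https://example.com/sitemap.xml"],
  "crawlDelay": null,
  "analyzedPaths": [{ "path": "/admin", "allowed": false }],
  "fetchError": null
}
```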
Example Use Cases
- SEO audit: Check which bots can access which parts of your site
- Crawler compliance: Verify your spider respects `Disallow` rules before running at scale
- Competitive research: Understand what paths competitors block from indexing
- Security review: Identify paths hidden from crawlers (admin panels, staging URLs)
- Sitemap discovery: Extract all declared sitemap URLs without manual inspection
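For the crawler-compliance case specifically, the same rules can gate requests before they are made. A minimal sketch using the standard library (the `mybot` user-agent and `may_crawl` helper are illustrative; a real crawler would also cache parsed rules per host):

```python
# Pre-flight compliance gate: consult parsed robots.txt rules
# before issuing each request. Illustrative only.
import urllib.robotparser

rp = urllib.robotparser.RobotFileParser()
rp.parse("""User-agent: mybot
Disallow: /private/
""".splitlines())

def may_crawl(path, agent="mybot"):
    """Return True if robots.txt permits this agent to fetch the path."""
    return rp.can_fetch(agent, path)
```

Calling `may_crawl("/private/page")` returns `False`, while paths outside the disallowed prefix return `True`.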
Pricing
$0.10 per 1,000 domains checked, so a typical run of 100 domains costs about $0.01.