robots.txt AI Policy Monitor | GPTBot ClaudeBot

Pricing

from $11.00 / 1,000 results

Detect AI crawler block policies in robots.txt and monitor policy shifts over time. Export compact or full JSON for SEO and governance workflows.

Rating

0.0 (0)

Developer

太郎 山田

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 4 days ago

Output First: What You Get

Each checked domain returns a policy summary, a per-crawler status list, and any detected policy changes.

{
  "domain": "nytimes.com",
  "status": "ok",
  "summary": {
    "totalCrawlers": 16,
    "blocked": 8,
    "partialBlock": 2,
    "allowed": 6
  },
  "aiPolicies": [
    {
      "crawler": "GPTBot",
      "company": "OpenAI",
      "blocked": true,
      "partialBlock": false,
      "allowed": false
    }
  ],
  "changes": [
    {
      "crawler": "PerplexityBot",
      "type": "allowed_to_blocked",
      "from": "allowed",
      "to": "blocked"
    }
  ]
}
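
The "changes" entries above can be understood as a diff between two runs. The following is a minimal Python sketch of that idea, not the Actor's actual implementation. The function names are hypothetical, and the "partial" label for a partially blocked crawler is an assumption, since the sample output only shows "allowed" and "blocked".

```python
def status_of(policy: dict) -> str:
    """Collapse the boolean flags of one aiPolicies entry into a single label."""
    if policy.get("blocked"):
        return "blocked"
    if policy.get("partialBlock"):
        return "partial"  # assumed label; not shown in the sample output
    return "allowed"


def diff_policies(previous: list[dict], current: list[dict]) -> list[dict]:
    """Return one change entry per crawler whose status differs between runs."""
    before = {p["crawler"]: status_of(p) for p in previous}
    changes = []
    for policy in current:
        old = before.get(policy["crawler"])
        new = status_of(policy)
        # Crawlers without an earlier record are skipped rather than reported.
        if old is not None and old != new:
            changes.append({
                "crawler": policy["crawler"],
                "type": f"{old}_to_{new}",
                "from": old,
                "to": new,
            })
    return changes


previous_run = [{"crawler": "PerplexityBot", "blocked": False,
                 "partialBlock": False, "allowed": True}]
current_run = [{"crawler": "PerplexityBot", "blocked": True,
                "partialBlock": False, "allowed": False}]
print(diff_policies(previous_run, current_run))
```

Keying the earlier run by crawler name keeps the comparison independent of the order in which crawlers are listed.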

Input Examples

Demo run (safe trial):

{
  "domains": ["openai.com", "google.com"],
  "demoMode": true
}

Production run:

{
  "domains": ["nytimes.com", "github.com", "openai.com"],
  "delivery": "webhook",
  "webhookUrl": "https://example.com/apify/webhook",
  "concurrency": 5,
  "demoMode": false
}
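
One way to submit such an input programmatically is the Apify API's "run Actor" endpoint. The sketch below only builds the HTTP request and does not send it. The Actor ID and token are placeholders, not values taken from this page, and the helper name is hypothetical.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request that starts an Actor run."""
    return urllib.request.Request(
        url=f"{API_BASE}/acts/{actor_id}/runs",
        data=json.dumps(run_input).encode("utf-8"),  # the JSON body is the Actor input
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


production_input = {
    "domains": ["nytimes.com", "github.com", "openai.com"],
    "delivery": "webhook",
    "webhookUrl": "https://example.com/apify/webhook",
    "concurrency": 5,
    "demoMode": False,
}

# Placeholders: substitute the real Actor ID and your own API token.
request = build_run_request("<username>~<actor-name>", "<APIFY_TOKEN>", production_input)
print(request.get_method(), request.full_url)
# To actually start the run: urllib.request.urlopen(request)
```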

Demo Mode (Conversion-Friendly, Non-Abusive)

When demoMode=true:

  • Domain count is capped at 1
  • Output is compact (limited policy fields)
  • Webhook delivery is disabled (dataset only)
  • Snapshot writes are disabled

An upgrade hint is included in meta.upgradeHint to point users toward full bulk monitoring.
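
The restrictions above could be expressed as a small input-normalization step. This Python sketch is illustrative only. The function name, the "dataset" delivery value, and the compactOutput and writeSnapshots keys are hypothetical, and keeping the first domain is an assumption, since the text only states that the count is capped at 1.

```python
def apply_demo_limits(run_input: dict) -> dict:
    """Return a copy of run_input with the demo-mode restrictions applied."""
    limited = dict(run_input)
    if not run_input.get("demoMode"):
        return limited  # production runs pass through unchanged

    # Cap at one domain (assumption: the first one listed is kept).
    limited["domains"] = list(run_input.get("domains", []))[:1]
    # Webhook delivery is disabled; results go to the dataset only.
    limited["delivery"] = "dataset"      # hypothetical value
    limited.pop("webhookUrl", None)
    # Hypothetical flags standing in for compact output and snapshot writes.
    limited["compactOutput"] = True
    limited["writeSnapshots"] = False
    return limited


print(apply_demo_limits({"domains": ["openai.com", "google.com"], "demoMode": True}))
```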

AI Crawlers Covered

Includes GPTBot, ChatGPT-User, OAI-SearchBot, Google-Extended, ClaudeBot, Claude-Web, CCBot, Bytespider, PerplexityBot, and more.
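
To make the blocked / partial / allowed distinction concrete, here is a minimal Python sketch that classifies one crawler against a robots.txt body. The rules it applies are assumptions made for illustration: "Disallow: /" with no Allow lines counts as blocked, any other non-empty Disallow counts as partial, and a crawler without its own group falls back to the "*" group. The Actor's real logic may differ.

```python
def parse_groups(robots_txt: str) -> dict[str, list[tuple[str, str]]]:
    """Map each lowercased user-agent token to its (directive, path) rules."""
    groups: dict[str, list[tuple[str, str]]] = {}
    agents: list[str] = []
    in_rules = False
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a rule line closed the previous group
                agents, in_rules = [], False
            agents.append(value.lower())
            groups.setdefault(value.lower(), [])
        elif field in ("allow", "disallow"):
            in_rules = True
            for agent in agents:
                groups[agent].append((field, value))
    return groups


def classify(robots_txt: str, crawler: str) -> str:
    """Label a crawler as 'blocked', 'partial', or 'allowed' (assumed rules)."""
    groups = parse_groups(robots_txt)
    rules = groups.get(crawler.lower(), groups.get("*", []))
    disallows = [path for kind, path in rules if kind == "disallow" and path]
    allows = [path for kind, path in rules if kind == "allow" and path]
    if "/" in disallows and not allows:
        return "blocked"
    if disallows:
        return "partial"
    return "allowed"


SAMPLE = """\
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /private/

User-agent: *
Allow: /
"""

for name in ("GPTBot", "ClaudeBot", "PerplexityBot"):
    print(name, classify(SAMPLE, name))
```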

Common Use Cases

  • SEO teams: AI crawler policy audits
  • Publishers: policy governance and change tracking
  • AI companies: crawler access monitoring
  • Researchers: AI opt-out trend measurement

Cost Notes

  • Uses public robots.txt only
  • No external API or proxy dependency required

Commercial Ops

Set up .env:

$ cp -n .env.example .env

Cloud task/schedule setup:

$ npm run apify:cloud:setup

Live checks:

$ npm run canary:check
$ npm run contract:test:live

OpenClaw cron examples:

  • openclaw-cron-commands.md