Website Content Crawler
Pricing
from $0.01 / 1,000 results
Website Content Crawler
Crawl websites for SEO audits. Extracts HTML, title, meta tags, headings, links, & text content from pages. Automatic sitemap detection & parsing Extracts metadata (title, description, OG tags) Heading structure (H1, H2, H3) Internal & external link analysis Image extraction w/alt text Word count
Pricing
from $0.01 / 1,000 results
Rating
0.0
(0)
Developer

John Rippy
Actor stats
0
Bookmarked
3
Total users
2
Monthly active users
2 days ago
Last modified
Categories
Share
Crawl websites for SEO audits. Extracts HTML, title, meta tags, headings, links, and text content from each page.
Features
- Automatic sitemap detection and parsing
- Extracts metadata (title, description, OG tags)
- Heading structure (H1, H2, H3)
- Internal and external link analysis
- Image extraction with alt text
- Word count and load time metrics
Input
{"startUrls": [{ "url": "https://example.com" }],"maxCrawlPages": 25,"maxCrawlDepth": 2,"crawlSitemap": true}
Output
Each page returns:
{"url": "https://example.com/page","title": "Page Title","html": "<html>...</html>","text": "Page text content...","statusCode": 200,"loadTimeMs": 1234,"metadata": {"description": "Meta description","ogTitle": "Open Graph Title","canonical": "https://example.com/page"},"headings": {"h1": ["Main Heading"],"h2": ["Subheading 1", "Subheading 2"],"h3": []},"links": {"internal": ["https://example.com/other"],"external": ["https://external.com"]},"images": [{ "src": "https://example.com/image.jpg", "alt": "Alt text" }],"wordCount": 1500}
Use Cases
- SEO audits
- Content analysis
- Site structure mapping
- Broken link detection
- Meta tag analysis
Pricing
This actor uses pay-per-event pricing:
| Event | Description | Price |
|---|---|---|
page_crawled | Per page crawled | $0.01 |
Example costs:
- Crawl 25 pages: 25 × $0.01 = $0.25
- Crawl 100 pages: 100 × $0.01 = $1.00
Author
Built by John Rippy | johnrippy.link
Keywords
website crawler, web scraper, site crawler, seo crawler, content extraction, website data extraction