Sitemap Health Validator

Pricing

Pay per usage

Validates sitemap.xml files and checks health of listed URLs

Rating: 5.0 (2)

Developer: riya rawat
Maintained by Community

Actor stats

Bookmarked: 0
Total users: 33
Monthly active users: 16
Last modified: 7 hours ago

JavaScript Website Scraper (Crawlee + Cheerio)

A fast, lightweight Apify Actor for scraping static and semi-dynamic websites using Crawlee’s CheerioCrawler. The Actor extracts page titles and URLs from provided start pages and stores the results in an Apify Dataset.

The Actor is designed for performance, low resource usage, and easy extensibility. It complies with the Apify Actor Store rules and is suitable for the Apify $1 Million Actor Challenge.
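The flow described above (fetch each start page, read its title, store the title and URL in a dataset) might look roughly like the following sketch. It is not the Actor's actual source: the helper names `buildRecord` and `main`, the output field names `url` and `title`, and the fallback of 10 pages are assumptions made for illustration. The `apify` and `crawlee` packages are loaded lazily inside `main` so that the record-building helper can be tried on its own.

```javascript
// Build one dataset record from a parsed page.
// `$` is the Cheerio handle that CheerioCrawler passes to the request handler.
// The field names `url` and `title` are assumed, not taken from the Actor.
function buildRecord($, request) {
    return {
        // Prefer the final URL after redirects when Crawlee provides it.
        url: request.loadedUrl || request.url,
        title: $('title').first().text().trim(),
    };
}

// Hypothetical entry point for the Actor.
async function main() {
    const { Actor } = await import('apify');
    const { CheerioCrawler } = await import('crawlee');

    await Actor.init();

    // The default of 10 pages is an assumption for this sketch.
    const { startUrls = [], maxPagesPerCrawl = 10 } = (await Actor.getInput()) ?? {};

    const crawler = new CheerioCrawler({
        maxRequestsPerCrawl: maxPagesPerCrawl,
        async requestHandler({ request, $ }) {
            // Store one record per page in the default Apify Dataset.
            await Actor.pushData(buildRecord($, request));
        },
    });

    // Only the provided start pages are visited; no links are enqueued.
    await crawler.run(startUrls);
    await Actor.exit();
}
```

Calling `main()` at the end of the entry file would start a run. Keeping `buildRecord` separate from the crawler setup is one way to make the output shape easy to extend with further fields.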


Key Features

  • Fast HTML parsing using Cheerio (no browser required)
  • Crawlee-powered request handling and concurrency
  • Low memory usage (works on Apify free plan)
  • Proxy support for reduced blocking
  • Structured and consistent dataset output
  • Easy to customize and extend

Input

startUrls

Type: Array of objects, each with a url field
Description: List of URLs where the crawler starts.

maxPagesPerCrawl

Type: Number
Description: Maximum number of pages to scrape.

Example Input

{
  "startUrls": [
    { "url": "https://example.com" }
  ],
  "maxPagesPerCrawl": 10
}
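
Input of this shape could be checked before crawling starts. The following is a minimal sketch, not the Actor's own validation: the function name `normalizeInput`, the default of 10 pages, the acceptance of bare URL strings, and the restriction to http and https URLs are all assumptions.

```javascript
// Validate raw Actor input and return a normalized copy.
// Field names come from the Input section; the defaults and rules are assumed.
function normalizeInput(raw) {
    const input = raw ?? {};

    if (!Array.isArray(input.startUrls) || input.startUrls.length === 0) {
        throw new Error('startUrls must be a non-empty array');
    }

    const startUrls = input.startUrls.map((entry) => {
        // Accept both { "url": "..." } objects and bare strings.
        const value = typeof entry === 'string' ? entry : entry && entry.url;
        let parsed;
        try {
            parsed = new URL(value);
        } catch {
            throw new Error(`Invalid start URL: ${String(value)}`);
        }
        if (parsed.protocol !== 'http:' && parsed.protocol !== 'https:') {
            throw new Error(`Unsupported protocol in start URL: ${parsed.href}`);
        }
        return { url: parsed.href };
    });

    // Assumed default when the field is omitted.
    const maxPagesPerCrawl = input.maxPagesPerCrawl ?? 10;
    if (!Number.isInteger(maxPagesPerCrawl) || maxPagesPerCrawl < 1) {
        throw new Error('maxPagesPerCrawl must be a positive integer');
    }

    return { startUrls, maxPagesPerCrawl };
}
```

Passing the example input through `normalizeInput` returns the same two fields, with each URL serialized by the WHATWG `URL` parser (so `https://example.com` gains a trailing slash).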