Generic Html Scraper
Pricing
from $1.00 / 1,000 results
Go to Apify Store
Generic Html Scraper
A lightweight, robust, and simple actor to fetch the raw HTML content of any URL
Pricing
from $1.00 / 1,000 results
Rating
5.0
(1)
Developer

DaddyAPI
Maintained by Community
Actor stats
1
Bookmarked
3
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
A lightweight, robust, and simple actor to fetch the raw HTML content of any URL using Cheerio.
Features
- Generic Fetching: Returns the full HTML body of any provided URL.
- Proxy Flexibility: Full support for Apify Proxy (Datacenter/Residential) and custom proxies.
- Anti-Blocking: Includes basic header generation to mimic real browsers.
- Lightweight: Designed to run with minimal resources.
Usage
Simply provide a list of startUrls. The actor will visit each one and save the HTML to the dataset.
Input Parameters
| Field | Type | Description |
|---|---|---|
startUrls | Array | List of URLs to fetch. |
proxyConfiguration | Object | Proxy settings. Residential proxies are recommended for hard-to-scrape sites. |
debugLog | Boolean | Enable verbose logging for debugging. |
Output
The actor stores results in the default dataset. Each item contains:
{"url": "https://example.com","title": "Example Domain","html": "<!doctype html><html>...</html>","scrapedAt": "2023-10-27T10:00:00.000Z"}
Recommended Resources
- Memory:
256 MBis usually sufficient. - Timeout: Default is fine, adjust if fetching very slow sites.


