Generic Html Scraper

Pricing

from $1.00 / 1,000 results

Rating

5.0

(1)

Developer

DaddyAPI

Maintained by Community

Actor stats

  • Bookmarked: 1
  • Total users: 3
  • Monthly active users: 1
  • Last modified: 2 days ago

A lightweight, robust, and simple actor to fetch the raw HTML content of any URL using Cheerio.

Features

  • Generic Fetching: Returns the full HTML body of any provided URL.
  • Proxy Flexibility: Full support for Apify Proxy (Datacenter/Residential) and custom proxies.
  • Anti-Blocking: Includes basic header generation to mimic real browsers.
  • Lightweight: Designed to run with minimal resources.

Usage

Simply provide a list of startUrls. The actor will visit each one and save the HTML to the dataset.
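
To make the fetch-and-save flow concrete, here is a minimal Python sketch of what handling one URL might look like. It is not the actor's source (the actor itself uses Cheerio); `fetch_html` and `build_item` are hypothetical helpers, the User-Agent value is only illustrative, and the item fields are assumed to match the sample item in the Output section.

```python
from datetime import datetime, timezone
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class _TitleParser(HTMLParser):
    """Collects the text inside the first <title> element."""

    def __init__(self) -> None:
        super().__init__()
        self._in_title = False
        self._done = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def build_item(url: str, html: str, now: datetime) -> dict:
    """Shape one result like a dataset item (hypothetical helper)."""
    parser = _TitleParser()
    parser.feed(html)
    stamp = now.astimezone(timezone.utc).isoformat(timespec="milliseconds")
    return {
        "url": url,
        "title": parser.title.strip(),
        "html": html,
        # UTC timestamp with millisecond precision and a trailing "Z".
        "scrapedAt": stamp.replace("+00:00", "Z"),
    }


def fetch_html(url: str, timeout: float = 30.0) -> str:
    """Download a page body using a browser-like User-Agent header."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0 (X11; Linux x86_64)"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")
```

A caller would loop over the start URLs, pass each result of `fetch_html` to `build_item`, and append the returned dictionary to its output. Keeping the network call separate from the item shaping makes the shaping step easy to check offline.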

Input Parameters

Field               Type     Description
startUrls           Array    List of URLs to fetch.
proxyConfiguration  Object   Proxy settings. Residential proxies are recommended for hard-to-scrape sites.
debugLog            Boolean  Enable verbose logging for debugging.
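
As an illustration of how the three fields fit together, the sketch below assembles an input object in Python. `build_input` is a hypothetical helper. Two things are assumptions rather than facts from this page: that each `startUrls` entry is an object with a `url` key, and that `proxyConfiguration` accepts the `useApifyProxy` and `apifyProxyGroups` keys commonly used for Apify Proxy. Check the actor's input schema before relying on either.

```python
import json


def build_input(urls, residential=False, debug=False):
    """Assemble an input object with the three documented fields."""
    if not urls:
        raise ValueError("startUrls must contain at least one URL")
    proxy = {"useApifyProxy": True}
    if residential:
        # Assumption: the residential pool is selected by this group name.
        proxy["apifyProxyGroups"] = ["RESIDENTIAL"]
    return {
        # Assumption: each entry is an object with a "url" key.
        "startUrls": [{"url": u} for u in urls],
        "proxyConfiguration": proxy,
        "debugLog": debug,
    }


if __name__ == "__main__":
    example = build_input(["https://example.com"], residential=True)
    print(json.dumps(example, indent=2))
```

The residential option is off by default here because the description recommends residential proxies only for hard-to-scrape sites.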

Output

The actor stores results in the default dataset. Each item contains:

{
  "url": "https://example.com",
  "title": "Example Domain",
  "html": "<!doctype html><html>...</html>",
  "scrapedAt": "2023-10-27T10:00:00.000Z"
}

  • Memory: 256 MB is usually sufficient.
  • Timeout: The default is fine; increase it only when fetching very slow sites.
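
The memory and timeout notes above could be applied when starting a run from code. The following is a rough sketch, assuming a client object that behaves like `ApifyClient` from the `apify-client` Python package. `run_scraper` is a hypothetical wrapper, and the actor ID and API token are placeholders you must supply yourself.

```python
def run_scraper(client, actor_id, run_input, memory_mbytes=256, timeout_secs=None):
    """Start the actor, wait for it to finish, and return its dataset items.

    `client` is expected to behave like apify_client.ApifyClient.
    Leaving `timeout_secs` as None keeps the actor's default timeout.
    """
    run = client.actor(actor_id).call(
        run_input=run_input,
        memory_mbytes=memory_mbytes,
        timeout_secs=timeout_secs,
    )
    if run is None:
        raise RuntimeError("Actor run did not return run details")
    # Results are stored in the run's default dataset.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Example wiring (requires `pip install apify-client` and a valid token):
#
#   from apify_client import ApifyClient
#   client = ApifyClient("<YOUR_API_TOKEN>")
#   items = run_scraper(client, "<username>/<actor-name>", {"startUrls": [...]})
#   for item in items:
#       print(item["url"], len(item["html"]))
```

Passing the client in as an argument keeps the wrapper free of credentials and lets it be exercised with a stub before spending any platform usage.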