Pricing

from $1.00 / 1,000 results

Try for free

Go to Apify Store

Generic Html Scraper

Try for free

A lightweight, robust, and simple actor to fetch the raw HTML content of any URL

Pricing

from $1.00 / 1,000 results

Rating

5.0

(1)

Developer

DaddyAPI

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

Features

Generic Fetching: Returns the full HTML body of any provided URL.
Proxy Flexibility: Full support for Apify Proxy (Datacenter/Residential) and custom proxies.
Anti-Blocking: Includes basic header generation to mimic real browsers.
Lightweight: Designed to run with minimal resources.

Usage

Simply provide a list of startUrls. The actor will visit each one and save the HTML to the dataset.

Input Parameters

Field	Type	Description
`startUrls`	Array	List of URLs to fetch.
`proxyConfiguration`	Object	Proxy settings. Residential proxies are recommended for hard-to-scrape sites.
`debugLog`	Boolean	Enable verbose logging for debugging.

Output

The actor stores results in the default dataset. Each item contains:

{
  "url": "https://example.com",
  "title": "Example Domain",
  "html": "<!doctype html><html>...</html>",
  "scrapedAt": "2023-10-27T10:00:00.000Z"
}

Recommended Resources

Memory: 256 MB is usually sufficient.
Timeout: Default is fine, adjust if fetching very slow sites.

Download HTML from URLs

datapilot/download-html-from-urls

This script with an Apify Actor to fetch the complete HTML source of any website. The user provides a URL, the page is loaded with JavaScript execution, the full HTML is printed in the terminal, saved to an HTML file,

Data Pilot

HTML Scraper

making-data-meaningful/html-scraper

Access and extract full HTML source code from any webpage instantly. The HTML Scraper API lets you retrieve clean, accurate page HTML for SEO analysis, web scraping, and content monitoring - all without being blocked.

Making Data Meaningful

HTML Extractor

dataguru/html-extractor

Fetch raw HTML from any URL. Inputs: url, headers (textarea), timeout, mobile UA toggle, optional proxy. Outputs: Dataset + KV “OUTPUT” with {url, statusCode, contentType, length, html}. Perfect for Make.com, debugging, and pipelines.

Achraf Mannani

132

4.0

My Actor

david15999/my-actor

HTML scraper

David Emanuel Moreira

Download HTML from URLs

mtrunkat/url-list-download-html

This actor takes a list of URLs and downloads HTML of each page.

Marek Trunkát

9.1K

Download HTML from URLs

scrapeai/html-downloader

This actor takes a list of URLs and downloads HTML of each page.

ScrapeAI

5.0

HTML Scraper pro

scrapingxpert/html-scraper-pro

The HTML Scraper Pro is a powerful tool designed to extract the HTML source code and metadata from websites. It uses advanced web scraping techniques to retrieve the full HTML content of web pages,page title and HTTP status code.This tool is ideal for data extraction, website analysis, and archiving

scrapingxpert

278

5.0

HTML to JSON Smart Parser

parseforge/html-to-json-smart-parser

Convert HTML to structured JSON using AI! Uses OpenAI to extract and structure data from HTML into clean JSON format. Perfect for developers and data analysts who need to transform HTML into structured data without manual parsing.

ParseForge

5.0

Crawlee HTML Scraper

ellustar/my-actor-28

Crawlee HTML Scraper is a fast, lightweight web scraping actor built with JavaScript, Crawlee, and Cheerio. It efficiently extracts structured data from static HTML pages, supports custom selectors, pagination, and scalable crawling for reliable web data collection.