Pricing

from $1.00 / 1,000 results

Headless Browser HTML Scraper

Render any URL in a real headless browser and return the fully-rendered HTML, the page text, or a selected area by CSS selector. Scroll for lazy content, wait for elements, and capture screenshots. A browserless-style HTML API on Apify.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

Dev Patel

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

What it does

🌐 Renders any URL with a real browser (JavaScript executed)
🧩 Selected area — pass a CSS selector and get every matching element's HTML, text, attributes, and position
📜 Scroll to bottom — trigger infinite-scroll / lazy-loaded content with real wheel events
⏳ Wait for a selector, a load event, or a fixed delay
🖼️ Optional full-page screenshot
🚫 Block images/media/fonts/CSS to speed up and cut bandwidth
🔌 Use it synchronously as an API (run-sync-get-dataset-items)

Input

Field	Type	Description
`urls`	array	Required. URLs to render and scrape.
`selector`	string	Optional CSS selector for the "selected area". Returns each match's HTML/text/attributes/position. Empty = full page only.
`scrollToBottom`	boolean	Scroll down to load lazy content. Default `false`.
`maxScrolls`	integer	Max scroll rounds when scrolling. Default `15`.
`waitForSelector`	string	Wait until this selector appears (≤30s).
`waitUntil`	enum	`domcontentloaded` (default) · `load` · `networkidle`.
`waitMs`	integer	Extra fixed wait after load (ms).
`htmlMode`	enum	`full` (entire DOM, default) or `visible` — just the above-the-fold content shown on open (no scroll), scripts/styles stripped for a short, clean HTML.
`blockResources`	array	Resource types to block. Default `["media","font"]`.
`returnFullHtml`	boolean	Include the rendered HTML (full or visible per `htmlMode`). Default `true`.
`returnText`	boolean	Include page visible text. Default `true`.
`includeScreenshot`	boolean	Capture a full-page screenshot and return its URL. Default `false`.
`proxyConfiguration`	object	Apify Proxy (datacenter) by default; use Residential for bot-protected sites.

Example: full HTML of a JS-rendered page

{ "urls": [{ "url": "https://www.example.com" }], "waitUntil": "networkidle" }

Example: extract a selected area, after scrolling

{
    "urls": [{ "url": "https://news.ycombinator.com" }],
    "selector": "span.titleline a",
    "scrollToBottom": true
}

Output

One record per URL:

{
    "url": "https://www.example.com",
    "loadedUrl": "https://www.example.com/",
    "statusCode": 200,
    "title": "Example Domain",
    "html": "<!DOCTYPE html><html>...</html>",
    "text": "Example Domain\nThis domain is for use in...",
    "selectedCount": 30,
    "selectedElements": [
        {
            "text": "Some headline",
            "html": "<a href=\"...\">Some headline</a>",
            "attributes": [{ "name": "href", "value": "https://..." }],
            "width": 320, "height": 18, "top": 140, "left": 24
        }
    ],
    "screenshotUrl": "https://api.apify.com/v2/key-value-stores/.../records/screenshot-1",
    "scrapedAt": "2026-06-13T08:00:00.000Z"
}

Use as an API

curl -X POST "https://api.apify.com/v2/acts/USERNAME~browserless-html-scraper/run-sync-get-dataset-items?token=TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls":[{"url":"https://www.example.com"}],"selector":"h1"}'

Notes

For bot-protected sites, switch proxyConfiguration to Residential.
Blocking image/stylesheet speeds things up but can break layout-dependent lazy scrolling on some sites — keep them enabled (don't block) when using scrollToBottom on such pages.

Html Renderer

jakubbalada/html-renderer

Generate image for your HTML using a headless browser

Jakub Balada

Screenshot & HTML file from Url

leadsbrary/screenshot-html-file-from-url

From 1$/1000 results. Capture website screenshots &/or full-page HTML in one run, from $1/1000 URLs. PNG, JPEG & PDF — full-page, custom viewport, lazy-load scroll, cookie-banner hiding, batch mode. HTML files open correctly in any browser. REST API ready. No watermark.

Alexandre Manguis

5.0

Screenshots from HTML

vojtam/screenshots-from-html

Actor creates screenshots from a saved HTML structure.

Vojtěch Mašláň

Website Screenshot Generator

perryay/website-screenshot-generator

Capture screenshots of any website using Playwright headless browser. Full-page or viewport capture. Supports batch mode (up to 50 URLs), device presets (desktop/tablet/mobile), CSS selector targeting, PNG/JPEG/WebP output.

Perry AY

HTML Scraper

making-data-meaningful/html-scraper

Access and extract full HTML source code from any webpage instantly. The HTML Scraper API lets you retrieve clean, accurate page HTML for SEO analysis, web scraping, and content monitoring - all without being blocked.

Scrape Hub

HTML to PDF Converter

rainminer/html-to-pdf-converter

Convert raw HTML or web page URLs into downloadable PDF files using a real browser. Render CSS, images, tables, invoices, reports, and dynamic layouts, then save the generated PDF to the Apify Key-Value Store with dataset metadata.

rainminer

Download HTML from URLs

datapilot/download-html-from-urls

This script with an Apify Actor to fetch the complete HTML source of any website. The user provides a URL, the page is loaded with JavaScript execution, the full HTML is printed in the terminal, saved to an HTML file,

Data Pilot

Generic Html Scraper

daddyapi/generic-html-scraper

A lightweight, robust, and simple actor to fetch the raw HTML content of any URL

DaddyAPI

HTML Table Extractor

benthepythondev/html-table-extractor

Extract structured rows from HTML tables on any web page.

Ben

HTML Scraper pro

scrapingxpert/html-scraper-pro

The HTML Scraper Pro is a powerful tool designed to extract the HTML source code and metadata from websites. It uses advanced web scraping techniques to retrieve the full HTML content of web pages,page title and HTTP status code.This tool is ideal for data extraction, website analysis, and archiving