Web Page Scraper
Pricing
From $20.00 / 1,000 successful API calls
Full page scrape via Firecrawl. Returns HTML, markdown, links, and title.
Developer
Alex Jordan
Maintained by Community
What does Web Page Scraper do?
Web Page Scraper fetches the full content of a publicly accessible web page and returns it as raw HTML, clean Markdown, extracted links, and the page title. Anti-bot protection is handled via Firecrawl, so no proxy configuration or browser setup is required.
Built on the Apify platform, this Actor runs in seconds and integrates with Apify's scheduling, webhooks, and 1,500+ tools via Zapier and Make.
Why use Web Page Scraper?
- AI content pipelines — Feed clean Markdown page content directly into LLMs for summarisation, classification, or Q&A
- Content monitoring — Track changes to competitor pricing pages, landing pages, or documentation
- Data extraction — Scrape structured content from pages that block traditional scrapers
- Link discovery — Extract all outbound and internal links from any page for SEO or crawler seeding
- Research automation — Bulk-scrape article pages and convert them to Markdown for analysis
How to use Web Page Scraper
- Click Try for free on this Actor's page
- Enter the URL of the page you want to scrape (e.g. https://example.com)
- Optionally specify formats to control what's returned (html, markdown, links)
- Click Start and wait a few seconds
- Download your results from the Output tab in JSON, CSV, or Excel
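The same run can also be started from code rather than from the Console. The sketch below builds a request for Apify's synchronous "run Actor and return dataset items" endpoint using only the Python standard library. The Actor ID `someuser~web-page-scraper` and the token value are hypothetical placeholders, and the endpoint path reflects the public Apify API v2 as generally documented, so check it against the current API reference before relying on it.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request that runs an Actor synchronously."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    body = json.dumps(run_input).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )


def run_actor(actor_id: str, token: str, run_input: dict, timeout: float = 120.0) -> list:
    """Send the request and return the dataset items as parsed JSON."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Hypothetical Actor ID and token; replace both with real values before calling run_actor().
request = build_run_request(
    "someuser~web-page-scraper",
    "YOUR_APIFY_TOKEN",
    {"url": "https://example.com", "formats": ["markdown", "links"]},
)
```

Building the request separately from sending it makes the URL, headers, and body easy to inspect before any billable call is made.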
Input
| Field | Type | Required | Description |
|---|---|---|---|
| url | string | ✅ | URL of the page to scrape |
| formats | array | ❌ | Output formats: html, markdown, links (default: all) |
| cache | boolean | ❌ | Use cached result if available (default: true) |
Example input:
    {
      "url": "https://example.com",
      "formats": ["markdown", "links"]
    }
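When inputs are generated by a script, it can help to validate them against the input fields before starting a run. The helper below is a minimal sketch: `build_input` is a hypothetical name, and the http/https scheme check is an assumption rather than a documented rule of this Actor. Optional fields are omitted when not supplied so that the Actor's own defaults apply.

```python
from urllib.parse import urlparse

# Format names taken from the input table above.
ALLOWED_FORMATS = ("html", "markdown", "links")


def build_input(url, formats=None, cache=None):
    """Return an input dict for the Actor, raising ValueError on obvious mistakes."""
    parsed = urlparse(url)
    # Assumption: only absolute http(s) URLs are meaningful targets.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) URL: {url!r}")

    run_input = {"url": url}

    if formats is not None:
        unknown = [name for name in formats if name not in ALLOWED_FORMATS]
        if unknown:
            raise ValueError(f"unsupported formats: {unknown}")
        run_input["formats"] = list(formats)

    if cache is not None:
        run_input["cache"] = bool(cache)

    return run_input


example = build_input("https://example.com", formats=["markdown", "links"])
```

Here `example` has the same content as the example input shown above.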
Output
Example output:
    {
      "html": "<!DOCTYPE html><html>...",
      "markdown": "# Example Domain\n\nThis domain is for use in illustrative examples...",
      "links": ["https://www.iana.org/domains/reserved"],
      "title": "Example Domain",
      "meta": { "cache_hit": false, "execution_time_ms": 890 }
    }
You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.
Data fields
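| Field | Type | Description |
|---|---|---|
| html | string | Full raw HTML of the page |
| markdown | string | Page content converted to clean Markdown |
| links | array | All links found on the page |
| title | string | Page title from the `<title>` tag |
| meta | object | Run metadata, as shown in the example output: cache_hit (boolean) and execution_time_ms (number) |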
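For link discovery, the `links` array usually needs to be split into internal and external links before it is used for SEO checks or crawler seeding. The sketch below does this with the standard library. `split_links` is a hypothetical helper, and skipping non-http(s) links such as `mailto:` is an assumption about what is useful downstream. Because the output does not include the page URL, it is taken from the input.

```python
from urllib.parse import urljoin, urlparse


def split_links(page_url, links):
    """Split links into (internal, external) lists relative to page_url's host."""
    page_host = urlparse(page_url).netloc.lower()
    internal, external = [], []
    for link in links:
        # Resolve relative links against the page URL; absolute links pass through unchanged.
        absolute = urljoin(page_url, link)
        parsed = urlparse(absolute)
        if parsed.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, tel:, and similar
        if parsed.netloc.lower() == page_host:
            internal.append(absolute)
        else:
            external.append(absolute)
    return internal, external


# Item shaped like the example output above, plus two illustrative extra links.
item = {
    "links": [
        "https://www.iana.org/domains/reserved",
        "/about",
        "mailto:hello@example.com",
    ],
    "title": "Example Domain",
}
internal, external = split_links("https://example.com", item["links"])
```

Hosts are compared exactly, so `www.example.com` and `example.com` count as different sites. Adjust the comparison if subdomains should be treated as internal.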
Pricing / Cost estimation
$0.02 per successful API call on Apify.
- 1,000 successful Apify runs = $20.00
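The arithmetic above can be wrapped in a small estimator for budgeting scheduled runs. This is a sketch that assumes the flat $0.02 rate quoted here. The listing says "from $20.00", so the actual rate may differ, and `estimate_cost` is a hypothetical helper name.

```python
from decimal import Decimal

# Rate quoted in this section; treated as an assumption for budgeting purposes.
PRICE_PER_SUCCESSFUL_CALL = Decimal("0.02")


def estimate_cost(successful_calls: int) -> Decimal:
    """Return the estimated cost in USD for a number of successful calls."""
    if successful_calls < 0:
        raise ValueError("successful_calls must be non-negative")
    return PRICE_PER_SUCCESSFUL_CALL * successful_calls


# Example: monitoring 50 pages once a day for 30 days is 1,500 calls.
monthly_cost = estimate_cost(50 * 30)
```

`Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.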
FAQ & Support
Is this legal? This Actor scrapes publicly accessible web pages. Always respect the target site's robots.txt and Terms of Service.
Known limitations: Heavily JavaScript-dependent single-page apps (SPAs) may return incomplete content. Login-required pages are not supported.
Need help? Open an issue in the Issues tab or contact the support team for custom solutions.