Web Page Scraper avatar

Web Page Scraper

Pricing

from $20.00 / 1,000 successful api calls

Go to Apify Store
Web Page Scraper

Web Page Scraper

Full page scrape via Firecrawl. Returns HTML, markdown, links, and title.

Pricing

from $20.00 / 1,000 successful api calls

Rating

0.0

(0)

Developer

Alex Jordan

Alex Jordan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

5 days ago

Last modified

Categories

Share

What does Web Page Scraper do?

Web Page Scraper fetches the full content of any web page and returns it as raw HTML, clean Markdown, extracted links, and page title — with anti-bot protection handled via Firecrawl. No proxy configuration needed, no browser setup required.

Built on the Apify platform, this Actor runs in seconds and integrates with Apify's scheduling, webhooks, and 1,500+ tools via Zapier and Make.

Why use Web Page Scraper?

  • AI content pipelines — Feed clean Markdown page content directly into LLMs for summarisation, classification, or Q&A
  • Content monitoring — Track changes to competitor pricing pages, landing pages, or documentation
  • Data extraction — Scrape structured content from pages that block traditional scrapers
  • Link discovery — Extract all outbound and internal links from any page for SEO or crawler seeding
  • Research automation — Bulk-scrape article pages and convert them to Markdown for analysis

How to use Web Page Scraper

  1. Click Try for free on this Actor's page
  2. Enter the URL of the page you want to scrape (e.g. https://example.com)
  3. Optionally specify formats to control what's returned (html, markdown, links)
  4. Click Start and wait a few seconds
  5. Download your results from the Output tab in JSON, CSV, or Excel

Input

FieldTypeRequiredDescription
urlstringURL of the page to scrape
formatsarrayOutput formats: html, markdown, links (default: all)
cachebooleanUse cached result if available (default true)

Example input:

{
"url": "https://example.com",
"formats": ["markdown", "links"]
}

Output

Example output:

{
"html": "<!DOCTYPE html><html>...",
"markdown": "# Example Domain\n\nThis domain is for use in illustrative examples...",
"links": [
"https://www.iana.org/domains/reserved"
],
"title": "Example Domain",
"meta": { "cache_hit": false, "execution_time_ms": 890 }
}

You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.

Data fields

FieldTypeDescription
htmlstringFull raw HTML of the page
markdownstringPage content converted to clean Markdown
linksarrayAll links found on the page
titlestringPage title from the <title> tag

Pricing / Cost estimation

$0.02 per successful API call on Apify.

  • 1,000 successful Apify runs = $20.00

FAQ & Support

Is this legal? This Actor scrapes publicly accessible web pages. Always respect the target site's robots.txt and Terms of Service.

Known limitations: Heavily JavaScript-dependent single-page apps (SPAs) may return incomplete content. Login-required pages are not supported.

Need help? Open an issue in the Issues tab or contact the support team for custom solutions.