Pricing

from $10.00 / 1,000 results

Web Page to Single-Page PDF & HTML (Automation-Ready)

Convert webpages to single-page PDFs and extract raw HTML via API. Captures full scroll height (no A4 splits). Built for automation with n8n, Make, and Zapier. Ideal for archiving, AI workflows, compliance, and bulk processing.

Pricing

from $10.00 / 1,000 results

Rating

0.0

(0)

Developer

Gavin Campbell

Actor stats

Bookmarked

Total users

Monthly active users

7 days ago

Last modified

Web Page to Single-Page PDF Converter (Automation Ready)

Capture full-length webpages as single-page PDFs and extract raw HTML source code via API.

Designed for seamless integration with automation platforms like n8n, Make.com, and Zapier, this Apify Actor allows you to programmatically archive web content, generate visual reports, and feed clean data into your AI workflows.

Unlike standard converters that cut pages into A4 sheets, this tool captures the entire scrollable area of a webpage into one continuous PDF file, ensuring no data is cut off at page breaks.

🚀 Key Features

Single-Page "Long" PDFs: Captures the full height of the webpage in a single continuous document. Perfect for newsletters, landing pages, and social media feeds.
HTML Source Extraction: Option to save the exact view-source: HTML code alongside the visual PDF.
Bulk Processing: Handle thousands of URLs in a single run.
Anti-Blocking: Built-in support for Apify Proxy and stealth mode to bypass bot detection.
Smart Waiting: Configurable waitUntil strategies (e.g., networkidle0) ensure dynamic JavaScript content loads completely before capture.

💡 Use Cases

Compliance & Archiving: Automatically screenshot and save the HTML source of your legal pages, T&Cs, or partner sites for compliance auditing.
Marketing Swipe Files: Build a visual database of competitor landing pages, emails, and ad creatives.
AI Knowledge Base: Feed the raw HTML output into LLMs (like ChatGPT or Claude) via n8n to analyze page structure or content without parsing complex DOMs yourself.
Invoicing & Receipts: Convert web-based invoice views into portable PDF files for accounting systems.
Design QA: Automate visual regression testing by capturing full-page renders of your staging environment.

⚙️ Input Configuration

Field	Type	Default	Description
`startUrls`	Array	`[]`	A list of URLs you want to convert. Supports direct URLs or object format.
`saveHtml`	Boolean	`true`	If enabled, saves the raw HTML source code (`.html`) to the Key-Value store.
`proxyConfiguration`	Object	`Apify Proxy`	Recommended to keep enabled to avoid IP bans.
`waitUntil`	String	`networkidle0`	When to take the snapshot. Use `networkidle0` for strict loading or `domcontentloaded` for speed.

🔌 Automation Integrations

This Actor is built to be a backend microservice. Here is how to connect it to your favorite workflow automation tools.

1. n8n Integration

Goal: Trigger the actor from a workflow and download the resulting PDF.

Add the "Apify" Node: In your n8n workflow, add the Apify node.
Select Action: Choose Run Actor.
Actor ID: Search for web-to-pdf-converter (or use the Actor ID from the Apify console).

Input: switch to JSON mode and map your URL:

{
  "startUrls": [{ "url": "{{$json.your_url_field}}" }],
  "saveHtml": true
}

Wait for Finish: Ensure the "Synchronous" option is checked (or use a separate "Wait" node and "Get Dataset Items" node for long runs).
Retrieve Files: The output will contain a pdfUrl. Use an HTTP Request node to GET that URL and save the binary data.

2. Make.com (Integromat) Integration

Goal: Save a webpage to Google Drive every time a new row is added to Google Sheets.

Trigger: Google Sheets (Watch Rows).
Action: Add the Apify module -> Run Actor.

Settings:

Actor: Select this actor.

Body:

{
  "startUrls": [{ "url": "{{1.url}}" }],
  "saveHtml": true
}

Action: Add Apify module -> Get Dataset Items.
- Dataset ID: Map the defaultDatasetId from the previous step.
Action: Add HTTP module -> Get a file.
- URL: Map the pdfUrl from the dataset items.
Action: Google Drive -> Upload a File.

3. Zapier Integration

Goal: Email a PDF version of a webpage when a specific event occurs.

Trigger: Any Zapier trigger (e.g., "New Trello Card").
Action: Search for Apify.
Event: Select Run Actor.

Configure:

Actor: Paste the Actor ID.

Input Body:

{
    "startUrls": [{ "url": "https://example.com" }]
}

Action: Select Apify -> Get Dataset Items (to get the PDF link).
Action: Gmail -> Send Email. Use the pdfUrl in the attachment field or body.

📦 Output Format

The actor stores results in two locations:

Key-Value Store: The physical files.
- Page_Title_hash.pdf (The visual render)
- Page_Title_hash_source.html (The source code)
Dataset: The JSON metadata used for linking.

Sample Dataset JSON:

{
  "url": "https://apify.com",
  "title": "Apify: The Web Scraping and Automation Platform",
  "pdfUrl": "https://api.apify.com/v2/key-value-stores/mYStoReId/records/Apify_hash.pdf",
  "htmlUrl": "https://api.apify.com/v2/key-value-stores/mYStoReId/records/Apify_hash_source.html",
  "timestamp": "2023-10-27T14:30:00.000Z"
}

🛠 Troubleshooting

PDF is blank/white: Try changing waitUntil to networkidle0. This forces the crawler to wait until all network activity (images, scripts) has settled.
Cookie Consent Popups: The actor attempts to hide scrollbars, but popups may obscure content. For complex sites, you may need an actor with custom "click" logic or use a pre-navigation hook (advanced usage).
Access Denied: Ensure you are using the proxyConfiguration set to useApifyProxy: true to avoid 403 errors.

Built with ❤️ using the Apify SDK and Puppeteer.

Html To Pdf Api

simplifysme/html-to-pdf-api

📄 Convert any HTML page or URL to high-quality PDF documents via API. Perfect for reports, invoices, documentation, web page archiving, and automated document generation.

SimplifySME Toolbox

HTML To PDF for N8N

exciting_perfume/HTML-to-PDF-Apify-Actor

Generate accurate PDFs from HTML or URLs using Chromium. Supports CSS, fonts, and backgrounds. Automation-ready and perfect for n8n workflows, reports, invoices, and contracts.

Gavin Campbell

n8n Workflow Automation Templates Scraper

scraped/n8n-workflow-automation-templates-scraper

A tool that automatically scrapes and collects n8n workflow automation templates from the n8n for easy access and use.

scraped

276

n8n-mcp

nourishing_courier/web-data-for-ai

n8n-mcp

Ani Björkström

HTML to PDF converter

apify/html-to-pdf-converter

Convert HTML string to A4 PDF.

Apify

155

4.3

Reddit Scraper - Markdown for AI & n8n

clearpath/reddit-to-llm-api

Extract Reddit posts and comments as LLM-ready Markdown. No API key needed. Direct n8n/Make integration—connect output to AI nodes instantly. 20x faster than browser scrapers. Perfect for lead gen, product validation, and market research workflows.

ClearPath

n8n Documentation MCP Server

agentify/n8n-mcp-server

n8n MCP Server provides AI assistants with structured access to n8n node documentation, properties, and validation tools for building and verifying workflows efficiently.

agentify

N8n Template Scraper

scrapio/n8n-template-scraper

Scrape n8n workflow templates automatically. This actor gathers template metadata, node configurations, integrations, and descriptions. Built for automation engineers, developers, and teams exploring scalable n8n automation ideas.

Scrapio

Reddit Scraper Pro

webdatalabs/reddit-scraper-pro

High-performance Reddit scraper (99%+ success rate) for automation workflows. Monitor subreddits, track keywords with sentiment analysis, scrape comments, and integrate with n8n/Zapier for powerful automation.

WebDataLabs

5.0

n8n Workflow Template Scraper

muhammetakkurtt/n8n-scraper

Automate n8n.io workflow template collection with this Apify actor. Scrape by category (AI, Marketing, DevOps), sort (relevancy, popularity), & get detailed structured data. Fetch importable JSONs for direct n8n use. Ideal for developers, automation experts & businesses.