n8n web scraper node

n8n web scraper node strips navigation, ads, and cookie banners, delivering clean content optimized for AI. Integrates with LangChain, LlamaIndex, and OpenAI.

Trusted by industry leaders all over the world

What data you can get with n8n web scraper node

Extract website content in LLM-ready formats with structure preserved. Download PDFs, Word docs, and Excel files found on pages. Integrates with popular AI frameworks.

Output

{
"url": "https://docs.apify.com/academy/web-scraping-for-beginners",
"html": null,
"text": "Skip to main content\nOn this page\nWeb scraping for beginners\nLearn how to develop web scrapers with this comprehensive and practical course. Go from beginner to expert, all in one place.\nWelcome to Web scraping for beginners...",
"crawl": {
"depth": 0,
"loadedUrl": "https://docs.apify.com/academy/web-scraping-for-beginners",
"loadedTime": "2023-04-05T16:26:51.030Z",
"referrerUrl": "https://docs.apify.com/academy"
},
"markdown": " Web scraping for beginners | Apify Documentation \n\n[Skip to main content](#docusaurus_skipToContent_fallback)\n\nOn this page\n\n# Web scraping for beginners...",
"metadata": {
"title": "Web scraping for beginners | Apify Documentation",
"author": null,
"keywords": null,
"description": "Learn how to develop web scrapers with this comprehensive and practical course. Go from beginner to expert, all in one place.",
"canonicalUrl": "https://docs.apify.com/academy/web-scraping-for-beginners",
"languageCode": "en"
},
"screenshotUrl": null
}

How to set up n8n web scraper node with Apify

Point it at a URL and configure output format and crawl scope. The Actor follows internal links, rendering JavaScript where needed. Choose Markdown, plain text, or HTML.

Sign up for Apify account01

Creating an account is quick and free. No credit card required. Your account gives you access to more than 20,000+ scrapers and APIs.

Start for free
Get your Apify API token02

Go to Settings in Apify Console and navigate to the API & Integrations tab. There, create a new token and save it for later.

Test run Web Scraper Node03

Open Web Scraper Node in Apify Console and configure your input parameters. Click Start to run the Actor and preview the data structure you receive in your n8n workflow.

Integrate Web Scraper Node via n8n04

Add the Apify node to your n8n workflow. Select Run Actor as the operation, choose your Actor, and pass your input configuration as JSON. Enable Wait for finish to retrieve results directly in subsequent nodes.

Why use Apify?

Never get blocked

Never get blocked

Every plan (free included) comes with Apify Proxy, which is great for avoiding blocking and giving you access to geo-specific content.

Customers love us

Customers love us

We truly care about the satisfaction of our users and thanks to that we're one of the best-rated data extraction platforms on both G2 and Capterra.

Monitor your runs

Monitor your runs

With our latest monitoring features, you always have immediate access to valuable insights on the status of your web scraping tasks.

Frequently Asked Questions

Add an HTTP Request node to your n8n workflow and point it to the Apify API. Use your API token for authentication and specify the web scraper node Actor ID you want to run. The Actor executes and returns data directly to your workflow. You can also use n8n's dedicated Apify node if available in your version.

Yes. Apify offers a free tier with prepaid platform usage. This is enough to test Actors with your n8n workflows and run small-scale extractions. No credit card required to start.

No. You can configure Apify Actors through their web interface and connect them to n8n using the HTTP Request node - no coding required. For advanced use cases, you can customize Actor inputs or use the Apify SDK with JavaScript or Python.

Building and maintaining scrapers takes significant time. Websites change their structure, add bot detection, and block requests. Apify Actors handle all of this automatically - proxy rotation, anti-bot bypassing, error handling, and data parsing. You get reliable data without the maintenance burden.

Try n8n Web Scraper Node now