n8n web scraper node
The n8n web scraper node strips navigation, ads, and cookie banners and delivers clean content optimized for AI. It integrates with LangChain, LlamaIndex, and OpenAI.
What data you can get with n8n web scraper node
Extract website content in LLM-ready formats with structure preserved. Download PDFs, Word docs, and Excel files found on pages. Integrates with popular AI frameworks.
Output
{
  "url": "https://docs.apify.com/academy/web-scraping-for-beginners",
  "html": null,
  "text": "Skip to main content\nOn this page\nWeb scraping for beginners\nLearn how to develop web scrapers with this comprehensive and practical course. Go from beginner to expert, all in one place.\nWelcome to Web scraping for beginners...",
  "crawl": {
    "depth": 0,
    "loadedUrl": "https://docs.apify.com/academy/web-scraping-for-beginners",
    "loadedTime": "2023-04-05T16:26:51.030Z",
    "referrerUrl": "https://docs.apify.com/academy"
  },
  "markdown": " Web scraping for beginners | Apify Documentation \n\n[Skip to main content](#docusaurus_skipToContent_fallback)\n\nOn this page\n\n# Web scraping for beginners...",
  "metadata": {
    "title": "Web scraping for beginners | Apify Documentation",
    "author": null,
    "keywords": null,
    "description": "Learn how to develop web scrapers with this comprehensive and practical course. Go from beginner to expert, all in one place.",
    "canonicalUrl": "https://docs.apify.com/academy/web-scraping-for-beginners",
    "languageCode": "en"
  },
  "screenshotUrl": null
}
How to set up n8n web scraper node with Apify
Point it at a URL and configure output format and crawl scope. The Actor follows internal links, rendering JavaScript where needed. Choose Markdown, plain text, or HTML.
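As a rough illustration of the configuration described above, the following Python sketch builds an input object with a start URL, an output format, and a crawl depth. The field names (startUrls, maxCrawlDepth, outputFormat) and the helper build_actor_input are assumptions made for this example, not confirmed parameters of the Actor; check the Actor's input schema in Apify Console for the real names.

```python
import json

# Output formats named in the text above.
ALLOWED_FORMATS = {"markdown", "text", "html"}


def build_actor_input(start_url, output_format="markdown", max_depth=1):
    """Build a hypothetical Actor input; all field names are assumptions."""
    if output_format not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported output format: {output_format}")
    if max_depth < 0:
        raise ValueError("max_depth must be zero or greater")
    return {
        "startUrls": [{"url": start_url}],  # where the crawl begins
        "maxCrawlDepth": max_depth,         # how far to follow internal links
        "outputFormat": output_format,      # markdown, text, or html
    }


actor_input = build_actor_input(
    "https://docs.apify.com/academy/web-scraping-for-beginners"
)
print(json.dumps(actor_input, indent=2))
```

The printed JSON is the kind of object you would paste into the Actor's input editor or pass along from n8n.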
01. Sign up for an Apify account
Creating an account is quick and free. No credit card required. Your account gives you access to more than 20,000 scrapers and APIs.
02. Get your Apify API token
Go to Settings in Apify Console and navigate to the API & Integrations tab. There, create a new token and save it for later.
03. Test run Web Scraper Node
Open Web Scraper Node in Apify Console and configure your input parameters. Click Start to run the Actor and preview the data structure you receive in your n8n workflow.
04. Integrate Web Scraper Node via n8n
Add the Apify node to your n8n workflow. Select Run Actor as the operation, choose your Actor, and pass your input configuration as JSON. Enable Wait for finish to retrieve results directly in subsequent nodes.
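Once results reach subsequent nodes, each item has the shape shown in the Output sample. The sketch below, in Python, shows one way a downstream step might reduce such an item to the fields an LLM pipeline typically needs. The helper name summarize_item is hypothetical, and the sample item is abbreviated from the Output section.

```python
def summarize_item(item):
    """Pick the commonly used fields from one scraped item."""
    metadata = item.get("metadata") or {}
    crawl = item.get("crawl") or {}
    # Prefer Markdown; fall back to plain text when Markdown is missing or null.
    content = item.get("markdown") or item.get("text") or ""
    return {
        "url": item.get("url"),
        "title": metadata.get("title"),
        "language": metadata.get("languageCode"),
        "depth": crawl.get("depth"),
        "content": content.strip(),
    }


# Abbreviated from the Output sample.
sample = {
    "url": "https://docs.apify.com/academy/web-scraping-for-beginners",
    "html": None,
    "crawl": {"depth": 0, "referrerUrl": "https://docs.apify.com/academy"},
    "markdown": " Web scraping for beginners | Apify Documentation \n\n# Web scraping for beginners...",
    "metadata": {
        "title": "Web scraping for beginners | Apify Documentation",
        "languageCode": "en",
    },
}

summary = summarize_item(sample)
print(summary["title"], summary["depth"])
```

Using `or {}` keeps the function from failing on items where metadata or crawl is null, which the sample shows can happen for individual fields.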
Never get blocked
Every plan (free included) comes with Apify Proxy, which helps you avoid blocking and access geo-specific content.
Customers love us
We truly care about the satisfaction of our users, which is why we're one of the best-rated data extraction platforms on both G2 and Capterra.
Monitor your runs
With our latest monitoring features, you always have immediate access to valuable insights on the status of your web scraping tasks.
Add an HTTP Request node to your n8n workflow and point it to the Apify API. Use your API token for authentication and specify the web scraper node Actor ID you want to run. The Actor executes and returns data directly to your workflow. You can also use n8n's dedicated Apify node if available in your version.
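For reference, the request an HTTP Request node would send can be sketched in Python. This assumes the Apify API's run-sync-get-dataset-items endpoint and Bearer-token authentication; verify both against the current Apify API reference. The Actor ID, token, and input shown are placeholders, not real values, and the request is only built here, not sent.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id, token, actor_input):
    """Build (but do not send) a POST that runs an Actor and returns its items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    body = json.dumps(actor_input).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )


# Placeholder values; replace with your own Actor ID, token, and input.
request = build_run_request(
    "username~actor-name",
    "YOUR_APIFY_TOKEN",
    {"startUrls": [{"url": "https://example.com"}]},
)
print(request.get_method(), request.full_url)

# To actually run the Actor, send the request and parse the returned items:
# with urllib.request.urlopen(request) as response:
#     items = json.load(response)
```

In n8n, the same URL, method, headers, and JSON body map onto the corresponding fields of the HTTP Request node.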
Yes. Apify offers a free tier with prepaid platform usage. This is enough to test Actors with your n8n workflows and run small-scale extractions. No credit card required to start.
No. You can configure Apify Actors through their web interface and connect them to n8n using the HTTP Request node - no coding required. For advanced use cases, you can customize Actor inputs or use the Apify SDK with JavaScript or Python.
Building and maintaining scrapers takes significant time. Websites change their structure, add bot detection, and block requests. Apify Actors handle all of this automatically - proxy rotation, anti-bot bypassing, error handling, and data parsing. You get reliable data without the maintenance burden.