Make.com data scraper
Make.com data scraper extracts clean text and Markdown from any site, removing navigation, ads, and cookie warnings. Connect it to Make.com for AI applications.
What data you can get with Make.com data scraper
Page text in plain text or Markdown, metadata (title, description, author, language), canonical URLs, and downloadable files (PDF, DOC, XLSX). All content is cleaned of navigation and ads.
Output
{
  "url": "https://docs.apify.com/academy/web-scraping-for-beginners",
  "html": null,
  "text": "Skip to main content\nOn this page\nWeb scraping for beginners\nLearn how to develop web scrapers with this comprehensive and practical course. Go from beginner to expert, all in one place.\nWelcome to Web scraping for beginners...",
  "crawl": {
    "depth": 0,
    "loadedUrl": "https://docs.apify.com/academy/web-scraping-for-beginners",
    "loadedTime": "2023-04-05T16:26:51.030Z",
    "referrerUrl": "https://docs.apify.com/academy"
  },
  "markdown": " Web scraping for beginners | Apify Documentation \n\n[Skip to main content](#docusaurus_skipToContent_fallback)\n\nOn this page\n\n# Web scraping for beginners...",
  "metadata": {
    "title": "Web scraping for beginners | Apify Documentation",
    "author": null,
    "keywords": null,
    "description": "Learn how to develop web scrapers with this comprehensive and practical course. Go from beginner to expert, all in one place.",
    "canonicalUrl": "https://docs.apify.com/academy/web-scraping-for-beginners",
    "languageCode": "en"
  },
  "screenshotUrl": null
}
How to set up Make.com data scraper with Apify
Configure the Actor with your target URL via Make.com's Apify integration. It discovers and processes linked pages, returning clean text or Markdown for your LLM or vector database.
01 Sign up for an Apify account
Creating an account is quick and free, with no credit card required. Your account gives you access to more than 20,000 scrapers and APIs.
02 Get your Apify API token
Go to Settings in Apify Console and navigate to the API & Integrations tab. There, create a new token and save it for later.
03 Test run Make.com data scraper
Open the data scraper in Apify Console and configure your input parameters. Click Start to run the Actor and preview the data structure you will receive in your Make scenario.
04 Integrate the data scraper via Make.com
Search for Apify in the Make module library and add the "Run an Actor" module to your scenario. Connect using OAuth or paste your API token. Select your Actor, configure the input, and use "Get Dataset Items" to retrieve results for downstream modules.
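For readers who want to see roughly what the "Run an Actor" and "Get Dataset Items" modules correspond to outside Make, the following is a minimal sketch that builds, but does not send, a request to the Apify API. The Actor ID `apify~website-content-crawler`, the `startUrls` input field, and the `YOUR_APIFY_TOKEN` placeholder are assumptions for illustration; this page does not name the Actor, so check its input schema in Apify Console.

```python
import json
import urllib.request
from urllib.parse import quote

API_BASE = "https://api.apify.com/v2"


def build_run_request(token: str, actor_id: str, actor_input: dict) -> urllib.request.Request:
    """Build (but do not send) a POST that runs an Actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{quote(actor_id, safe='~')}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",  # the API token from step 02
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Assumed Actor ID and input; replace both with the values for your Actor.
request = build_run_request(
    token="YOUR_APIFY_TOKEN",
    actor_id="apify~website-content-crawler",
    actor_input={"startUrls": [{"url": "https://docs.apify.com/academy"}]},
)
print(request.get_method(), request.full_url)
```

Passing `request` to `urllib.request.urlopen` would start a real run on your account, so the sketch stops at constructing it. Long crawls may not finish within a synchronous call, in which case starting the run and reading the dataset later is the safer pattern.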
Never get blocked
Every plan, including the free one, comes with Apify Proxy, which helps you avoid blocking and access geo-specific content.
Customers love us
We truly care about our users' satisfaction, and thanks to that we're one of the best-rated data extraction platforms on both G2 and Capterra.
Monitor your runs
With our latest monitoring features, you always have immediate access to valuable insights on the status of your web scraping tasks.
Use Make.com's Apify integration to trigger crawls and receive results via webhook. Route the extracted content to Google Sheets, databases, or AI services.
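Before a result reaches a spreadsheet or database, it usually has to be flattened from nested JSON into one row. The sketch below shows one way to do that, using the field names from the sample output on this page. The function name `record_to_row` and the choice of columns are hypothetical; in Make itself you would map these fields in the module UI instead.

```python
def record_to_row(record: dict) -> list:
    """Flatten one crawl result into a flat row for a spreadsheet or SQL table."""
    metadata = record.get("metadata") or {}  # tolerate a missing or null object
    crawl = record.get("crawl") or {}
    text = record.get("text") or ""
    return [
        record.get("url"),
        metadata.get("title"),
        metadata.get("languageCode"),
        crawl.get("loadedTime"),
        len(text),  # character count as a quick sanity check on extraction
    ]


# A shortened version of the sample record shown in the Output section.
sample = {
    "url": "https://docs.apify.com/academy/web-scraping-for-beginners",
    "text": "Web scraping for beginners",
    "crawl": {"depth": 0, "loadedTime": "2023-04-05T16:26:51.030Z"},
    "metadata": {
        "title": "Web scraping for beginners | Apify Documentation",
        "author": None,
        "languageCode": "en",
    },
}

row = record_to_row(sample)
print(row)
```

Fields that are null in the output, such as `author` in the sample, come through as `None`, so the destination should accept empty cells.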
The crawler handles both static HTML sites and JavaScript-rendered pages using headless browsers. It works on documentation sites, blogs, knowledge bases, and most public websites.
The output integrates directly with LangChain, LlamaIndex, Pinecone, and OpenAI assistants. Extract text to feed RAG pipelines or train custom chatbots.
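RAG pipelines typically split long page text into overlapping chunks before embedding it. The sketch below is a plain-Python illustration of that step, not the API of LangChain, LlamaIndex, or Pinecone, which ship their own splitters. The function name `chunk_text` and the default sizes are assumptions chosen for the example.

```python
def chunk_text(text: str, max_chars: int = 1000, overlap: int = 100) -> list[str]:
    """Split text into chunks of at most max_chars, each sharing `overlap` chars with the previous one."""
    if max_chars <= 0 or not 0 <= overlap < max_chars:
        raise ValueError("need max_chars > 0 and 0 <= overlap < max_chars")
    step = max_chars - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + max_chars])
        if start + max_chars >= len(text):
            break  # the last chunk already reaches the end of the text
    return chunks


# Pair each chunk with its source URL so answers can cite the original page.
page = {"url": "https://docs.apify.com/academy", "text": "abcdefghij"}
documents = [
    {"source": page["url"], "content": chunk}
    for chunk in chunk_text(page["text"], max_chars=4, overlap=1)
]
print(documents)
```

Character-based splitting is the simplest option; splitting on the Markdown headings in the `markdown` field tends to keep related sentences together, at the cost of uneven chunk sizes.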
Select the headless browser crawler type to render dynamic content. The adaptive mode automatically switches between raw HTTP and browser based on page complexity.
To download PDF, DOC, DOCX, XLS, and XLSX files linked from crawled pages, enable the Save files option.
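The crawler type and Save files options can also be set in the Actor's JSON input rather than through the UI. The sketch below assumes the input field names `startUrls`, `crawlerType`, `saveMarkdown`, and `saveFiles`, and the values `playwright:adaptive` and `cheerio`, as used by Apify's Website Content Crawler. This page does not name the Actor, so treat every key and value as an assumption to verify against the Actor's input schema.

```python
import json


def build_crawler_input(start_url: str, render_javascript: bool = True, save_files: bool = False) -> dict:
    """Assemble an Actor input dict; all key names here are assumed, not confirmed by this page."""
    return {
        "startUrls": [{"url": start_url}],
        # Adaptive mode switches between raw HTTP and a browser per page;
        # the plain HTTP crawler is cheaper for static sites.
        "crawlerType": "playwright:adaptive" if render_javascript else "cheerio",
        "saveMarkdown": True,
        "saveFiles": save_files,  # download linked PDF, DOC, DOCX, XLS, XLSX files
    }


crawler_input = build_crawler_input("https://docs.apify.com/academy", save_files=True)
print(json.dumps(crawler_input, indent=2))
```

The resulting JSON can be pasted into the input field of Make's "Run an Actor" module or into the JSON input editor in Apify Console.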