Cloudflare Web Scraper avatar

Cloudflare Web Scraper

Try for free

1 day trial then $20.00/month - No credit card required now

Go to Store
Cloudflare Web Scraper

Cloudflare Web Scraper

ecomscrape/cloudflare-web-scraper
Try for free

1 day trial then $20.00/month - No credit card required now

Cloudflare Web Scraper extracts data from Cloudflare-protected websites. You can customize parameters such as proxies, timeouts, and JavaScript execution, making it ideal for reports, spreadsheets, and applications.

What does Cloudflare web Scraper do?

Cloudflare web Scraper allows you to extract data from the web page protected by Cloudflare. It gives you detailed web page data in structured formats like JSON, which you can use in your own reports, spreadsheets, and applications. Cloudflare web Scraper allows you to collect web page protected by Cloudflare, including:

  • URL
  • HTML
  • Result from JS Script

Input & Output

To start collecting Idealo.de product data, simply fill out the input form. Cloudflare web Scraper recognizes the following input parameters:

  • URLs - Links to product information pages, seller product listing pages, or product query pages via search, categories, etc. You can paste URLs one at a time or use the Bulk Edit section to add a prepared list.
  • js_script - JavaScript code to execute on the web page, You can use it to manipulate web pages or get js command results after being run.
  • js_timeout - Maximum time (in seconds) to wait for the JavaScript execution to complete.
  • retrieve_result_from_js_script - Determines whether to fetch the result of the executed JavaScript.
  • page_is_loaded_before_running_script - Specifies whether to wait for the page to fully load before running the JavaScript.
  • execute_js_async" - Indicates if the JavaScript should be executed asynchronously.
  • retrieve_html_from_url_after_loaded - Determines whether to retrieve the page's HTML after it has fully loaded.
  • Limit the number of retries - Maximum number of retries when collecting data when an unexpected error occurs.
  • Timeout on page load - Maximum waiting time when accessing the links you provided.
  • Proxy configuration - Add a proxy to ensure that during the data collection process, you are not detected as a bot.

Collect data from web page protected by Cloudflare

Example url: https://gitlab.com

Example Screenshot of product information page:

Input:

1{
2  "max_retries": 2, // Maximum waiting time when accessing the links you provided.
3  "proxy": { // Add a proxy to ensure that during the data collection process, you are not detected as a bot.
4    "useApifyProxy": true,
5    "apifyProxyGroups": [
6      "RESIDENTIAL" 
7    ],
8    "apifyProxyCountry": "SG" // You should choose an Country that coincides with the Country you want to collect data from
9  },
10  "timeout": 5,
11  "urls": [ // Links to web pages.
12    "https://gitlab.com"
13  ],
14  "js_script": "return 10 + 10 + 20", // JS script you want to run
15  "js_timeout": 10,
16  "retrieve_result_from_js_script": true, // Retrieve result from JS script
17  "page_is_loaded_before_running_script": true, // Page is loaded before running script
18  "execute_js_async": false, // Execute JS async
19  "retrieve_html_from_url_after_loaded": true, // Retrieve page HTML from url after loaded
20}

Output:

You get the output from the Idealo.de product scraper stored in a tab. The following is an example of the Information Fields collected after running the Actor.

1[ // List of product information
2  {
3    "url": "https://about.gitlab.com/",
4    "result_from_js_script": 40,
5    "html": "<!DOCTYPE html>...</html>" // HTML from web page
6  }, // ... Many other product details
7]

How can I use the data extracted from web page with Cloudflare web Scraper?

1💙 Bypass cloudflare's protection.
2⭐ Execute custom js script

Your feedback

We are always working to improve Actors' performance. So, if you have any technical feedback about Cloudflare web Scraper or simply found a bug, please create an issue on the Actor's Issues tab in Apify Console.

Developer
Maintained by Community

Actor Metrics

  • 2 monthly users

  • 0 No stars yet

  • >99% runs succeeded

  • Created in Jan 2025

  • Modified a day ago