Proxy Scraper avatar
Proxy Scraper
Try for free

No credit card required

View all Actors
Proxy Scraper

Proxy Scraper

mstephen190/proxy-scraper
Try for free

No credit card required

Free proxy scraper and checker. Search dozens of free proxy websites. Get list of 100% working public proxies in seconds. Automatically test proxies based on target URL and maximum timeout.

What is Proxy Scraper?

This online proxy scraper searches a number of different free-proxy websites and scrapes all of them. Then, optionally, it can test each of those proxies for you on a specified target and timeout, then only return back the proxies which pass the test.

Why use Proxy Scraper?

There are lots of free public proxies available out there, such as Geonode and free-proxy-list.net. All proxies listed on these sites vary in terms quality, speed, and security. Some of them are painfully slow, while others don't work at all.

This proxy scraper tool is the best way to quickly obtain a list of working public proxies. It uses multithreading to run its automated tests in true concurrency while it scrapes multiple proxies simultaneously, and has a straightforward configuration process, fast runtimes, and reliable outputs.

Proxy Scraper performs two key tasks every time you run it:

  • It scrapes all currently available proxies from 17 different free proxy websites and APIs
  • It individually tests each proxy

This makes the process of retrieving data from free proxy websites much more accessible, as it removes the need to check each proxy manually.

Just need to scrape the web using proxies?

If you only want to use proxy servers for web scraping, Apify Proxy gives you both residential and datacenter proxies, intelligent proxy rotation, and a large pool of IP addresses. Even better, if you run your web scrapers on the Apify platform, our industry-leading browser fingerprinting research will make your scraping bots indistinguishable from humans.

How to use Proxy Scraper

  1. Choose which proxies to use while scraping the proxies.
  2. Select 'true' or 'false' to choose whether or not to test the proxies and only return the ones that pass.
  3. Select 'true' or 'false' to choose whether or not to run the actor in debug mode.
  4. Choose the name of the key-value store in which to save a .txt of the proxies.
  5. Select your preferred 'build', 'timeout', and 'memory' options.
  6. Click Start.
  7. View and download your data in any of the available formats.

What are Proxy Scraper's limitations?

Currently, Proxy Scraper can find anywhere from 20-60 reliable proxies out of the 2,500 that it scrapes every run. In the future, Proxy Scraper will scrape proxies from more than just the 17 sources currently being used.

How much does it cost to use Proxy Scraper?

With an Apify Free plan, you can run the scraper 400 times (about 10 times a day to get a fresh list of working proxies). You get 4,000 runs with a Personal plan, and 40,000 runs with a Team plan. Check out Apify pricing to see which plan is best for you.

Input example

1{
2    "testProxies": true,
3    "testTimeout": 7,
4    "testTarget": "https://google.com",
5    "kvStoreName": "reliable-proxies-store",
6    "pushToKvStore": true,
7    "datasetName": "test-dataset"
8}

Output example

The output will be an array of objects looking like this:

1{
2  "host": "164.27.6.74", // string
3  "port": 8080, // number
4  "full": "164.27.6.74:8080" // string
5}

What's happening under the hood?

  1. All of the current proxies from these free proxy resources are added to the request queue (with certain filters applied):
  1. Each site is scraped, and the results are stored temporarily in a global state

  2. Each scraped proxy is tested by making a request to a target URL specified by you with a specific timeout set. If the request fails, the proxy is removed from the list.

  3. All duplicate results are cleaned from the list.

  4. The proxies are finally pushed to the dataset.

Developer
Maintained by Community
Actor metrics
  • 51 monthly users
  • 27.7% runs succeeded
  • 0.0 days response time
  • Created in Feb 2022
  • Modified 9 months ago