🌐 Proxy Scraper

Pricing

from $4.99 / 1,000 results

Rating: 0.0 (0)

Developer: API Empire (maintained by Community)

Actor stats: 0 bookmarked · 2 total users · 1 monthly active user · last modified 2 days ago

๐ŸŒ Proxy Scraper โ€” HTTP / HTTPS / SOCKS4 / SOCKS5

Stream fresh public proxies from dozens of open lists into a single, de-duplicated, country-tagged dataset. Each row is saved live as it is collected so partial results survive any crash.

⚠️ Legal notice: The actor consumes only publicly available, openly published proxy lists. It does not test or validate the proxies, and it does not store credentials. You are responsible for complying with the terms of service of any site you ultimately route through these proxies.


✨ Why choose this actor?

  • 🚀 30+ public sources aggregated in parallel: typically thousands of unique proxies per run, deduplicated on first sight.
  • 🛰️ Country + anonymity enrichment: proxifly metadata is merged first, then ip-api.com batch geolocation fills in the rest. Country filtering stops the enrichment early once enough matches are found, saving minutes on big runs.
  • 🛡️ Three-tier escalating proxy fallback: starts direct (free), escalates to Apify Datacenter and then Residential when upstream blocking is detected, and stays at the higher tier for the rest of the run.
  • 💾 Live row streaming: every proxy lands in the dataset the moment it is found; no end-of-run flush, no lost rows.
  • 📊 Per-protocol output views: switch between the All / HTTP / HTTPS / SOCKS4 / SOCKS5 tabs straight from the Output table.
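The "deduplicated on first sight" and live-streaming behaviour listed above can be pictured with a small generator. This is only a minimal sketch, not the actor's own code: the helper name `iter_unique_proxies` and the plain `ip:port` line format are assumptions made for illustration.

```python
from typing import Iterable, Iterator


def iter_unique_proxies(lines: Iterable[str], protocol: str) -> Iterator[dict]:
    """Yield one row per unseen ip:port, in arrival order (hypothetical helper)."""
    seen = set()
    for raw in lines:
        entry = raw.strip()
        host, sep, port = entry.rpartition(":")
        # Skip blanks, comments and anything that is not host:port.
        if not sep or not host or not port.isdigit():
            continue
        # A proxy that was already emitted is dropped the moment it reappears.
        if entry in seen:
            continue
        seen.add(entry)
        # Yielding immediately lets the caller store each row as it arrives.
        yield {"host": host, "port": port, "full": entry, "protocol": protocol}


sample = ["1.2.3.4:8080", "1.2.3.4:8080", "# comment", "5.6.7.8:3128\n"]
rows = list(iter_unique_proxies(sample, "http"))
print([row["full"] for row in rows])
```

Because rows are yielded one at a time, a caller could push each one to storage before the next source line is read, which is the property that lets partial results survive an interrupted run.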

📥 Input

{
  "protocol": "all",
  "maxItems": 100,
  "country": "US",
  "enableGeoEnrichment": true,
  "timeout": 15,
  "retries": 3,
  "proxyConfiguration": { "useApifyProxy": true }
}

| Field | Type | Description |
|---|---|---|
| protocol | string | all · http · https · socks4 · socks5 |
| maxItems | integer | Hard cap on dataset rows. 0 = unlimited |
| country | string | Optional ISO-2 code (e.g. US, GB, DE). Leave empty for any country |
| enableGeoEnrichment | boolean | Resolve missing countries via the ip-api.com batch endpoint |
| timeout | integer | Per-source HTTP timeout in seconds |
| retries | integer | Max attempts per upstream request |
| proxyConfiguration | object | Apify proxy used only as the datacenter fallback tier |
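Before starting a run from a script, it can help to sanity-check the input object against the field descriptions above. The following is a rough sketch under stated assumptions: `check_input` is a hypothetical helper, and the defaults it falls back to (`"all"`, `0`, empty country) are guesses rather than documented actor defaults.

```python
VALID_PROTOCOLS = {"all", "http", "https", "socks4", "socks5"}


def check_input(run_input: dict) -> list:
    """Return a list of problems found in an input object (empty list = looks fine)."""
    problems = []
    # Assumed default: "all" when the field is omitted.
    if run_input.get("protocol", "all") not in VALID_PROTOCOLS:
        problems.append("protocol must be one of " + ", ".join(sorted(VALID_PROTOCOLS)))
    # bool is a subclass of int in Python, so it is rejected explicitly.
    max_items = run_input.get("maxItems", 0)
    if not isinstance(max_items, int) or isinstance(max_items, bool) or max_items < 0:
        problems.append("maxItems must be an integer >= 0 (0 = unlimited)")
    # An empty country means "any country".
    country = run_input.get("country", "")
    if country and not (len(country) == 2 and country.isalpha()):
        problems.append("country must be a two-letter ISO code or empty")
    return problems


example = {"protocol": "all", "maxItems": 100, "country": "US"}
print(check_input(example))
print(check_input({"protocol": "ftp", "maxItems": -1, "country": "USA"}))
```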

📤 Output

Each dataset row matches the example below:

{
  "host": "138.199.35.215",
  "port": "9002",
  "full": "138.199.35.215:9002",
  "protocol": "http",
  "anonymity": "transparent",
  "country": "US"
}

| Field | Meaning |
|---|---|
| host | IPv4 address of the proxy |
| port | Listening port (string) |
| full | Convenience ip:port string for immediate use |
| protocol | http · https · socks4 · socks5 |
| anonymity | elite · anonymous · transparent · unknown |
| country | ISO-2 country code (e.g. US), or Unknown when not resolvable |
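As an illustration of how a row might be consumed downstream, the sketch below turns one dataset row into a proxy URL and into the `proxies` mapping shape used by the third-party `requests` library. Both helper names are hypothetical, and using `requests` at all is an assumption; SOCKS rows would additionally need its optional SOCKS support installed.

```python
def to_proxy_url(row: dict) -> str:
    """Build a scheme://host:port URL from one dataset row."""
    return f"{row['protocol']}://{row['host']}:{row['port']}"


def to_requests_proxies(row: dict) -> dict:
    """Map a row to the proxies dict shape accepted by requests (route both schemes)."""
    url = to_proxy_url(row)
    return {"http": url, "https": url}


row = {
    "host": "138.199.35.215",
    "port": "9002",
    "full": "138.199.35.215:9002",
    "protocol": "http",
    "anonymity": "transparent",
    "country": "US",
}
print(to_proxy_url(row))  # http://138.199.35.215:9002
print(to_requests_proxies(row))
```

Since the actor does not test proxies, any such mapping should be followed by a validation request on the consumer's side before the proxy is relied on.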

🚀 How to use the Actor (via Apify Console)

  1. Log in at https://console.apify.com → Actors.
  2. Open this actor and click Try for free / Start.
  3. Set Protocol (or leave all), Country, and Max items.
  4. Optional: enable Datacenter / Residential proxy fallback for blocked upstreams in the Advanced section.
  5. Click Start; rows stream into the Output tab in real time.
  6. Export to JSON / CSV / Excel when the run finishes.

🧰 How the proxy fallback works

no proxy 🟢
│ (blocked → 401/403/407/429/451/5xx)
▼
🛡️ datacenter
│ (still blocked)
▼
🏠 residential ───── 3× exponential-backoff retries

The escalation is sticky: once the actor switches tiers, every subsequent request uses the higher tier, so a single blocked source cannot undo the switch. Every tier change is logged with 🚨.
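The sticky escalation can be modelled as a tiny state machine whose tier index only ever increases. The sketch below is an illustration under assumptions, not the actor's implementation: the class name `StickyTier` and the tier labels are made up, and the exponential-backoff retries on the residential tier are not modelled.

```python
TIERS = ("direct", "datacenter", "residential")
BLOCK_STATUSES = {401, 403, 407, 429, 451}


def is_blocked(status: int) -> bool:
    """True for the status codes the diagram treats as upstream blocking."""
    return status in BLOCK_STATUSES or 500 <= status <= 599


class StickyTier:
    """Tracks the current tier; it can only move up, never back down."""

    def __init__(self) -> None:
        self.index = 0

    @property
    def name(self) -> str:
        return TIERS[self.index]

    def report(self, status: int) -> str:
        """Record a response status and return the tier to use for the next request."""
        if is_blocked(status) and self.index < len(TIERS) - 1:
            self.index += 1
            print(f"tier change: escalating to {self.name} after HTTP {status}")
        return self.name


tier = StickyTier()
history = [tier.report(status) for status in (200, 403, 200, 503, 429)]
print(history)
```

Note that the successful `200` after the first escalation does not move the tier back to `direct`, which is what "sticky" means here.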


💰 Pricing (Pay-Per-Event)

This actor is wired for the Pay-Per-Event model. The synthetic apify-default-dataset-item event fires automatically for every row pushed via Actor.push_data, so the price-per-proxy is set entirely from the Monetization wizard in the Apify Console, and no manual Actor.charge() calls are needed.

ACTOR_MAX_TOTAL_CHARGE_USD is honoured by the platform: when a user's spend cap is reached the actor exits with User spending limit reached.
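To get a feel for how a spend cap translates into rows, here is a back-of-the-envelope calculation. It assumes a flat rate equal to the listed "from $4.99 / 1,000 results" price and nothing else billed; the real per-event price is whatever the maintainer configured, so treat the numbers as illustrative only.

```python
from decimal import Decimal

# Assumption: flat rate taken from the listed "from" price.
PRICE_PER_1000 = Decimal("4.99")


def cost_usd(rows: int) -> Decimal:
    """Estimated charge for a given number of dataset rows."""
    return PRICE_PER_1000 * rows / 1000


def rows_within_cap(cap_usd: str) -> int:
    """Largest whole number of rows whose estimated cost stays within the cap."""
    return int(Decimal(cap_usd) * 1000 // PRICE_PER_1000)


print(cost_usd(100))            # 0.499
print(rows_within_cap("1.00"))  # 200
```

`Decimal` is used instead of floats so that the division by 1,000 does not introduce rounding noise in the estimate.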


❓ Frequently asked questions

Are these proxies tested? No. The actor only aggregates and deduplicates the public lists. Validate them on your side before relying on them for any particular site.

Why is anonymity sometimes unknown? Most plain-text sources don't ship that information. The proxifly JSON feed and your own validation step are the two ways to fill it.

Why does the country filter trigger geo enrichment? Plain ip:port lists have no country metadata. The actor batch-resolves the missing IPs through ip-api.com (free tier, 45 req/min) and stops as soon as it has maxItems matches for your target country.
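The early-stop behaviour described in the last answer can be sketched as a batched lookup loop that returns as soon as enough matches are found. Everything here is hypothetical: `enrich_until_enough` and the injected `resolve_batch` callable stand in for the real lookup, the batch size is an arbitrary parameter, and request pacing for the rate limit is left out.

```python
def enrich_until_enough(ips, resolve_batch, target_country, max_items, batch_size=100):
    """Resolve IPs batch by batch; stop once max_items matches exist (0 = no limit)."""
    matches = []
    for start in range(0, len(ips), batch_size):
        batch = ips[start:start + batch_size]
        countries = resolve_batch(batch)  # one lookup call per batch
        for ip in batch:
            if countries.get(ip) == target_country:
                matches.append(ip)
                if max_items and len(matches) >= max_items:
                    return matches  # early stop: remaining batches are never resolved
    return matches


calls = []


def fake_resolver(batch):
    """Stand-in for a geo lookup service; records how often it is called."""
    calls.append(list(batch))
    table = {"1.1.1.1": "US", "2.2.2.2": "DE", "3.3.3.3": "US", "4.4.4.4": "US"}
    return {ip: table.get(ip, "Unknown") for ip in batch}


found = enrich_until_enough(
    ["1.1.1.1", "2.2.2.2", "3.3.3.3", "4.4.4.4"],
    fake_resolver,
    target_country="US",
    max_items=1,
    batch_size=2,
)
print(found, len(calls))
```

In this toy run the second batch is never sent to the resolver, which is where the time saving on large runs would come from.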


📬 Support and feedback

Open an issue on the actor's Apify page or contact the maintainer through the Console.