Soopage Scraper - Global Business Directory (148 Countries)
Pricing
Pay per event
Soopage Scraper - Global Business Directory (148 Countries)
Scrape business listings from Soopage.com, a global business directory covering 148 countries. Extract company names, phone numbers, fax, addresses, cities, and countries from country subdomains like jpn.soopage.com, cn.soopage.com, and more.
Pricing
Pay per event
Rating
0.0
(0)
Developer
BowTiedRaccoon
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
7 days ago
Last modified
Categories
Share
Soopage Global Business Directory Scraper
Scrape business listings from Soopage.com, a global business directory covering 148 countries. Returns company names, phone numbers, fax, addresses, cities, and country data from over 10 million listings across country subdomains like jpn.soopage.com, cn.soopage.com, and are.soopage.com.
Soopage Scraper Features
- Extracts structured company data via schema.org microdata — consistent fields across all 148 countries
- Filters by country subdomain: choose from Japan, China, UAE, India, and 144 more
- Filters by city: scrape Tokyo's 120K listings or Osaka's 55K without touching the rest
- Handles Cloudflare managed challenge transparently using residential proxy bypass
- Respects
maxItemsso test runs are fast and won't run up your bill
What Can You Do With Soopage Data?
- B2B sales teams — Build contact lists of local businesses in specific countries or cities for targeted outreach
- Market researchers — Map business density and distribution across regions by pulling city-level listing counts
- Data aggregators — Supplement thinner national business datasets with Soopage's global coverage
- Lead generation agencies — Batch-collect phone and fax numbers for international clients who need country-specific leads
How Soopage Scraper Works
- Reads the cities list from
{subdomain}.soopage.com/cities.htmlto discover all available city pages (or uses yourcityFilterdirectly). - Paginates through city listing pages — each page has 20 company links in the format
/city/{City}.html,/city/{City}-page2.html, and so on. - Fetches each company detail page and extracts schema.org microdata: name, phone, fax, address, city, country, and description.
- Stops when
maxItemsis reached.
Input
{"countrySubdomain": "jpn","cityFilter": "Tokyo","maxItems": 100,"proxyConfiguration": {"useApifyProxy": true,"apifyProxyGroups": ["RESIDENTIAL"]}}
| Field | Type | Default | Description |
|---|---|---|---|
countrySubdomain | string | "jpn" | Country subdomain to scrape (e.g. jpn for Japan, cn for China, are for UAE) |
cityFilter | string | "" | Optional city name filter (e.g. Tokyo, Osaka). Leave empty to scrape all cities |
maxItems | integer | 10 | Maximum number of listings to return |
proxyConfiguration | object | residential | Proxy settings. Residential proxy required for Cloudflare bypass on paginated pages |
Soopage Scraper Output Fields
{"name": "4 LIFE CITY CO., LTD.","phone": "03-3448-9555 (+81-03-3448-9555)","fax": "075-495-6612 (+81-075-495-6612)","address": "2-14-5 Kami-osaki, Shinagawa-ku, Tokyo","city": "Tokyo","country": "Japan","description": "4 LIFE CITY CO., LTD. is a company at Japan, Address is 2-14-5 Kami-osaki, Shinagawa-ku, Tokyo","listing_url": "https://jpn.soopage.com/company/4-LIFE-CITY-CO-LTD_5.html"}
| Field | Type | Description |
|---|---|---|
name | string | Company name |
phone | string | Telephone number, may include country code |
fax | string | Fax number if available |
address | string | Street address |
city | string | City or area |
country | string | Country name |
description | string | Auto-generated company description |
listing_url | string | URL to the Soopage company profile page |
FAQ
Which countries are supported?
All 148 country subdomains on Soopage.com. Common ones: jpn (Japan, ~394K listings), cn / chn (China), are (UAE), in (India), id (Indonesia), sgp (Singapore), kor (South Korea). Check soopage.com/countries.html for the full list.
Why do I need a residential proxy?
Soopage.com sits behind Cloudflare CDN. The first page of each city listing loads without challenge, but page 2 and beyond trigger a managed challenge. Residential proxies bypass this automatically — datacenter IPs get blocked on paginated pages.
How long does a full Japan crawl take?
Japan has ~394K listings across 49 cities. At 20 listings per page that is roughly 19,700 city-listing page requests plus 394K detail page requests. For targeted runs, use cityFilter to focus on a single city and set maxItems to a reasonable cap.
Does the data include email addresses or websites?
No. Soopage listings are thin by design. The available fields are name, phone, fax, address, city, country, and description. No email, no website, no categories. For richer profiles, consider the japanyello-scraper for Japan or the yello-network-scraper for other countries.
Need More Features?
Open an issue on the actor page or contact support with your use case.
Why Use Soopage Scraper?
- Zero competition — No other Apify actor covers Soopage.com's 148-country global dataset
- Scale — 10M+ listings across 148 countries from a single actor, configurable by country and city
- Clean extraction — schema.org microdata means structured fields, not brittle CSS selectors that break on site redesigns