Yellowpages India Scraper avatar

Yellowpages India Scraper

Pricing

$5.00/month + usage

Go to Apify Store
Yellowpages India Scraper

Yellowpages India Scraper

Yellow Pages India is an online business directory that helps users discover local businesses and services across cities in India. It provides company names, phone numbers, addresses, categories, ratings, and directions to connect customers with nearby service providers quickly and easily.

Pricing

$5.00/month + usage

Rating

5.0

(2)

Developer

ScrapeAI

ScrapeAI

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

πŸ“‡ Yellowpages India Scraper

A powerful Apify Actor that scrapes business listings from Yellowpages India. It performs HTML content tag-based data extraction in two stages:

  1. Search Page β€” extracts property cards and queues detail URLs
  2. Detail Page β€” extracts full description, amenities, and review data

βœ… Use Cases

  • πŸ” Scrape business listings (e.g., electricians, plumbers) in any city or area
  • πŸ“Š Aggregate contact details, addresses and service/service lists
  • 🏷️ Build local-business datasets for directories or lead generation
  • πŸ“‹ Export listings with structured contact and service fields

πŸ“₯ Input Configuration

{
"startUrls": [
{
"url": "http://yellowpages.in/hyderabad/electricians/610972586"
}
],
"maxItems": 50,
"scrollTimeout": 40,
"proxy": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
},
"debugMode": false
}

🧾 Fields Explained

FieldTypeDescription
startUrlsarrayYellowpages.in listing or search URLs. If provided, other parameters are ignored.
maxItemsintegerMaximum number of listing detail pages to scrape
scrollTimeoutintegerSeconds to scroll the page to load more results
proxyobjectProxy configuration
debugModebooleanSave debug screenshots and verbose logs

πŸ”— URL Structure

The scraper supports direct Yellowpages listing URLs:

http://yellowpages.in/hyderabad/electricians/610972586

No URL parameters required β€” provide full Yellowpages listing or search URL(s).


πŸ“€ Output

Each record saved to the dataset contains (exact output keys):

{
"name": "Bhavani Electricals",
"business_url": "https://www.yellowpages.in/b/bhavani-electricals-malkajgiri-hyderabad/133997041",
"image": "http://www.yellowpages.in/assets/business/t_default.png",
"reviews": "0 reviews",
"open_status": "Open - until 9:00PM",
"phone": "9440444721",
"area": "Malkajgiri Hyderabad - 500047",
"fulladdress": "D. No. 14-220, New Mirjalguda, Malkajgiri, Hyderabad - 500047, Telangana.",
"categories": [
"Building Materials",
"Cement Materials",
"Electrical Suppliers"
],
"business_timings": [
{ "day": "Sunday", "time": "9:00AM - 1:30PM", "status": "Closed" },
{ "day": "Monday", "time": "9:00AM - 9:00PM", "status": "Open" },
{ "day": "Tuesday", "time": "9:00AM - 9:00PM", "status": "Open" },
{ "day": "Wednesday", "time": "9:00AM - 9:00PM", "status": "Open" },
{ "day": "Thursday", "time": "9:00AM - 9:00PM", "status": "Open" },
{ "day": "Friday", "time": "9:00AM - 9:00PM", "status": "Open" },
{ "day": "Saturday", "time": "9:00AM - 9:00PM", "status": "Open" }
],
"directions": "http://www.google.com/maps/place/17.4491,78.5338",
"source": "https://www.yellowpages.in/hyderabad/..."
}

πŸ“‹ Output Schema

FieldTypeDescription
namestringBusiness / listing name
business_urlstringFull listing URL on Yellowpages.in
imagestringImage URL from the card
reviewsstringReviews count text (e.g. "0 reviews")
open_statusstringOpen/closed status text from the card
phonestringBusiness contact phone (if available)
areastringArea / address text
categoriesarrayBusiness categories / tags
directionsstringExternal maps/directions link
sourcestringSearch page or source URL where the listing was found

πŸ”’ Proxy Configuration

{
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}

Residential proxies are recommended to avoid Yellowpages.in anti-bot measures.


πŸ› οΈ Tech Stack

  • 🧩 Apify SDK β€” actor and dataset handling
  • πŸ•·οΈ Crawlee β€” crawling and request queue management
  • 🌐 Puppeteer β€” headless browser automation
  • βš™οΈ Node.js β€” fast, scalable backend