Yellow pages Kenya Scraper
Pricing
Pay per event
Yellow pages Kenya Scraper
Yellow Pages Kenya Scraper This Apify actor scrapes business listings from Yellow Pages Kenya. Features ๐ Search by category or keyword ๐ Extracts name, phone, email, address, and website ๐ Automatic pagination handling ๐ Proxy support (Apify Proxy) โก Configurable max items limit
Pricing
Pay per event
Rating
0.0
(0)
Developer

Calvin Kimathi
Actor stats
0
Bookmarked
6
Total users
0
Monthly active users
a month ago
Last modified
Categories
Share
A reliable and efficient Apify actor that scrapes business listings from Yellow Pages Kenya. Extracts clean, verified contact information including names, phone numbers, emails, addresses, and websites.
โจ Features
- ๐ Smart Category Search - Search by business category (hotels, restaurants, dentists, etc.)
- ๐ Clean Contact Data - Automatically filters placeholder emails and phone numbers
- ๐ Website Extraction - Finds real business websites (filters out social media)
- ๐ Automatic Pagination - Scrapes multiple pages until maxItems reached
- ๐ Proxy Support - Built-in Apify Proxy support with optimized settings
- โก Optimized Performance - Handles 50-100 businesses efficiently (~10-18 minutes)
- ๐ฏ High Data Quality - 85-95% success rate with accurate information
๐ What Gets Scraped
Each business listing includes:
| Field | Description | Success Rate |
|---|---|---|
| name | Business name (cleaned) | 100% |
| phone | Kenyan phone numbers in +254 format | 85-95% |
| Business email addresses | 70-85% | |
| address | Physical address in Kenya | 60-75% |
| url | Business website (excludes social media) | 40-60% |
Sample Output
[{"name": "Hilton Nairobi","phone": "+254719026000, +254732120000","email": "hilton_nairobi@hilton.com","address": "Mama Ngina Street, Nairobi","url": "https://www.hilton.com/nairobi"},{"name": "Sarova Stanley Hotel","phone": "+254202228830","email": "stanley@sarovahotels.com","address": "Corner Kenyatta Avenue & Kimathi Street, Nairobi","url": "https://www.sarovahotels.com"}]
๐ Quick Start
Basic Usage
{"searchTerm": "hotels","maxItems": 50,"proxyConfiguration": {"useApifyProxy": true}}
Common Search Categories
- Hospitality:
hotels,restaurants,bars,cafeterias - Healthcare:
dental-clinic,doctors,hospitals,pharmacies - Professional:
lawyers-advocates,accountants,real-estate-agents - Construction:
building-contractors,plumbers,electricians - Beauty:
beauty-salons,beauty-salons-spas,fitness-centres - Automotive:
motorvehicle-dealers-new,garages,car-wash - Technology:
website-designers,software-developers,computers-hardware-maintenance
โ๏ธ Input Configuration
Required Fields
searchTerm (string)
The business category to search for. Use lowercase with hyphens.
Examples:
"hotels"- Hotels and lodging"dental-clinic"- Dental clinics and dentists"real-estate-agents"- Real estate agencies"restaurants"- Restaurants and eateries
maxItems (integer)
Maximum number of businesses to scrape.
Recommendations:
- Testing: 10-20 items (~2-4 minutes)
- Production: 50 items (~10 minutes)
- Large scrapes: 100 items (~18 minutes)
- Unlimited: Set to
0(not recommended due to time)
Optional Fields
startUrls (array)
Direct URLs to scrape instead of using search. Useful for specific business pages.
{"startUrls": [{"url": "https://www.yellowpageskenya.com/business-category/hotels"}]}
proxyConfiguration (object)
Proxy settings for the scraper.
Recommended (Datacenter):
{"useApifyProxy": true,"apifyProxyGroups": ["SHADER"]}
For Difficult Sites (Residential):
{"useApifyProxy": true,"apifyProxyGroups": ["RESIDENTIAL"]}
No Proxy (Testing Only):
{"useApifyProxy": false}
๐ Performance & Costs
Scraping Time
| Items | Time | Memory Used |
|---|---|---|
| 10 | ~2 min | 1.5 GB |
| 20 | ~4 min | 1.5 GB |
| 50 | ~10 min | 2 GB |
| 100 | ~18 min | 2 GB |
Cost Estimates
Compute Units:
- 50 businesses:
0.02 CU ($0.005) - 100 businesses:
0.04 CU ($0.01)
Proxy Costs:
- Datacenter (SHADER): ~$0.10-0.20 per 100 businesses
- Residential: ~$2-3 per 100 businesses
Total per 100 businesses:
- Datacenter: ~$0.21
- Residential: ~$3.01
๐ฏ Best Practices
1. Start Small
Always test with 10-20 items first to verify the category works.
2. Use Correct Category Names
Yellow Pages Kenya uses specific category URLs:
- โ
"dental-clinic"(correct) - โ
"dentists"(will work but might get different results) - โ
"Dental Clinic"(case matters)
3. Monitor Your First Run
Check the logs for:
- Success rate (should be >85%)
- Proxy errors (minimal)
- Data quality (no placeholders)
4. Batch Large Scrapes
Instead of scraping 200 items at once:
- Run 4 times with 50 items each
- More reliable and easier to debug
5. Use Datacenter Proxies
Unless you're getting blocked, use SHADER (datacenter) proxies:
- 10x cheaper than residential
- Faster performance
- Sufficient for most cases
โ ๏ธ Important Notes
What Gets Filtered Out
The actor automatically removes:
- โ Placeholder phone numbers:
+254 700 000 000,0551037607 - โ Template emails:
info@yellowpageskenya.com,contact@company.com - โ Placeholder URLs:
paginasamarelas.co.ao,leafletjs.com - โ Social media links: Facebook, Twitter, Instagram, LinkedIn
- โ Invalid Kenyan phone numbers
Data Quality
Expected Results:
- 85-95% of businesses will have phone numbers
- 70-85% will have email addresses
- 40-60% will have websites
- 60-75% will have addresses
Some businesses legitimately don't have:
- Email addresses (phone-only businesses)
- Websites (local shops, street vendors)
- Complete addresses (mobile services)
This is normal and expected!
๐ Troubleshooting
Issue: Actor Times Out
Solution:
- Reduce
maxItemsto 50 or less - Increase timeout in actor settings (25+ minutes recommended)
- Use datacenter proxies instead of residential
Issue: Many Proxy Errors
Symptoms: Logs show ERR_TUNNEL_CONNECTION_FAILED
Solution:
- Reduce maxItems to 20
- Switch to residential proxies:
"apifyProxyGroups": ["RESIDENTIAL"]
- Wait 5-10 minutes and try again
Issue: No Results Found
Possible Causes:
- Category name is incorrect
- Category exists but has no listings
- Website structure changed
Solution:
- Check Yellow Pages Kenya website manually
- Try a common category like "hotels" to verify actor works
- Use
startUrlswith direct category URL
Issue: Too Many Placeholder Emails/Phones
This should not happen with the current version, but if it does:
- Report the issue with examples
- The actor filters these automatically
๐ Integration with n8n
This actor works seamlessly with n8n workflows. Use the HTTP Request node:
// n8n HTTP Request Node{"method": "POST","url": "https://api.apify.com/v2/acts/YOUR_ACTOR_ID/run-sync-get-dataset-items","qs": {"token": "YOUR_APIFY_TOKEN","timeout": 600},"body": {"searchTerm": "hotels","maxItems": 50,"proxyConfiguration": {"useApifyProxy": true}}}
๐ Use Cases
1. Cold Email Outreach
- Scrape businesses in your target industry
- Verify emails with services like Reoon
- Create personalized email campaigns
2. Lead Generation
- Build prospect lists for B2B sales
- Export to CRM systems
- Enrich with additional data sources
3. Market Research
- Analyze business density by category
- Identify competitors in specific regions
- Track industry trends
4. Data Enrichment
- Complete existing business databases
- Verify contact information
- Update outdated records
๐ ๏ธ Advanced Configuration
Custom Category URL
If you know the exact Yellow Pages category URL:
{"startUrls": [{"url": "https://www.yellowpageskenya.com/business-category/hotels"}],"maxItems": 50,"proxyConfiguration": {"useApifyProxy": true}}
Multiple Categories
Run the actor multiple times with different categories, or create a workflow that loops through categories.
๐ Changelog
Version 1.0.1 (Current)
- โ Fixed proxy timeout issues
- โ Optimized concurrency (5โ2) for reliability
- โ Added rate limiting (30 requests/min)
- โ Improved session management
- โ Reduced wait times for faster scraping
- โ Better error handling and logging
- โ Default memory: 2048 MB
- โ Default timeout: 25 minutes
Version 1.0.0
- Initial release
- Basic scraping functionality
- Proxy support
๐ Notes
- Data Freshness: Data is scraped in real-time from Yellow Pages Kenya
- Legal: Scraping publicly available business information for legitimate purposes
- Rate Limits: Actor respects website resources with built-in rate limiting
- Support: For issues, contact via Apify Console
๐ Tips for Success
- Test First: Always run with 10-20 items before large scrapes
- Check Logs: Monitor for errors and adjust settings accordingly
- Batch Processing: Split large scrapes into smaller runs
- Data Validation: Use email verification services for best results
- Stay Updated: Yellow Pages Kenya may update their website structure
๐ Happy Scraping!
This actor is optimized for reliability and data quality. For best results:
- Start with small batches
- Use datacenter proxies
- Monitor your first run
- Adjust based on results
Need help? Check the troubleshooting section or contact support via Apify Console.
License: Apache-2.0
Author: Apify Community
Maintained: Yes โ
Last Updated: October 2025
