Houzz Lead Scraper & Contact Enrichment
Pricing
Pay per usage
Houzz Lead Scraper & Contact Enrichment
Extract Houzz leads with this lightweight Python scraper. Get business names, websites, phone numbers, and social media. Features optional email enrichment by crawling business sites. Cost-efficient, fast, and ideal for B2B sales, architects, and contractor lead generation. Supports proxies.
Pricing
Pay per usage
Rating
5.0
(1)
Developer
Mohammed Yusuf
Maintained by CommunityActor stats
0
Bookmarked
4
Total users
3
Monthly active users
3 days ago
Last modified
Categories
Share
Houzz Lead Scraper & Contact Enrichment Actor
Lightweight Houzz scraper built with Python, Requests, BeautifulSoup, and the Apify SDK.
Extract Houzz business leads including company websites, social media profiles, ratings, reviews, project counts, and optional contact email enrichment from business websites.
Unlike heavy browser-based scrapers, this Actor uses a lightweight requests-based architecture designed to reduce runtime costs while maintaining high-quality lead extraction.
Ideal for:
- B2B lead generation
- Sales prospecting
- CRM enrichment
- Marketing agencies
- Architecture and remodeling outreach
- Interior design lead sourcing
- Contractor lead discovery
Features
Houzz Lead Extraction
Extract structured business lead data from Houzz professional listings:
- Company / professional name
- Business location
- Phone number
- Website URL
- Ratings
- Review counts
- Project counts
- Services provided
- Houzz profile URL
- Social media profiles
Supported social platforms:
- Twitter/X
Optional Contact Enrichment
Enable Contact Enrichment mode to visit discovered business websites and extract contact emails.
The Actor checks lightweight high-value pages only:
- Homepage
/contact/contact-us/about
Supported extraction methods:
mailto:links- Visible page text
- Common email obfuscations
- Cloudflare email protection decoding
False positives such as image filenames and static assets are filtered automatically.
Pricing
This Actor is monetized on a pay-per-result basis.
- Each run is priced per lead extracted, with enriched contact emails considered premium.
- As a reference, standard usage (Houzz data + websites + socials) is priced at $3.99 per 1,000 leads.
Note: Due to Apify limits, the maximum cost per run is currently $3.00. This typically allows 500–750 results per run.
⚠️ Actual cost may vary depending on:
- number of leads extracted
- whether contact enrichment is enabled
- optional proxy usage
- platform fees
Why This Actor?
Lightweight & Cost Efficient
Most Houzz scrapers rely on full browser automation, which significantly increases runtime costs.
This Actor uses:
- Requests
- BeautifulSoup
- Lightweight HTML parsing
Benefits:
- Lower runtime cost
- Faster execution
- Lower memory usage
- Better scalability for lead generation
- Configurable concurrent email enrichment for faster large-scale runs
Input
| Field | Type | Default | Description |
|---|---|---|---|
startUrl | string | NYC architects example | Houzz search/results URL to start scraping from |
maxResults | integer | 10 | Maximum profiles to push to the dataset |
maxPages | integer | 3 | Maximum paginated search pages to inspect |
extractEmails | boolean | false | Enable Contact Enrichment mode |
proxyConfiguration | object | No proxy | Optional Apify proxy configuration |
Example Input
{"startUrl": "https://www.houzz.com/professionals/architect/new-york-city-ny-us-probr0-bo~t_11784~r_5128581","maxResults": 25,"maxPages": 2,"extractEmails": true,"proxyConfiguration": {"useApifyProxy": false}}
Proxy Support
Proxy usage is optional.
Small scraping jobs can run efficiently without proxy.
For larger scraping runs, users can enable:
- Residential proxies
- Datacenter proxies
- Own proxies
through the Apify proxy configuration UI.
The Actor uses official Apify proxy integration and automatically applies configured proxies to:
- Houzz requests
- External website enrichment requests
Proxy Usage Notes
We have tested this Actor extensively using default settings without enabling a proxy, and it was able to extract hundreds of leads with enrichment successfully.
⚠️ Important: Your experience may vary depending on:
- network location
- Houzz rate-limiting
- size of your scraping run
- email enrichment enabled
For larger runs, or if you encounter blocks or slowdowns, enabling Apify Proxy (Datacenter or Residential) is recommended. The Actor fully supports optional proxy configuration via the input UI.
Output
Each dataset item contains:
{"name": "Example Studio","location": "New York, NY","phone": "+1 555 123 4567","website": "https://example.com","rating": 5.0,"review_count": 12,"project_count": 8,"services": "Architecture, Interior Design","emails": ["hello@example.com"],"profile_url": "https://www.houzz.com/professionals/...","socials": {"linkedin": null,"instagram": "https://www.instagram.com/example","facebook": null,"twitter": null}}
Fields may be empty or null if data is unavailable.
Contact Enrichment Mode
When extractEmails is enabled, the Actor performs lightweight website enrichment to discover business contact emails.
This mode increases runtime because additional external websites are visited.
Recommended for:
- Outreach campaigns
- Lead generation
- CRM enrichment
- Sales prospecting
Local Development
Install dependencies
python -m venv venv.\venv\Scripts\Activate.ps1pip install -r requirements.txt
Run locally
python main.py
Local environment variables
$env:HOUZZ_START_URL="https://www.houzz.com/professionals/architect/new-york-city-ny-us-probr0-bo~t_11784~r_5128581"$env:MAX_PAGES="3"$env:MAX_RESULTS="10"$env:EXTRACT_EMAILS="true"python main.py
Local runs export results to:
houzz_results.json
Apify platform runs push results to the default dataset.
Recommended Use Cases
- Houzz lead generation
- Architect lead scraping
- Interior designer lead discovery
- Remodeling contractor prospecting
- Agency outreach
- CRM enrichment
- Market research
- Competitor research
Performance Notes
- Lightweight BeautifulSoup architecture
- Lower resource usage than browser-based scrapers
- Contact Enrichment mode increases runtime
- Some external websites may block scraping at scale
- JavaScript-rendered emails may not always be accessible through requests-based extraction
Scaling Recommendations
Small Runs
- Run without proxy
- Faster and cheaper
Larger Runs
Enable:
- Residential proxy
- Datacenter proxy
through the Apify UI for improved reliability.
Disclaimer
This Actor is intended for lawful data extraction and business research workflows. Users are responsible for complying with applicable website terms and local regulations.