Houzz Lead Scraper & Contact Enrichment avatar

Houzz Lead Scraper & Contact Enrichment

Pricing

Pay per usage

Go to Apify Store
Houzz Lead Scraper & Contact Enrichment

Houzz Lead Scraper & Contact Enrichment

Extract Houzz leads with this lightweight Python scraper. Get business names, websites, phone numbers, and social media. Features optional email enrichment by crawling business sites. Cost-efficient, fast, and ideal for B2B sales, architects, and contractor lead generation. Supports proxies.

Pricing

Pay per usage

Rating

5.0

(1)

Developer

Mohammed Yusuf

Mohammed Yusuf

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

3

Monthly active users

3 days ago

Last modified

Share

Houzz Lead Scraper & Contact Enrichment Actor

Lightweight Houzz scraper built with Python, Requests, BeautifulSoup, and the Apify SDK.

Extract Houzz business leads including company websites, social media profiles, ratings, reviews, project counts, and optional contact email enrichment from business websites.

Unlike heavy browser-based scrapers, this Actor uses a lightweight requests-based architecture designed to reduce runtime costs while maintaining high-quality lead extraction.

Ideal for:

  • B2B lead generation
  • Sales prospecting
  • CRM enrichment
  • Marketing agencies
  • Architecture and remodeling outreach
  • Interior design lead sourcing
  • Contractor lead discovery

Features

Houzz Lead Extraction

Extract structured business lead data from Houzz professional listings:

  • Company / professional name
  • Business location
  • Phone number
  • Website URL
  • Ratings
  • Review counts
  • Project counts
  • Services provided
  • Houzz profile URL
  • Social media profiles

Supported social platforms:

  • LinkedIn
  • Instagram
  • Facebook
  • Twitter/X

Optional Contact Enrichment

Enable Contact Enrichment mode to visit discovered business websites and extract contact emails.

The Actor checks lightweight high-value pages only:

  • Homepage
  • /contact
  • /contact-us
  • /about

Supported extraction methods:

  • mailto: links
  • Visible page text
  • Common email obfuscations
  • Cloudflare email protection decoding

False positives such as image filenames and static assets are filtered automatically.

Pricing

This Actor is monetized on a pay-per-result basis.

  • Each run is priced per lead extracted, with enriched contact emails considered premium.
  • As a reference, standard usage (Houzz data + websites + socials) is priced at $3.99 per 1,000 leads.
    Note: Due to Apify limits, the maximum cost per run is currently $3.00. This typically allows 500–750 results per run.

⚠️ Actual cost may vary depending on:

  • number of leads extracted
  • whether contact enrichment is enabled
  • optional proxy usage
  • platform fees

Why This Actor?

Lightweight & Cost Efficient

Most Houzz scrapers rely on full browser automation, which significantly increases runtime costs.

This Actor uses:

  • Requests
  • BeautifulSoup
  • Lightweight HTML parsing

Benefits:

  • Lower runtime cost
  • Faster execution
  • Lower memory usage
  • Better scalability for lead generation
  • Configurable concurrent email enrichment for faster large-scale runs

Input

FieldTypeDefaultDescription
startUrlstringNYC architects exampleHouzz search/results URL to start scraping from
maxResultsinteger10Maximum profiles to push to the dataset
maxPagesinteger3Maximum paginated search pages to inspect
extractEmailsbooleanfalseEnable Contact Enrichment mode
proxyConfigurationobjectNo proxyOptional Apify proxy configuration

Example Input

{
"startUrl": "https://www.houzz.com/professionals/architect/new-york-city-ny-us-probr0-bo~t_11784~r_5128581",
"maxResults": 25,
"maxPages": 2,
"extractEmails": true,
"proxyConfiguration": {
"useApifyProxy": false
}
}

Proxy Support

Proxy usage is optional.

Small scraping jobs can run efficiently without proxy.

For larger scraping runs, users can enable:

  • Residential proxies
  • Datacenter proxies
  • Own proxies

through the Apify proxy configuration UI.

The Actor uses official Apify proxy integration and automatically applies configured proxies to:

  • Houzz requests
  • External website enrichment requests

Proxy Usage Notes

We have tested this Actor extensively using default settings without enabling a proxy, and it was able to extract hundreds of leads with enrichment successfully.

⚠️ Important: Your experience may vary depending on:

  • network location
  • Houzz rate-limiting
  • size of your scraping run
  • email enrichment enabled

For larger runs, or if you encounter blocks or slowdowns, enabling Apify Proxy (Datacenter or Residential) is recommended. The Actor fully supports optional proxy configuration via the input UI.


Output

Each dataset item contains:

{
"name": "Example Studio",
"location": "New York, NY",
"phone": "+1 555 123 4567",
"website": "https://example.com",
"rating": 5.0,
"review_count": 12,
"project_count": 8,
"services": "Architecture, Interior Design",
"emails": ["hello@example.com"],
"profile_url": "https://www.houzz.com/professionals/...",
"socials": {
"linkedin": null,
"instagram": "https://www.instagram.com/example",
"facebook": null,
"twitter": null
}
}

Fields may be empty or null if data is unavailable.


Contact Enrichment Mode

When extractEmails is enabled, the Actor performs lightweight website enrichment to discover business contact emails.

This mode increases runtime because additional external websites are visited.

Recommended for:

  • Outreach campaigns
  • Lead generation
  • CRM enrichment
  • Sales prospecting

Local Development

Install dependencies

python -m venv venv
.\venv\Scripts\Activate.ps1
pip install -r requirements.txt

Run locally

python main.py

Local environment variables

$env:HOUZZ_START_URL="https://www.houzz.com/professionals/architect/new-york-city-ny-us-probr0-bo~t_11784~r_5128581"
$env:MAX_PAGES="3"
$env:MAX_RESULTS="10"
$env:EXTRACT_EMAILS="true"
python main.py

Local runs export results to:

houzz_results.json

Apify platform runs push results to the default dataset.


Recommended Use Cases

  • Houzz lead generation
  • Architect lead scraping
  • Interior designer lead discovery
  • Remodeling contractor prospecting
  • Agency outreach
  • CRM enrichment
  • Market research
  • Competitor research

Performance Notes

  • Lightweight BeautifulSoup architecture
  • Lower resource usage than browser-based scrapers
  • Contact Enrichment mode increases runtime
  • Some external websites may block scraping at scale
  • JavaScript-rendered emails may not always be accessible through requests-based extraction

Scaling Recommendations

Small Runs

  • Run without proxy
  • Faster and cheaper

Larger Runs

Enable:

  • Residential proxy
  • Datacenter proxy

through the Apify UI for improved reliability.


Disclaimer

This Actor is intended for lawful data extraction and business research workflows. Users are responsible for complying with applicable website terms and local regulations.