Pricing

$2.30 / 1,000 runs

Go to Store

LeadScraper

Try for free

Developed by

Claire Dubiel

Scrape a list of urls and receive business contact information, social media links, and a description of the services. This actor will scrape across multiple pages in the sitemap and returns a confidence score to every phone number and email that it finds. webscraper, scrape leads, web scraper

0.0 (0)

Pricing

$2.30 / 1,000 runs

Last modified

3 months ago

Lead generation

Automation

Service Company Website Scraper

An Apify actor that scrapes service company websites and extracts structured information about the business, including contact information, services offered, hours of operation, and more.

Features

Extracts company name, description, and contact information
Identifies services offered by the company
Extracts business hours, social media links, and reviews
Finds pricing information and FAQs
Handles multiple URLs in a single run
Supports SSL verification options
Optional Cloudflare bypass capability

Input

The actor accepts the following input parameters:

urls - An array of service company website URLs to scrape (required)
verifySSL - Whether to verify SSL certificates (default: true)
bypassCloudflare - Whether to attempt to bypass Cloudflare protection (default: true)
metadata - Optional custom metadata to include with each result

Example input:

{
  "urls": [
    "https://www.example1.com/",
    "https://www.example2.com/"
  ],
  "verifySSL": true,
  "bypassCloudflare": true,
  "metadata": {
    "project_id": "example-project",
    "source": "manual",
    "category": "roofing"
  }
}

Output

The actor outputs a JSON object for each URL containing the following information:

url - The URL of the scraped website
title - The title of the website
meta_description - The meta description of the website
main_content - The main content of the website
contact_information - Contact information extracted from the website
- phones - List of phone numbers with confidence scores
- main_phone - The main phone number with highest confidence
- emails - List of email addresses with confidence scores
- main_email - The main email address with highest confidence
- address - The physical address of the business
services - List of services offered by the company
hours_of_operation - Business hours by day of the week
social_media_links - Links to social media profiles
reviews - Customer reviews found on the website
pricing - Pricing information for services
faqs - Frequently asked questions
success - Whether the scraping was successful
error - Error message if scraping failed

Example Usage

const Apify = require('apify');

Apify.main(async () => {
    const input = {
        urls: [
            "https://www.example1.com/",
            "https://www.example2.com/"
        ],
        verifySSL: true,
        bypassCloudflare: true,
        metadata: {
            project_id: "example-project",
            source: "manual",
            category: "roofing"
        }
    };
    
    // Run the actor and wait for it to finish
    const run = await Apify.call('your-username/service-company-scraper', input);
    
    // Print the results
    const dataset = await Apify.openDataset(run.defaultDatasetId);
    const { items } = await dataset.getData();
    console.log('Results:', items);
});

## Development

### Project Structure

- `main.py` - Entry point for the Apify actor
- `scraper.py` - Contains the `ServiceCompanyScraper` class
- `requirements.txt` - Python dependencies
- `INPUT_SCHEMA.json` - Input schema for the Apify actor
- `OUTPUT_SCHEMA.json` - Output schema for the Apify actor
- `Dockerfile` - Docker configuration for the Apify actor

### Adding New Features

To add new extraction capabilities:

1. Add a new method to the `ServiceCompanyScraper` class in `scraper.py`
2. Call the method from the `scrape` method
3. Update the output schema if necessary

On this page

Service Company Website Scraper
- Features
- Input
- Output
- Example Usage

Share Actor:

Contact Info Scraper: Pay Per Result

delicious_zebu/contact-info-scraper-pay-per-result

Effortlessly scrape contact information, including emails, phone numbers, and social media links like Twitter, YouTube, Facebook, Instagram, TikTok, and LinkedIn, from any website URL.

ВAH

257

Extract Contact Details from Any Website – Email, Phone, Social

creative_tablecloth/extract-email-phone-social-media-from-any-website

Discover our powerful scraper that effortlessly extracts emails, phone numbers, and social media links from any website. Ideal for marketers and businesses seeking to enhance their contact database quickly and efficiently.

Jinny Kim

1.8K

3.0

Dubicars Scraper

real_spidery/dubicars-scraper

Fast and lightweight DubiCars.com scraper allows you to deep dive in the the UAE’s fastest-growing online car market for buyers and sellers. Export scraped data, run the scraper via API, schedule and monitor runs or integrate with other tools

Real Spidery

Zillow School Scraper

axlymxp/zillow-school-scraper

Scrapes school data from Zillow's mobile API within specified geographic boundaries. Returns school name, address, rating, type (public/private/charter), level (elementary/middle/high), coordinates and other details. Allows filtering by school rating, level and type.

axly

Smartcontext AI Web Crawler

bluelightco/smartcontext-ai-crawler

Scrape any website and extract structured data using AI-powered instructions. Provide URLs and a natural language prompt to get tailored JSON outputs.

Bluelight

5.0

Local Business Lead Generator

james.logantech/local-business-lead-generator

Extracts details about businesses from Google Local Services, such as ratings, reviews, years in business, phone numbers, and websites. The scraper is customizable to target different business types and locations.

James

1.6

Sitemap URL Extractor

onescales/sitemap-url-extractor

Provide a link to a sitemap.xml and the app will extract and list all URLs in the sitemap as well as additional data in the sitemap (i.e. https://onescales.com/sitemap.xml).

One Scales

5.0

Email scraper pro

scrapingxpert/email-scraper-pro

The Email Scraper Pro is a powerful tool designed to extract email addresses and social media links from websites It uses advanced web scraping techniques to crawl through web pages, identify social media profiles. This tool is ideal for lead generation, contact harvesting, and business intelligence

scrapingxpert

Hipages Lead Scraper

parsedom/hipages-lead-scraper

A lead scraper for extracting business details from Hipages, including names, contact information, ratings, and more. It supports pagination and proxy configuration, making it suitable for lead generation and market research.

Parsedom

Extract Contacts

maged120/extract-contacts

his Apify Actor extracts contact information from specified web pages, including email addresses, phone numbers, social media profiles, and contact-related links.