Contact Info Scraper avatar
Contact Info Scraper

Deprecated

Pricing

$20.00/month + usage

Go to Store
Contact Info Scraper

Contact Info Scraper

Deprecated

Developed by

Onidivo Technologies

Onidivo Technologies

Maintained by Community

Extract contact information from a list of websites.

0.0 (0)

Pricing

$20.00/month + usage

6

Total users

204

Monthly users

15

Runs succeeded

>99%

Last modified

8 months ago

Features

Get the following contact details:

Input Configuration

The actor offers several input options to let you specify which pages will be crawled:

  • Start URLs - Lets you add a list of URLs of web pages where the scraper should start. You can enter multiple URLs, upload a text file with URLs, or even use a Google Sheets document.
  • Maximum link depth - Specifies how deep the actor will scrape links from the web pages specified in the Start URLs. If zero, the actor ignores the links and only crawls the Start URLs.
  • Stay within domain - If enabled, the actor will only follow links that are on the same domain as the referring page. For example, if the setting is enabled and the actor finds a link on http://www.example.com/some-page to http://www.another-domain.com/, it will not crawl the second page, because www.example.com is not the same as www.another-domain.com.

The actor also accepts additional input options that let you specify proxy servers, limit the number of pages, etc.

Results

The actor stores its results into the default dataset associated with the actor run. You can then download the results in formats such as JSON, HTML, CSV, XML, or Excel. For each page crawled, the following contact information is extracted (examples shown):

  • Emails
    noone@example.com
    no.one@example.com
    no+one@example.co.in
  • Phone numbers - These are extracted from phone links in HTML (e.g. <a href='tel://123456789'>phone</a>).
    123456789
    +123456789
    00123456789
  • Uncertain phone numbers - These are extracted from the plain text of the web page using a number of regular expressions. Note that this approach can generate false positives.
    +123.456.7890
    123456789
    123-456-789
  • LinkedIn profiles
    https://www.linkedin.com/in/mercedes-benz-group-ag
    en.linkedin.com/in/mercedes-benz-group-ag
    linkedin.com/in/mercedes-benz-group-ag
  • Twitter profiles
    https://www.twitter.com/mercedesbenz
    twitter.com/mercedesbenz
  • Instagram profiles
    https://www.instagram.com/mercedesbenz_careers
    www.instagram.com/mercedesbenz_careers/
    instagr.am/mercedesbenz_careers
  • Facebook profiles or pages
    https://www.facebook.com/mercedesbenzcareers
    facebook.com/mercedesbenzcareers
    fb.com/mercedesbenzcareers
    https://www.facebook.com/profile.php?id=99999000

The results also contain information about the URL of the web page, domain, and referring URL (if the page was linked from another page), and depth (how many links away from Start URLs the page was found).

For each page crawled, the resulting dataset contains a single record, which looks like this (in JSON format):

{
"url": "https://group.mercedes-benz.com/investors/services/contact/",
"domain": "mercedes-benz.com",
"requestUrl": "https://group.mercedes-benz.com/investors/services/contact/",
"depth": 2,
"referrerUrl": null,
"startUrl": "http://group.mercedes-benz.com/en/",
"emails": [
"alexander.jasperneite@mercedes-benz.com",
"andreas.kusche@mercedes-benz.com",
"christian.ck.keller@mercedes-benz.com",
"dialog@mercedes-benz.com",
"ellen_christin.haehnlein@mercedes-benz.com",
"investorportal@computershare.de",
"ir.mbg@mercedes-benz.com",
"na.rothenberg@mercedes-benz.com",
"patrick.odermatt@mercedes-benz.com"
],
"phones": [
"+4989309036376"
],
"phonesUncertain": [
"+49 89 30903",
"0800 324 1111",
"32 12 81 763",
"711 17 94075"
],
"linkedIns": [
"https://www.linkedin.com/company/mercedes-benz-group-ag"
],
"twitters": [
"https://twitter.com/mercedesbenz"
],
"instagrams": [
"https://www.instagram.com/mercedesbenz_careers"
],
"facebooks": [
"https://www.facebook.com/mercedesbenzcareers"
],
"youtubes": [
"https://www.youtube.com/user/mercedesbenztv"
],
"tiktoks": [
"https://www.tiktok.com/@mercedesbenz"
],
"pinterests": [],
"discords": []
}