
Contact Info Scraper
Deprecated
Pricing
$20.00/month + usage

Contact Info Scraper
Deprecated
Extract contact information from a list of websites.
0.0 (0)
Pricing
$20.00/month + usage
6
Total users
204
Monthly users
15
Runs succeeded
>99%
Last modified
8 months ago
Features
Get the following contact details:
- Email addresses
- Phone numbers
- LinkedIn profiles
- Twitter handles
- Instagram profiles
- Facebook user profiles or pages
Input Configuration
The actor offers several input options to let you specify which pages will be crawled:
- Start URLs - Lets you add a list of URLs of web pages where the scraper should start. You can enter multiple URLs, upload a text file with URLs, or even use a Google Sheets document.
- Maximum link depth - Specifies how deep the actor will scrape links from the web pages specified in the Start URLs. If zero, the actor ignores the links and only crawls the Start URLs.
- Stay within domain - If enabled, the actor will only follow links that are on the same domain as the referring
page. For example, if the setting is enabled and the actor finds a link on http://www.example.com/some-page
to http://www.another-domain.com/, it will not crawl the second page, because
www.example.com
is not the same aswww.another-domain.com
.
The actor also accepts additional input options that let you specify proxy servers, limit the number of pages, etc.
Results
The actor stores its results into the default dataset associated with the actor run. You can then download the results in formats such as JSON, HTML, CSV, XML, or Excel. For each page crawled, the following contact information is extracted (examples shown):
- Emails
noone@example.comno.one@example.comno+one@example.co.in
- Phone numbers - These are extracted from phone links in HTML (e.g.
<a href='tel://123456789'>phone</a>
).123456789+12345678900123456789 - Uncertain phone numbers - These are extracted from the plain text of the web page using a number of regular
expressions. Note that this approach can generate false positives.
+123.456.7890123456789123-456-789
- LinkedIn profiles
https://www.linkedin.com/in/mercedes-benz-group-agen.linkedin.com/in/mercedes-benz-group-aglinkedin.com/in/mercedes-benz-group-ag
- Twitter profiles
https://www.twitter.com/mercedesbenztwitter.com/mercedesbenz
- Instagram profiles
https://www.instagram.com/mercedesbenz_careerswww.instagram.com/mercedesbenz_careers/instagr.am/mercedesbenz_careers
- Facebook profiles or pages
https://www.facebook.com/mercedesbenzcareersfacebook.com/mercedesbenzcareersfb.com/mercedesbenzcareershttps://www.facebook.com/profile.php?id=99999000
The results also contain information about the URL of the web page, domain, and referring URL (if the page was linked from another page), and depth (how many links away from Start URLs the page was found).
For each page crawled, the resulting dataset contains a single record, which looks like this (in JSON format):
{"url": "https://group.mercedes-benz.com/investors/services/contact/","domain": "mercedes-benz.com","requestUrl": "https://group.mercedes-benz.com/investors/services/contact/","depth": 2,"referrerUrl": null,"startUrl": "http://group.mercedes-benz.com/en/","emails": ["alexander.jasperneite@mercedes-benz.com","andreas.kusche@mercedes-benz.com","christian.ck.keller@mercedes-benz.com","dialog@mercedes-benz.com","ellen_christin.haehnlein@mercedes-benz.com","investorportal@computershare.de","ir.mbg@mercedes-benz.com","na.rothenberg@mercedes-benz.com","patrick.odermatt@mercedes-benz.com"],"phones": ["+4989309036376"],"phonesUncertain": ["+49 89 30903","0800 324 1111","32 12 81 763","711 17 94075"],"linkedIns": ["https://www.linkedin.com/company/mercedes-benz-group-ag"],"twitters": ["https://twitter.com/mercedesbenz"],"instagrams": ["https://www.instagram.com/mercedesbenz_careers"],"facebooks": ["https://www.facebook.com/mercedesbenzcareers"],"youtubes": ["https://www.youtube.com/user/mercedesbenztv"],"tiktoks": ["https://www.tiktok.com/@mercedesbenz"],"pinterests": [],"discords": []}