Email ✉️ & Phone ☎️ Extractor
7 days trial then $30.00/month - No credit card required now

anchor/email-phone-extractor
Extract emails, phone numbers, and other useful contact information from any list of websites you provide. Best tool for contact lead generation. Export data in structured formats and dominate your outreach game. Capture your leads almost for free, fast, and without limits.

Spotty Results

Closed

aware_cricket opened this issue
2 years ago

When scraping websites, several emails are missed. I have found a number of instances where an email address is listed on either the landing page or the contact page of a website, but the scraper doesn't find it.

What can I do to get better results?

guillim (anchor)

2 years ago

Can you send me:

  • a link to the webpage
  • a screenshot
  • the email that is missed

I will then have a look and improve the crawler!

Thanks 🙏

aware_cricket

2 years ago

Here are a few examples:

  • missiondistricttherapy.com - email found on the contact page: help@missiondistricttherapy.com
  • sfbluebuddha.com - there are five emails listed on the contact page, crawler did not scrape any of them
  • landsendphysicaltherapy.com - two emails listed on the contact page
  • https://www.bearrepublic.fit/ - email is on the landing page: info@bearrepubliccrossfit.com

guillim (anchor)

2 years ago

Thank you very much for the information provided. To investigate, I started with the first link I tried, "sfbluebuddha.com".

My experience: I ran it, and the crawler did find the 5 emails on the contact page, as you can see in this picture: [image: Screenshot 2022-11-03 at 17.11.12.png]

Note: emails are deduplicated, which explains why you only see 4 of them for the /contact page: "info@" appears there twice.
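To illustrate the deduplication note, here is a minimal Python sketch of how five listed addresses can turn into four results. It is not the Actor's actual code: the regular expression, the `unique_emails` helper, and the `example.com` addresses are all assumptions made up for this example.

```python
import re

# Simplified pattern for illustration only; it does not cover every valid address.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def unique_emails(text):
    """Return the emails found in text, keeping only the first occurrence of each."""
    seen = set()
    result = []
    for match in EMAIL_RE.findall(text):
        key = match.lower()  # treat Info@ and info@ as the same address
        if key not in seen:
            seen.add(key)
            result.append(match)
    return result


# Hypothetical page text: five listed addresses, one of them repeated.
page = """
info@example.com  booking@example.com  events@example.com
press@example.com  info@example.com
"""
emails = unique_emails(page)
print(len(emails), emails)
```

With this hypothetical input, the page lists five addresses but only four distinct ones are returned, which matches the situation described for the /contact page.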

My recommendation: just to make sure it is not related to the INPUT parameters, can you run it again on your side with these parameters:

    {
      "considerChildFrames": true,
      "maxDepth": 1,
      "maxRequests": 50,
      "onlyEmails": true,
      "onlyOneEmailPerDomain": false,
      "proxyConfig": { "useApifyProxy": true },
      "pseudoUrls": [ ".*" ],
      "sameDomain": true,
      "startUrls": [ { "url": "https://sfbluebuddha.com/contact/" } ]
    }
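If it helps to rerun the same settings against other pages, the input above could be generated from a list of URLs. This is only a sketch: `build_input` is a hypothetical helper written for this thread, and the field names are copied from the parameters suggested above, not from the Actor's full input schema.

```python
import json


def build_input(start_urls, max_requests=50):
    """Build the run input suggested in this thread for a list of start URLs."""
    return {
        "considerChildFrames": True,
        "maxDepth": 1,
        "maxRequests": max_requests,
        "onlyEmails": True,
        "onlyOneEmailPerDomain": False,
        "proxyConfig": {"useApifyProxy": True},
        "pseudoUrls": [".*"],
        "sameDomain": True,
        "startUrls": [{"url": url} for url in start_urls],
    }


run_input = build_input(["https://sfbluebuddha.com/contact/"])
# Print JSON that can be pasted into the Actor's input editor.
print(json.dumps(run_input, indent=2))
```

The printed JSON can be pasted into the Actor's input editor. Passing the same dictionary as the run input through an API client should also work, though that path is not covered in this thread.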

Thanks so much for your patience

Developer
Maintained by Community
Actor metrics
  • 232 monthly users
  • 99.7% runs succeeded
  • 1.2 days response time
  • Created in Oct 2021
  • Modified 19 days ago