Email ✉️  & Phone ☎️ Extractor avatar
Email ✉️ & Phone ☎️ Extractor
Try for free

7 days trial then $30.00/month - No credit card required now

View all Actors
Email ✉️  & Phone ☎️ Extractor

Email ✉️ & Phone ☎️ Extractor

anchor/email-phone-extractor
Try for free

7 days trial then $30.00/month - No credit card required now

Extract emails, phone numbers, and other useful contact information from any list of websites you provide. Best tool for contact lead generation. Export data in structured formats and dominate your outreach game. Capture your leads almost for free, fast, and without limits.

User avatar

Cant Skip After 1 Email Scraped

Closed

immaculate_wild opened this issue
2 years ago

Its wasting a massive amount of time on URLS where I have already got the email, and its just getting the same email over and over again, need a fix for this to use.

User avatar

guillim (anchor)

2 years ago

Ok. Can you send me the url you are trying to scrap ? I need this to understand the pb

User avatar

immaculate_wild

2 years ago

Hi!

a) This is an example of where it found several different emails on same domain, but it would be good to move to next domain once found 1 email:

hospitalitywifi.com info@hospitalitywifi.com sales@hospitalitywifi.com support@hospitalitywifi.com web@hospitalitywifi.com webmaster@hospitalitywifi.com

b) This is an example where its wasted lots of time scraping the same emails over and over again.

radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za aavenant@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za radiopulpit.co.za gospel@radiopulpit.co.za umifoods.co.uk radiopulpit.co.za gospel@radiopulpit.co.za umifoods.co.uk umifoods.co.uk sales@umifoods.co.uk radiopulpit.co.za gospel@radiopulpit.co.za

Kind Regards

User avatar

immaculate_wild

2 years ago

Hi, are you working on this soon? Another great feature would be to select if we want to scrape telephones, linkedin etc or just emails. This would also save a lot of time for people only searching for emails for example.

Kind Regards

User avatar

guillim (anchor)

2 years ago

Yes, I am working on it. Thanks for these suggestions, they are very helpful !

  • I already implemented a quick feature so that you can scrap "only emails" using a simple checkbox.
  • I have been thinking about your problem with "same email" that is a waste of time for the crawler. One very important thing is that the crawler tells every page it crawls, and this cannot change. So your pb is more specific because the website you are looking at has a footer in which the email is scrapped every time. I guess a solution could be to add some "html tag" that the crawler cannot scrap, but it would still go on every page and waste time anyway. I think You could look for the feature "pseudo-url" which are designed to prevent your crawler to go on deep part of a website that is pointless. For instance if the "blog part" is useless to you, simply create the pseudo-URL that will ban the crawler from going there.
  • about your suggestion "move to next domain once found 1 email" : it could be a solution yes, but your crawler might go in every direction though... and out of control lol. Would results be still expointable ?
User avatar

bilingual_pump

2 years ago

move to next domain once found 1 email - the other solutions I use have this feature, its particularly useful if stay in the domain, so it just that one domain for an email, then moves on

User avatar

guillim (anchor)

2 years ago

If I understand correctly you want to have a new option. This option would tell the crawler to stop crawling the domain as soon as one email is found right ?

So for instance I have in my starturl

Is that right ?

User avatar

bilingual_pump

2 years ago

Yes thats correct

User avatar

guillim (anchor)

2 years ago

Great! I will work on this today

User avatar

guillim (anchor)

2 years ago

I just released an update after last day of work on your option ! Enjoy !!!

Developer
Maintained by Community
Actor metrics
  • 228 monthly users
  • 99.7% runs succeeded
  • 1.2 days response time
  • Created in Oct 2021
  • Modified 18 days ago