Contact Details Scraper avatar
Contact Details Scraper
Try for free

Pay $3.00 for 1,000 results

View all Actors
Contact Details Scraper

Contact Details Scraper

vdrmota/contact-info-scraper
Try for free

Pay $3.00 for 1,000 results

Free email extractor to extract and download emails, phone numbers, Facebook, Twitter, LinkedIn, and Instagram profiles from any website. Extract contact information at scale from lists of URLs and download the data as Excel, CSV, JSON, HTML, and XML.

Do you want to learn more about this Actor?

Get a demo
KE

Is there a way to have this scraper look a little closer?

Closed

key.workflow opened this issue
3 months ago

Is there any reason this run didn't extract the phone number? https://www.hooksandtinestaxidermy.com/

The phone number is on this page twice, but wasn't extracted. Any help would be appreciated.

ondrejklinovsky avatar

Hi, thanks for the question. You'll find the number in phonesUncertain field in the output (click on "All fields" button to see it).

We store phone numbers in two fields: phones and phonesUncertain. Here's the explanation of the two from the crawlee's source code:

Note that the phones field contains phone numbers extracted from the special phone links such as [call us](tel:+1234556789) (see {@apilink phonesFromUrls}) and potentially other sources with high certainty, while phonesUncertain contains phone numbers extracted from the plain text, which might be very inaccurate.

KE

key.workflow

3 months ago

Ahhh got it, thanks.

KE

key.workflow

3 months ago

Actually one more thing. I ran another one and it failed to return the email on the contact page, it returned a generic godaddy.com email. Would I have to adjust the maxDepth for it to reach the contact page?

KE

key.workflow

3 months ago

It failed to return an email and phone number from this website too. https://www.gyblv.com/

Are these considered plain text? Should I be changing any settings? Thank you.

ondrejklinovsky avatar

I ran another one and it failed to return the email on the contact page, it returned a generic godaddy.com email. Would I have to adjust the maxDepth for it to reach the contact page?

Try to increase maxRequestsPerStartUrl in your input (or remove it completely). In your run it was set to 1, meaning the actor will always scrape just one page regardless of maxDepth. That should solve the issue with https://www.gyblv.com/ as well.

Developer
Maintained by Apify
Actor metrics
  • 1.8k monthly users
  • 114 stars
  • 99.2% runs succeeded
  • 2.9 days response time
  • Created in May 2019
  • Modified 2 days ago