Scrape Emails And Social Media Links With Just a Query avatar
Scrape Emails And Social Media Links With Just a Query
Try for free

No credit card required

View all Actors
Scrape Emails And Social Media Links With Just a Query

Scrape Emails And Social Media Links With Just a Query

kirada/scrape-emails-and-social-media-links-with-just-a-query
Try for free

No credit card required

Get Emails And Social Media Links With Just a Query !!!!! The actor will take a query and some parameter as input and scrape contact details with just a query. ACTOR IS FREE FOR LIMITED TIME.

Website and website's contact details scraper.

⚠️ ACTOR IS FREE FOR A LIMITED TIME !!! IT WILL SWITCH TO SUSCRIBTION IN 1-4 MONTHS SO DON'T SKIP THIS CHANCE !!!⚠️

The Website and contact details scraper is made to search websites through search engine such as google and duckduckgo. Then, it goes through each of these websites to find and collect their contact information such as emails and social media links.

🌟 Features

  • Handling of regex expression for custom data extraction.
  • Rapid processing speed.
  • Low cost.
  • Email and phone numbers extraction.
  • Social media links (facebook, linkedin, twitter, tiktok, instagram) extraction.

Note : Search engine can only show between 250-500 websites depending on the query.

🔍 What is regex ?

Regex gives you the ability to be precise by capturing specific patterns in text. You can dive into URLs further by customizing your search using personalized regex patterns. With Regex, you can extract exactly what you're looking for from your inputted expressions. It opens up a world of specificity, allowing you to tailor your extractions exactly to your needs.

🧩 How do I make a pattern ?

Utilize sophisticated tools like ChatGPT to generate a regex pattern. For example, let's make a pattern for identifying YouTube channel links. Craft a prompt without the use of quotation marks, directing to 'create a regex pattern to find YouTube channel links.'. The pattern should look like this : https://www.youtube.com/channel/[a-zA-Z0-9_-]+. Not this : "https://www.youtube.com/channel/[a-zA-Z0-9_-]+".

How to use ?

  1. Select your search engine. (duckduckgo is recommanded)
  2. Input your query for website extraction.
  3. Enter the numbers of results you want.
  4. [OPTIONNAL] Enter your regex patterns for custom data extraction.
  5. [OPTIONNAL] Enter the crawling depth for each website.

How it works ?

The scraper searches for websites through Google search and then goes through each of those websites to find and extract contact details information.

Input example (json)

1{
2    "maxCrawlDepth": 0,
3    "pattern": [
4        "home"
5    ],
6    "query": "test",
7    "results": 20
8}

Note : The "home" regex pattern has been used for testing purposes.

Input example (picture)

alt

Output (json)

1[
2  {
3    "url": "https://www.merriam-webster.com/dictionary/test",
4    "emails": [],
5    "twitter": "https://twitter.com/merriamwebster",
6    "facebook": "https://facebook.com/merriamwebster",
7    "instagram": "https://instagram.com/merriamwebster",
8    "linkedin": "",
9    "tiktok": "",
10    "phone": [],
11    "pattern_1": [
12      "home"
13    ]
14  },
15  {
16    "url": "https://www.dictionary.com/browse/test",
17    "emails": [],
18    "twitter": "https://twitter.com/mariassnflwrlou",
19    "facebook": "",
20    "instagram": "",
21    "linkedin": "",
22    "tiktok": "",
23    "phone": [],
24    "pattern_1": [
25      "home"
26    ]
27  },
28  {
29    "url": "https://aspr.hhs.gov/TestToTreat/Pages/default.aspx",
30    "emails": [
31      "DIAL@usaginganddisability.org"
32    ],
33    "twitter": "https://twitter.com/asprgov",
34    "facebook": "https://facebook.com/ASPRgov",
35    "instagram": "https://instagram.com/asprgov",
36    "linkedin": "https://linkedin.com/company/asprcompany",
37    "tiktok": "",
38    "phone": [],
39    "pattern_1": [
40      "home"
41    ]
42  },
43  {
44    "url": "https://support.google.com/websearch/answer/10106608?hl=en",
45    "emails": [
46      "asxvmprobertest@gmail.com"
47    ],
48    "twitter": "",
49    "facebook": "",
50    "instagram": "",
51    "linkedin": "",
52    "tiktok": "",
53    "phone": [],
54    "pattern_1": [
55      "home"
56    ]
57  },
58  {
59    "url": "https://www.speedtest.net/",
60    "emails": [],
61    "twitter": "https://twitter.com/speedtest",
62    "facebook": "https://facebook.com/speedtest",
63    "instagram": "",
64    "linkedin": "",
65    "tiktok": "",
66    "phone": [],
67    "pattern_1": [
68      "home"
69    ]
70  },
71  {
72    "url": "https://test.io/",
73    "emails": [],
74    "twitter": "https://twitter.com/_testio",
75    "facebook": "https://facebook.com/testiocommunity",
76    "instagram": "",
77    "linkedin": "https://linkedin.com/company/testiocompany",
78    "tiktok": "",
79    "phone": [],
80    "pattern_1": [
81      "home"
82    ]
83  },
84  {
85    "url": "https://en.wikipedia.org/wiki/Test",
86    "emails": "",
87    "twitter": "",
88    "facebook": "",
89    "instagram": "",
90    "linkedin": "",
91    "tiktok": "",
92    "phone": ""
93  },
94  {
95    "url": "https://fast.com/",
96    "emails": "",
97    "twitter": "",
98    "facebook": "",
99    "instagram": "",
100    "linkedin": "",
101    "tiktok": "",
102    "phone": ""
103  },
104  {
105    "url": "https://implicit.harvard.edu/implicit/takeatest.html",
106    "emails": [
107      "superman@hero.com",
108      "questions@projectimplicit.net",
109      "irbsbshelp@virginia.edu"
110    ],
111    "twitter": "",
112    "facebook": "",
113    "instagram": "",
114    "linkedin": "",
115    "tiktok": "",
116    "phone": [],
117    "pattern_1": []
118  },
119  {
120    "url": "https://www.cdc.gov/coronavirus/2019-ncov/testing/self-testing.html",
121    "emails": [],
122    "twitter": "https://twitter.com/CDCgov",
123    "facebook": "https://facebook.com/CDC",
124    "instagram": "https://instagram.com/CDCgov",
125    "linkedin": "https://linkedin.com/company/centers-for-disease-control-and-preventioncompany",
126    "tiktok": "",
127    "phone": [],
128    "pattern_1": [
129      "home"
130    ]
131  },
132  {
133    "url": "https://implicit.harvard.edu/implicit/",
134    "emails": "",
135    "twitter": "",
136    "facebook": "",
137    "instagram": "",
138    "linkedin": "",
139    "tiktok": "",
140    "phone": ""
141  },
142  {
143    "url": "https://www.merriam-webster.com/thesaurus/test",
144    "emails": [],
145    "twitter": "https://twitter.com/merriamwebster",
146    "facebook": "https://facebook.com/merriamwebster",
147    "instagram": "https://instagram.com/merriamwebster",
148    "linkedin": "",
149    "tiktok": "",
150    "phone": [],
151    "pattern_1": [
152      "home"
153    ]
154  },
155  {
156    "url": "https://languages.oup.com/google-dictionary-en/",
157    "emails": [],
158    "twitter": "https://twitter.com/OxLanguages",
159    "facebook": "https://facebook.com/OxLanguages",
160    "instagram": "",
161    "linkedin": "",
162    "tiktok": "",
163    "phone": [],
164    "pattern_1": []
165  },
166  {
167    "url": "https://www.av-test.org/en/",
168    "emails": [
169      "av-test@isurfer.de"
170    ],
171    "twitter": "https://twitter.com/avtestorg",
172    "facebook": "https://facebook.com/avtestorg",
173    "instagram": "",
174    "linkedin": "https://linkedin.com/company/avtestorgcompany",
175    "tiktok": "",
176    "phone": [],
177    "pattern_1": [
178      "home"
179    ]
180  },
181  {
182    "url": "https://www.ets.org/gre.html",
183    "emails": [],
184    "twitter": "https://twitter.com/etsinsights",
185    "facebook": "https://facebook.com/GREGeneralTest",
186    "instagram": "https://instagram.com/gregeneraltest",
187    "linkedin": "https://linkedin.com/company/educational-testing-service-grecompany",
188    "tiktok": "",
189    "phone": [],
190    "pattern_1": [
191      "home"
192    ]
193  }]

Output (picture)

alt Note : The output example has been condensed for readability purposes. Note : Websites where contact details or websites where their was an error while being processed don't produce a result. If an error occurs while processing a URL, there will be no output generated for that specific URL.

Developer
Maintained by Community
Actor metrics
  • 34 monthly users
  • 2 stars
  • 95.3% runs succeeded
  • 15 hours response time
  • Created in Apr 2024
  • Modified 7 days ago