Twitter Profile Scraper avatar

Twitter Profile Scraper

Try for free

3 days trial then $10.00/month - No credit card required now

Go to Store
Twitter Profile Scraper

Twitter Profile Scraper

epctex/twitter-profile-scraper
Try for free

3 days trial then $10.00/month - No credit card required now

Scrape any Twitter profile of any user. Extract tweets, replies, favorites, retweets, conversation threads and many more without any limit. Gather user related information such as verification, location, profile image, friends, followers, following and much more! No limits on Twitter!

Actor - Twitter Profile Scraper

Since Twitter doesn't provide a good and free API, this actor should help you to retrieve data from it.

Features

  • Scrape user detail - Scrape any user-related information from any profile such as friends, followers, following, verified account, location, profile image, profile banner, profile URL, date of creation, and so on.

  • Scrape tweet - You can retrieve all the tweets from any profile with all the details. Language, sensitive language, reply, quote, retweet, pinned, retweeted, and all sorts of Tweet-related information.

  • Scrape statistics - Gather all the statistical information of a tweet.

Why use this actor?

This actor is extremely fast and optimized. It'll scrape Twitter profiles around 26 times faster than the other equivalent scrapers. Therefore you will consume fewer resources and it will be cheaper to use it.

Bugs, fixes, updates, and changelog

This scraper is under active development. If you have any feature requests you can create an issue from here.

Input Parameters

The input of this scraper should be JSON containing the list of pages on the Twitter Profile Scraper that should be visited. Possible fields are:

  • startUrls: (Optional) (Array) List of Twitter Profile URLs. You should only provide profile URLs.

  • addUserInfo: (Optional) (Boolean) This will add user information into all scraped tweets. Please keep in mind that the size of the output will increase proportionally to the user information.

  • addNotFoundUsersToOutput: (Optional) (Boolean) Appends an object to the output where the users are not found. It will work only the 'Only User Info' option is enabled.

  • addSuspendedUsersToOutput: (Optional) (Boolean) Appends an object to the output where the users are suspended. It will work only the 'Only User Info' option is enabled.

  • onlyUserInfo: (Optional) (Boolean)This option enables you to retrieve only user information and not the tweets of that user.

  • customMapFunction: (Optional) (String) Function that takes each object's handle as an argument and returns the object with executing the function.

  • proxy: (Required) (Proxy Object) Proxy configuration.

This solution requires the use of Proxy servers, either your own proxy servers or you can use Apify Proxy.

Tip

When you want to scrape over a specific profile URL, just copy and paste the link as one of the startUrl.

Compute Unit Consumption

The actor is optimized to run blazing fast and scrape many tweets as possible. Therefore, it forefronts all tweet detail requests. If the actor doesn't block very often it'll scrape 100 tweets in 20 seconds with ~0.02-0.025 compute units.

Twitter Profile Scraper Input example

1{
2  "proxy":{
3    "useApifyProxy":true
4  },
5  "addUserInfo":true,
6  "startUrls":[
7    "https://twitter.com/apify"
8  ]
9}

During the Run

During the run, the actor will output messages letting you know what is going on. Each message always contains a short label specifying which page from the provided list is currently specified. When items are loaded from the page, you should see a message about this event with a loaded item count and total item count for each page.

If you provide incorrect input to the actor, it will immediately stop with a failure state and output an explanation of what is wrong.

Twitter Profile Scraper Export

During the run, the actor stores results into a dataset. Each item is a separate item in the dataset.

You can manage the results in any language (Python, PHP, Node JS/NPM). See the FAQ or our API reference to learn more about getting results from this Twitter Profile actor.

Scraped Tweets Properties

The structure of each tweet in Twitter Profile Scraper looks like this:

Tweet Detail

1{
2  "user": {
3    "created_at": "Tue Sep 01 08:38:01 +0000 2015",
4    "default_profile": false,
5    "default_profile_image": false,
6    "description": "Apify is a software platform that lets you automate anything a person can do manually in a web browser, and then run it at scale.",
7    "entities": {
8      "description": {
9        "urls": []
10      },
11      "url": {
12        "urls": [
13          {
14            "display_url": "apify.com",
15            "expanded_url": "https://apify.com",
16            "url": "https://t.co/TmTF9tTQJv",
17            "indices": [
18              0,
19              23
20            ]
21          }
22        ]
23      }
24    },
25    "fast_followers_count": 0,
26    "favourites_count": 347,
27    "followers_count": 1032,
28    "friends_count": 164,
29    "has_custom_timelines": false,
30    "is_translator": false,
31    "listed_count": 39,
32    "location": "The Interweb",
33    "media_count": 216,
34    "name": "Apify",
35    "normal_followers_count": 1032,
36    "pinned_tweet_ids_str": [
37      "1322257049527472128"
38    ],
39    "possibly_sensitive": false,
40    "profile_banner_extensions": {
41      "mediaColor": {
42        "r": {
43          "ok": {
44            "palette": [
45              {
46                "percentage": 91.13,
47                "rgb": {
48                  "blue": 255,
49                  "green": 249,
50                  "red": 242
51                }
52              },
53              {
54                "percentage": 5.18,
55                "rgb": {
56                  "blue": 255,
57                  "green": 188,
58                  "red": 134
59                }
60              },
61              {
62                "percentage": 1.91,
63                "rgb": {
64                  "blue": 112,
65                  "green": 110,
66                  "red": 104
67                }
68              },
69              {
70                "percentage": 1.78,
71                "rgb": {
72                  "blue": 255,
73                  "green": 157,
74                  "red": 87
75                }
76              }
77            ]
78          }
79        }
80      }
81    },
82    "profile_banner_url": "https://pbs.twimg.com/profile_banners/3510729917/1602246779",
83    "profile_image_extensions": {
84      "mediaColor": {
85        "r": {
86          "ok": {
87            "palette": [
88              {
89                "percentage": 86.89,
90                "rgb": {
91                  "blue": 255,
92                  "green": 255,
93                  "red": 255
94                }
95              },
96              {
97                "percentage": 4.83,
98                "rgb": {
99                  "blue": 19,
100                  "green": 144,
101                  "red": 255
102                }
103              },
104              {
105                "percentage": 4.27,
106                "rgb": {
107                  "blue": 1,
108                  "green": 215,
109                  "red": 150
110                }
111              },
112              {
113                "percentage": 3.05,
114                "rgb": {
115                  "blue": 233,
116                  "green": 197,
117                  "red": 113
118                }
119              },
120              {
121                "percentage": 0.29,
122                "rgb": {
123                  "blue": 169,
124                  "green": 241,
125                  "red": 221
126                }
127              }
128            ]
129          }
130        }
131      }
132    },
133    "profile_image_url_https": "https://pbs.twimg.com/profile_images/1354484488697425920/aQ5dSaTK_normal.jpg",
134    "profile_interstitial_type": "",
135    "protected": false,
136    "screen_name": "apify",
137    "statuses_count": 814,
138    "translator_type": "none",
139    "url": "https://t.co/TmTF9tTQJv",
140    "verified": false,
141    "withheld_in_countries": []
142  },
143  "created_at": "Thu Jul 14 13:34:06 +0000 2022",
144  "conversation_id_str": "1547575166686793730",
145  "display_text_range": [
146    0,
147    280
148  ],
149  "entities": {
150    "media": [
151      {
152        "display_url": "pic.twitter.com/n5P2o3E7nG",
153        "expanded_url": "https://twitter.com/apify/status/1547575166686793730/photo/1",
154        "id_str": "1547575164660879363",
155        "indices": [
156          281,
157          304
158        ],
159        "media_url_https": "https://pbs.twimg.com/media/FXoXabDUEAMR-4X.jpg",
160        "type": "photo",
161        "url": "https://t.co/n5P2o3E7nG",
162        "features": {
163          "large": {
164            "faces": [
165              {
166                "x": 411,
167                "y": 60,
168                "h": 55,
169                "w": 55
170              },
171              {
172                "x": 644,
173                "y": 111,
174                "h": 57,
175                "w": 57
176              }
177            ]
178          },
179          "medium": {
180            "faces": [
181              {
182                "x": 411,
183                "y": 60,
184                "h": 55,
185                "w": 55
186              },
187              {
188                "x": 644,
189                "y": 111,
190                "h": 57,
191                "w": 57
192              }
193            ]
194          },
195          "small": {
196            "faces": [
197              {
198                "x": 272,
199                "y": 39,
200                "h": 36,
201                "w": 36
202              },
203              {
204                "x": 427,
205                "y": 73,
206                "h": 37,
207                "w": 37
208              }
209            ]
210          },
211          "orig": {
212            "faces": [
213              {
214                "x": 411,
215                "y": 60,
216                "h": 55,
217                "w": 55
218              },
219              {
220                "x": 644,
221                "y": 111,
222                "h": 57,
223                "w": 57
224              }
225            ]
226          }
227        },
228        "sizes": {
229          "large": {
230            "h": 640,
231            "w": 1024,
232            "resize": "fit"
233          },
234          "medium": {
235            "h": 640,
236            "w": 1024,
237            "resize": "fit"
238          },
239          "small": {
240            "h": 425,
241            "w": 680,
242            "resize": "fit"
243          },
244          "thumb": {
245            "h": 150,
246            "w": 150,
247            "resize": "crop"
248          }
249        },
250        "original_info": {
251          "height": 640,
252          "width": 1024,
253          "focus_rects": [
254            {
255              "x": 0,
256              "y": 0,
257              "w": 1024,
258              "h": 573
259            },
260            {
261              "x": 0,
262              "y": 0,
263              "w": 640,
264              "h": 640
265            },
266            {
267              "x": 1,
268              "y": 0,
269              "w": 561,
270              "h": 640
271            },
272            {
273              "x": 121,
274              "y": 0,
275              "w": 320,
276              "h": 640
277            },
278            {
279              "x": 0,
280              "y": 0,
281              "w": 1024,
282              "h": 640
283            }
284          ]
285        }
286      }
287    ],
288    "user_mentions": [],
289    "urls": [
290      {
291        "display_url": "apify.it/3uKtuY5",
292        "expanded_url": "https://apify.it/3uKtuY5",
293        "url": "https://t.co/3l26ZicZjk",
294        "indices": [
295          257,
296          280
297        ]
298      }
299    ],
300    "hashtags": [
301      {
302        "indices": [
303          225,
304          242
305        ],
306        "text": "instagramscraper"
307      },
308      {
309        "indices": [
310          243,
311          255
312        ],
313        "text": "webscraping"
314      }
315    ],
316    "symbols": []
317  },
318  "extended_entities": {
319    "media": [
320      {
321        "display_url": "pic.twitter.com/n5P2o3E7nG",
322        "expanded_url": "https://twitter.com/apify/status/1547575166686793730/photo/1",
323        "ext_alt_text": "Three iPhone displays showing the description of three Instagram Scrapers from Apify Store: Instagram Profile Scraper, Instagram Hashtag Scraper, and Instagram Comment Scraper.",
324        "id_str": "1547575164660879363",
325        "indices": [
326          281,
327          304
328        ],
329        "media_key": "3_1547575164660879363",
330        "media_url_https": "https://pbs.twimg.com/media/FXoXabDUEAMR-4X.jpg",
331        "type": "photo",
332        "url": "https://t.co/n5P2o3E7nG",
333        "ext_media_color": {
334          "palette": [
335            {
336              "percentage": 61.19,
337              "rgb": {
338                "blue": 242,
339                "green": 242,
340                "red": 242
341              }
342            },
343            {
344              "percentage": 18.76,
345              "rgb": {
346                "blue": 226,
347                "green": 119,
348                "red": 211
349              }
350            },
351            {
352              "percentage": 9.6,
353              "rgb": {
354                "blue": 189,
355                "green": 120,
356                "red": 147
357              }
358            },
359            {
360              "percentage": 3.7,
361              "rgb": {
362                "blue": 63,
363                "green": 48,
364                "red": 62
365              }
366            },
367            {
368              "percentage": 2.01,
369              "rgb": {
370                "blue": 195,
371                "green": 120,
372                "red": 228
373              }
374            }
375          ]
376        },
377        "ext_media_availability": {
378          "status": "Available"
379        },
380        "features": {
381          "large": {
382            "faces": [
383              {
384                "x": 411,
385                "y": 60,
386                "h": 55,
387                "w": 55
388              },
389              {
390                "x": 644,
391                "y": 111,
392                "h": 57,
393                "w": 57
394              }
395            ]
396          },
397          "medium": {
398            "faces": [
399              {
400                "x": 411,
401                "y": 60,
402                "h": 55,
403                "w": 55
404              },
405              {
406                "x": 644,
407                "y": 111,
408                "h": 57,
409                "w": 57
410              }
411            ]
412          },
413          "small": {
414            "faces": [
415              {
416                "x": 272,
417                "y": 39,
418                "h": 36,
419                "w": 36
420              },
421              {
422                "x": 427,
423                "y": 73,
424                "h": 37,
425                "w": 37
426              }
427            ]
428          },
429          "orig": {
430            "faces": [
431              {
432                "x": 411,
433                "y": 60,
434                "h": 55,
435                "w": 55
436              },
437              {
438                "x": 644,
439                "y": 111,
440                "h": 57,
441                "w": 57
442              }
443            ]
444          }
445        },
446        "sizes": {
447          "large": {
448            "h": 640,
449            "w": 1024,
450            "resize": "fit"
451          },
452          "medium": {
453            "h": 640,
454            "w": 1024,
455            "resize": "fit"
456          },
457          "small": {
458            "h": 425,
459            "w": 680,
460            "resize": "fit"
461          },
462          "thumb": {
463            "h": 150,
464            "w": 150,
465            "resize": "crop"
466          }
467        },
468        "original_info": {
469          "height": 640,
470          "width": 1024,
471          "focus_rects": [
472            {
473              "x": 0,
474              "y": 0,
475              "w": 1024,
476              "h": 573
477            },
478            {
479              "x": 0,
480              "y": 0,
481              "w": 640,
482              "h": 640
483            },
484            {
485              "x": 1,
486              "y": 0,
487              "w": 561,
488              "h": 640
489            },
490            {
491              "x": 121,
492              "y": 0,
493              "w": 320,
494              "h": 640
495            },
496            {
497              "x": 0,
498              "y": 0,
499              "w": 1024,
500              "h": 640
501            }
502          ]
503        }
504      }
505    ]
506  },
507  "favorite_count": 0,
508  "favorited": false,
509  "full_text": "Extract profile information from Instagram with Instagram Profile Scraper! 📸 Quick, ready-made, and easy to set up, Instagram Scraper lets you extract usernames, followers & follows, bio, likes, & more. Try it now! 👇 #instagramscraper #webscraping  https://t.co/3l26ZicZjk https://t.co/n5P2o3E7nG",
510  "is_quote_status": false,
511  "lang": "en",
512  "possibly_sensitive": false,
513  "possibly_sensitive_editable": true,
514  "quote_count": 0,
515  "reply_count": 0,
516  "retweet_count": 0,
517  "retweeted": false,
518  "source": "<a href=\"http://www.hubspot.com/\" rel=\"nofollow\">HubSpot</a>",
519  "user_id_str": "3510729917",
520  "id_str": "1547575166686793730"
521}

Contact

Please visit us through epctex.com to see all the products that are available for you. If you are looking for any custom integration or so, please reach out to us through the chat box in epctex.com. In need of support? devops@epctex.com is at your service.

Developer
Maintained by Community

Actor Metrics

  • 156 monthly users

  • 19 stars

  • 99% runs succeeded

  • 21 hours response time

  • Created in Jul 2022

  • Modified 12 hours ago