🏯 Tweet Scraper V2 - X / Twitter Scraper

Pricing

from $0.40 / 1,000 tweets

Developed by

API Dojo


Maintained by Community

⚡️ Lightning-fast search, URL, list, and profile scraping, with customizable filters. At $0.40 per 1000 tweets, and 30-80 tweets per second, it is ideal for researchers, entrepreneurs, and businesses! Get comprehensive insights from Twitter (X) now!

3.8 (84)

771

Total users: 19K
Monthly users: 2.6K
Runs succeeded: >99%
Issues response: 4.3 hours
Last modified: a day ago

Scraper returns a lot of duplicate tweets.

Closed

relishable_finance opened this issue
10 months ago

For this run: https://console.apify.com/actors/runs/hcZD7Xyb7rBkXEQj7 tweet ID 1833141861617221670 was duplicated 5 times.

And for this run: https://console.apify.com/actors/61RPP7dywgiy0JPD0/runs/0GBM6p0QcgxdmVLl9 tweet ID 1833197842661576793 was duplicated 104 times.

That made my scraping fees about 100 times higher.

Attached is the result of run hcZD7Xyb7rBkXEQj7.

API Dojo (apidojo)

10 months ago

Hello,

Our engineering team checked the runs and couldn't find any issues on our end. Our scraper sends the query you give it to Twitter and returns whatever Twitter returns for it. There is no additional logic, including duplicate removal. It never alters the output.

Twitter changes its behaviour constantly, so it is better to test your queries in the Twitter web UI before running the actor. Another cause could be combining multiple from queries with ORs. Are you sure that works with Twitter? If you are using ORs, I think the best approach would be to create a separate run for each profile.

Does that make sense?

Cheers!
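
Since the actor is described as applying no duplicate removal, duplicates have to be dropped after the dataset is downloaded. The following is a minimal sketch of that step, not part of the actor itself. It assumes each dataset item is a dictionary whose tweet ID is stored under the key `id`; that key name and the helper `dedupe_tweets` are assumptions made for illustration.

```python
from typing import Any, Dict, Iterable, List


def dedupe_tweets(items: Iterable[Dict[str, Any]], key: str = "id") -> List[Dict[str, Any]]:
    """Keep the first occurrence of each tweet, preserving the original order.

    ASSUMPTION: the tweet ID lives under `key` ("id" by default).
    """
    seen = set()
    unique = []
    for item in items:
        tweet_id = item.get(key)
        if tweet_id is None:
            # Items without an ID cannot be compared, so they are kept as-is.
            unique.append(item)
            continue
        if tweet_id in seen:
            continue  # already emitted this tweet once
        seen.add(tweet_id)
        unique.append(item)
    return unique


# Hypothetical sample data; the IDs are the ones reported in this thread.
sample = [
    {"id": "1833141861617221670", "text": "a"},
    {"id": "1833141861617221670", "text": "a"},
    {"id": "1833197842661576793", "text": "b"},
]
unique_sample = dedupe_tweets(sample)
print(len(unique_sample))
```

This only cleans the downloaded data. It presumably does not change what a run has already been billed for, so it complements rather than replaces narrowing the query.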

relishable_finance

10 months ago

I tried again with only one from, but there are many duplicates as well. https://console.apify.com/actors/61RPP7dywgiy0JPD0/runs/8tXLNNdSegCRGCYue


topical_summer

10 months ago

Results were duplicated 27 times in my last run.

API Dojo (apidojo)

10 months ago

Hey hey,

As I mentioned, this is something we have no control over. Our actor uses Twitter search and keeps paginating for as long as Twitter returns another page. When you try to fetch lots of tweets, duplicates are inevitable, since Twitter behaves very strangely with long paginations.

For fetching a profile, I suggest you use an approach similar to the one we explained here:

{
  "includeSearchTerms": false,
  "onlyImage": false,
  "onlyQuote": false,
  "onlyTwitterBlue": false,
  "onlyVerifiedUsers": false,
  "onlyVideo": false,
  "searchTerms": [
    "from:NASA since:2023-01-01 until:2023-02-01",
    "from:NASA since:2023-02-01 until:2023-03-01",
    "from:NASA since:2023-03-01 until:2023-04-01",
    "from:NASA since:2023-04-01 until:2023-05-01",
    "from:NASA since:2023-05-01 until:2023-06-01",
    "from:NASA since:2023-06-01 until:2023-07-01",
    "from:NASA since:2023-07-01 until:2023-08-01",
    "from:NASA since:2023-08-01 until:2023-09-01",
    "from:NASA since:2023-09-01 until:2023-10-01",
    "from:NASA since:2023-10-01 until:2023-11-01",
    "from:NASA since:2023-11-01 until:2023-12-01"
  ],
  "sort": "Latest",
  "tweetLanguage": "en"
}

That way, each query needs fewer pagination steps and your results will contain fewer duplicates.

Cheers!
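
Writing the monthly windows by hand gets tedious for longer ranges, so here is a minimal sketch that generates the same kind of `searchTerms` list. The helper name `monthly_search_terms` is hypothetical; the input keys (`searchTerms`, `sort`, `tweetLanguage`) are taken from the example input in the reply above, and the remaining flags are left out for brevity.

```python
import json
from datetime import date
from typing import List


def monthly_search_terms(handle: str, start: date, end: date) -> List[str]:
    """Build one 'from:<handle> since:<d1> until:<d2>' term per calendar month.

    `start` is the first `since` date; `end` is the last `until` date.
    """
    terms = []
    current = start
    while current < end:
        # First day of the month following `current`.
        if current.month == 12:
            next_month = date(current.year + 1, 1, 1)
        else:
            next_month = date(current.year, current.month + 1, 1)
        until = min(next_month, end)
        terms.append(
            f"from:{handle} since:{current.isoformat()} until:{until.isoformat()}"
        )
        current = next_month
    return terms


terms = monthly_search_terms("NASA", date(2023, 1, 1), date(2023, 12, 1))
run_input = {
    "searchTerms": terms,
    "sort": "Latest",
    "tweetLanguage": "en",
}
print(json.dumps(run_input, indent=2))
```

With the arguments shown, this yields the same eleven terms as the hand-written input. Shorter windows (for example weekly) should reduce pagination depth further, at the cost of more queries.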

relishable_finance

10 months ago

https://console.apify.com/actors/61RPP7dywgiy0JPD0/runs/sJdAPAajvSfCpQpud

I've tried it this way, but when no results are returned, it still shows 33 outputs.

API Dojo (apidojo)

10 months ago

Hello,

Yes, that is expected. When your query returns 0 results, you get a zeroResult object, which lets us cover the cost of the run. You can check the output in the dataset to see this.

Cheers!
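
If those placeholder objects get in the way of downstream processing, they can be separated from real tweets after download. The sketch below is only an illustration: the thread does not show what a zeroResult object looks like, so the check in `is_placeholder` (a truthy `zeroResult` key) is an assumption that should be adjusted after inspecting the actual dataset. Both function names are hypothetical.

```python
from typing import Any, Callable, Dict, Iterable, List, Tuple

Item = Dict[str, Any]


def is_placeholder(item: Item) -> bool:
    # ASSUMPTION: placeholder items carry a truthy "zeroResult" key.
    # The real shape is not shown in this thread; check your dataset first.
    return bool(item.get("zeroResult"))


def split_placeholders(
    items: Iterable[Item],
    predicate: Callable[[Item], bool] = is_placeholder,
) -> Tuple[List[Item], List[Item]]:
    """Return (tweets, placeholders), preserving order within each list."""
    tweets: List[Item] = []
    placeholders: List[Item] = []
    for item in items:
        if predicate(item):
            placeholders.append(item)
        else:
            tweets.append(item)
    return tweets, placeholders


# Hypothetical sample data for illustration only.
sample_items = [
    {"id": "1", "text": "hello"},
    {"zeroResult": True},
    {"id": "2", "text": "world"},
]
tweets, placeholders = split_placeholders(sample_items)
print(len(tweets), len(placeholders))
```

Passing a different `predicate` (for example, one that checks for a missing tweet ID) keeps the same split logic if the placeholder turns out to have another shape.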