YouTube Scraper avatar
YouTube Scraper

Pricing

$5.00 / 1,000 videos

Go to Store
YouTube Scraper

YouTube Scraper

streamers/youtube-scraper

Developed by

Streamers

Maintained by Apify

YouTube crawler and video scraper. Alternative YouTube API with no limits or quotas. Extract and download channel name, likes, number of views, and number of subscribers.

4.6 (28)

Pricing

$5.00 / 1,000 videos

618

Monthly users

2.6k

Runs succeeded

95%

Response time

2.4 days

Last modified

18 hours ago

title

id

url

viewCount

likes

channelName

numberOfSubscribers

duration

Crawlee, the web scraping and browser automation library

g1Ll9OlFwEQ

https://www.youtube.com/watch?v=g1Ll9OlFwEQ

10150

136

Apify

6640

00:03:15

Crawlee for Python: Build reliable crawlers. Fast.

Ejhudr7e-h4

https://www.youtube.com/watch?v=Ejhudr7e-h4

916

23

Apify

6640

00:03:39

Build a Web Scraper from Scratch | JavaScript | Playwright | Crawlee

DOtJEwVsJic

https://www.youtube.com/watch?v=DOtJEwVsJic

4411

109

deejaydev

1680

00:22:43

The data above is synthetic and does not reflect real-world values. View full dataset

DW

How do I deduplicate results so I'm not getting the same channels, videos, etc.

Closed
dwaveswrld opened this issue
2 months ago

The scraper works perfectly, but I just want to be able to de-duplicate according by channel name.

The same channels pop up again and again, and it is messing up what I am trying to do with this YouTuber scraper.

I have tried to find integrations that may support this, but none pop up.

Wondering if YouTube scraper natively has this, or if it is possible to get this integrated with this scraper.

lukas.prusa avatar

Hi, thanks for opening this issue!

To confirm, you mean deduplication of the channels in a single run, or multiple? Implementing this in a single run is doable for us, however in multiple ones, you will have to use some outside tools (because ultimately they will have to compare the values with the previous runs via some database).

If you want to do this, then ideally you would create your own solution with a proper database, but you could go full Apify integrations if you want to. You could use the Merge, Dedup & Transform Datasets Actor to deduplicate them. Then use the Append to dataset Actor to put it into one big dataset.

I hope this helps, thanks and happy scraping!

Pricing

Pricing model

Pay per result 

This Actor is paid per result. You are not charged for the Apify platform usage, but only a fixed price for each dataset of 1,000 items in the Actor outputs.

Price per 1,000 items

$5.00