Youtube Scraper avatar
Youtube Scraper
Try for free

Pay $5.00 for 1,000 videos

View all Actors
Youtube Scraper

Youtube Scraper

streamers/youtube-scraper
Try for free

Pay $5.00 for 1,000 videos

YouTube crawler and video scraper. Alternative YouTube API with no limits or quotas. Extract and download channel name, likes, number of views, and number of subscribers.

User avatar

No transcript scraped for some videos with transcript.

Closed

mwatch opened this issue
3 months ago

There are some videos where transcript menu is accessed not through a "3 dot" link on the right side below the video player window, but by clicking on "more" link (see attached screenshot).

Here is an example:

I wonder if you can add scraping transcripts from those videos? We are using this actor in a production application.

User avatar

Hi, sorry for late reply. Transcripts can be added in short time, we just need to discuss it with the team.

I noticed that content of subtitles is very similar to content of transcripts, the only difference seems to be that the subtitles are split into more time slots. Would subtitles work for you or do you need specifically transcripts?

Subtitles:

100:00:2,360 --> 00:00:8,840 [Music]
200:00:6,879 --> 00:00:11,440 hello everyone good morning good 
300:00:8,840 --> 00:00:14,280 afternoon and good evening everywhere um
400:00:11,440 --> 00:00:18,160 thank you so much for joining us today

Transcript:

10:02 [Music] hello everyone good morning good
20:08 afternoon and good evening everywhere um thank you so much for joining us today
User avatar

mwatch

3 months ago

Subtitles would work for us, thanks

User avatar

SashaEpstein

3 months ago

Thanks, the latest actor is able to get transcripts for many videos with transcript accessible from the "more" link. However, there are still quite a few videos with transcript accessible from the "more" link, that the actor is not able to get subtitles. Here are a few examples: 0JqsVhglXKA, 2-JTtnqABqc, 3ArykuHsPrI, 4MCYxNAYM5A, 4mFpI3vcz54

User avatar

SashaEpstein

3 months ago

Hi Ondrej Klinovsk媒, any update on my question? I am from the MedWatch development team, we are using your actor in our production application.

User avatar

Hi, I tried to scrape all the videos you provided and it looks like the issue in not in the actor but in our proxies - when the actor uses certain proxies, Youtube doesn't send any data containing subtitles information. I'll discuss this with our team and we'll try to fix it asap.

User avatar

SashaEpstein

3 months ago

Thanks.

User avatar

Hi,

thanks for your patience, we're still trying to figure out the best solution. We'll have an update soon!

User avatar

Hi! Thanks for waiting. I've checked and saw that the videos that didn't get subtitles downloaded have non-english subtitles, while the default sub language is English. I've modified the input schema so that the default value for sub language field is now "Any" (make sure to check your input for this). Here is a test run with downloaded subs: https://console.apify.com/view/runs/ghTDkPzAjdSfvqoqs. Let me know if you have any more cases where you don't get subtitles downloaded by reopening the issue.

User avatar

SashaEpstein

2 months ago

Thanks, Sviatozar,

With the 鈥渁ny鈥 value in the language field in the input request I can successfully get subtitles in the actor output.

We appreciate your help. If I find any more cases where subtitles (transcript) is not available, I will let you know.

Developer
Maintained by Apify
Actor metrics
  • 691 monthly users
  • 66.9% runs succeeded
  • 2 days response time
  • Created in Jul 2023
  • Modified 6 days ago