Instagram Scraper avatar
Instagram Scraper

Pricing

from $1.50 / 1,000 results

Go to Store
Instagram Scraper

Instagram Scraper

Developed by

Apify

Apify

Maintained by Apify

Scrape and download Instagram posts, profiles, places, hashtags, photos, and comments. Get data from Instagram using one or more Instagram URLs or search queries. Export scraped data, run the scraper via API, schedule and monitor runs or integrate with other tools.

3.6 (62)

Pricing

from $1.50 / 1,000 results

1870

Total users

112K

Monthly users

13K

Runs succeeded

98%

Issues response

3.1 days

Last modified

7 hours ago

RC

Download Video Transcripts

Closed

robert.castagna opened this issue
2 months ago

Is it possible to support downloading of automatic transcription text from a video or reel?

Similar scrapers have the functionality to do the same from tiktok videos, youtube videos and x videos - yet to find one for Instagram!

weuts avatar

Hi 😊

Thanks for reaching out! I've created an internal issue to look into whether it is possible - we'll get back to you once we know more 😊

RC

robert.castagna

2 months ago

Sweet, thanks! Currently processing video streams for audio extraction and its quite the implementation... lol

alexey avatar

Hi!

Currently, it is not found anywhere in Instagram's public reels. You can try as follows:

  1. Scrape the post with the audio, i.e., https://console.apify.com/view/runs/8V6LCcb0a2jqHWfdu

  2. Use musicInfo.audio_id to compose a direct audio URL (https://www.instagram.com/reels/audio/1363185204928041/ for the post above) and scrape it again https://console.apify.com/view/runs/4UKLOm2miVEhCfKqb

As you can see https://api.apify.com/v2/datasets/i8QtEzbCSbN0lkuCr/items?clean=true&format=json there is no transcript, not in metadata, nor audio items. If you can find a working sample when the transcript is revealed in the public IG post or reels, we will be happy to add such details to the actor's output.

I will close the issue now, but if there is anything else we can help with, please let us know.

RC

robert.castagna

6 days ago

Hi Alexey, this method was working for a while however now, the second scrape with the direct audio URL is referencing a different post than the one from the first scrape that gave the musicInfo.audio_id. Ex:

  1. scrape post with audio, https://console.apify.com/actors/shu8hvrXbJbY3Eb9W/runs/mflhN3w9PodI6N71R#output (musicInfo.audio_id:'1238404804144710', musicInfo.uses_original_audio:true)

  2. use audio_id to get direct audio url ( "https://www.instagram.com/reels/audio/1238404804144710/" ) and scrape again: https://console.apify.com/actors/shu8hvrXbJbY3Eb9W/runs/PpSj6nPA4Iyqfu889#output gives a different post.

Is this behavior expected in this case? It's happened to me on more than one occasion today.

alexey avatar

@robert.castagna Audio can be used in multiple reels, as you can see in the browser https://www.instagram.com/reels/audio/1238404804144710/ attached to the many reels.

Still, it is the same audio, and the audio details are the same for all the reels with it.

RC

robert.castagna

6 days ago

Okay, I understand. Since the flag musicInfo.uses_original_audio:true is not representative of a unique audio, is there a way I could identify a unique or original audio only used in that post?

alexey avatar

As I understand, "original" is something uploaded by the profile owner and not detected as a duplicate of copyrighted content. If original content is reused across many reels, it is still original. I think for your case, it's enough to check in the audio details if only one reel is referenced (items).

Hope it helps.