Instagram Post Scraper
Pricing
from $0.50 / 1,000 results
Instagram Post Scraper
Scrape public Instagram posts, reels, IGTV and carousel posts from direct URLs with no login, no cookies, no browser required. Supply a list of Instagram links and receive a structured dataset with author details, caption, media URLs, engagement counts, publish date, music, location, and more.
Pricing
from $0.50 / 1,000 results
Rating
0.0
(0)
Developer
Crawler Bros
Maintained by CommunityActor stats
0
Bookmarked
7
Total users
5
Monthly active users
a day ago
Last modified
Categories
Share
Scrape public Instagram posts, reels, IGTV and carousel posts from direct URLs — no login, no cookies, no browser required. Supply a list of Instagram links (/p/, /reel/, /reels/, /tv/, or the canonical username/p/shortcode format) and receive a structured dataset with author details, caption, media URLs, engagement counts, publish date, music, location, and more. Handles batches of 10,000 URLs per run with per-IP proxy rotation that keeps pace fast even under rate limiting.
What this actor does
- Scrape posts by URL — accepts any mix of
/p/,/reel/,/reels/,/tv/, and canonicalusername/p/shortcodelinks in a single run - Full media extraction — returns direct CDN URLs for every photo, video, reel and carousel item, including per-item resolution and duration
- Rich author data — username, display name, verified status, follower count, post count, and profile picture URL
- Engagement metrics — like count, comment count, play count, video duration, and whether likes are publicly hidden
- Captions, hashtags, and mentions — caption text parsed with hashtags and @mentions extracted as separate arrays
- Location and music — tagged location (name, coordinates) and reel music (artist, title, audio ID) when present
- Status tracking — every input URL produces a record; inaccessible, deleted, age-restricted, and invalid URLs each carry a clear
statusvalue so no URL is silently lost - Scale without blocks — per-IP cooldown with exponential back-off, automatic slot recycling, and UUID-prefixed sessions that guarantee fresh IPs on every run
Output per post record
Always present
status— scrape outcome:success,not_found,invalid_url,error, orblocked(see Status values section below)post_url— full Instagram post URL (https://www.instagram.com/p/{shortcode}/)shortcode— the post's unique shortcode identifierscraped_at— ISO 8601 timestamp of when the record was scrapedhashtags— array of hashtags found in the caption (empty array[]when none)mentions— array of @mentioned usernames in the caption (empty array when none)tagged_users— array of usernames tagged in the post media (empty array when none)collaborators— array of co-author usernames (empty array when none)caption— full caption text (empty string""when there is no caption)likes_hidden—trueif the author has hidden like counts,falseotherwisecomments_disabled—trueif comments are turned off on the post,falseotherwise
Present on successfully scraped posts (status: success)
post_id— Instagram's internal numeric post IDusername— author's Instagram usernamefull_name— author's display nameowner_id— author's numeric account IDprofile_url— full URL to the author's Instagram profileis_verified—trueif the author has a verified badgeowner_profile_pic_url— CDN URL of the author's profile pictureowner_is_business—trueif the account is a business accountowner_followers— author's follower countowner_post_count— total number of posts on the author's profilepub_date— ISO 8601 publish timestampmedia_type— one ofPhoto,Video,Reel,IGTV, orCarouselmedia_count— number of media items (always1except for carousels)media_urls— array of direct CDN URLs for each media itemthumbnail_url— CDN URL of the post thumbnail imagelike_count— number of likes (nullwhenlikes_hiddenistrue)comment_count— number of commentsmedia_items— array of per-item objects, each withindex,type,url,width,height. Single-item array for photos, videos, reels, and IGTV; multi-item array for carousels. Video items also includedurationandhas_audiodimensions—{width, height}object of the primary media item
Present only for video, reel, and IGTV posts
play_count— total view / play countvideo_duration— duration in seconds (float)has_audio—trueif the video has an audio track
Present only when location is tagged
location— object withname,id,slug,lat,lng
Present only for reels with music attribution
music— object withartist,title,audio_id,is_trending
Present only for paid partnerships
sponsor_tags— array of sponsor usernamesis_paid_partnership—trueif the post is a paid partnership
Present only for posts with accessibility captions
accessibility_caption— auto-generated accessibility description of the image
Status values
Every input URL — whether valid or not — produces exactly one output record with a status field:
| Status | Meaning |
|---|---|
success | Post scraped successfully; all fields populated |
not_found | Post is deleted, made private, or requires login to view (includes age-restricted content) |
invalid_url | The URL is not a recognisable Instagram post link |
error | Unexpected failure after all retry attempts |
blocked | All proxy IPs were exhausted before the post could be fetched |
Age-restricted content
Instagram age-gates some posts (alcohol brands, adult content) at the API level. Without a logged-in session the anonymous endpoint returns a not-found response for these posts — they appear with status: "not_found" rather than raising an error. This is a server-side restriction; no workaround exists without authentication.
Input
| Field | Type | Required | Description |
|---|---|---|---|
post_urls | array | yes | Instagram post, reel, or IGTV URLs. Accepts /p/, /reel/, /reels/, /tv/, and username/p/shortcode formats. Duplicates are silently de-duplicated. Up to 10,000 URLs per run |
Example: scrape a single post
{"post_urls": ["https://www.instagram.com/p/DUtk38cj2TA/"]}
Example: mixed batch of posts, reels, and canonical URLs
{"post_urls": ["https://www.instagram.com/p/DUtk38cj2TA/","https://www.instagram.com/reel/C9uzjsSpHJC/","https://www.instagram.com/leomessi/p/DZifppZj_cB/","https://www.instagram.com/tv/CQa7HaGlRPD/"]}
Example: large batch
{"post_urls": ["https://www.instagram.com/p/DUtk38cj2TA/","... (up to 10,000 URLs)"]}
Example output
{"status": "success","post_url": "https://www.instagram.com/p/DUtk38cj2TA/","shortcode": "DUtk38cj2TA","post_id": "3672810928374650123","username": "natgeo","full_name": "National Geographic","owner_id": "787132","profile_url": "https://www.instagram.com/natgeo/","is_verified": true,"owner_profile_pic_url": "https://scontent.cdninstagram.com/v/...","owner_is_business": false,"owner_followers": 284000000,"owner_post_count": 32400,"caption": "The ancient forest at dawn. #nature #wildlife #photography","hashtags": ["nature", "wildlife", "photography"],"mentions": [],"pub_date": "2025-11-12T08:30:00+00:00","media_type": "Photo","media_count": 1,"media_urls": ["https://scontent.cdninstagram.com/v/...jpg"],"media_items": [{ "index": 1, "type": "Photo", "url": "https://scontent.cdninstagram.com/v/...jpg", "width": 1080, "height": 1350 }],"thumbnail_url": "https://scontent.cdninstagram.com/v/...jpg","dimensions": { "width": 1080, "height": 1350 },"like_count": 482910,"comment_count": 3204,"likes_hidden": false,"comments_disabled": false,"tagged_users": [],"collaborators": [],"is_paid_partnership": false,"scraped_at": "2025-11-12T09:14:33.421000+00:00"}
Records for inaccessible or invalid URLs look like this:
{ "status": "not_found", "post_url": "https://www.instagram.com/p/DELETED123/", "shortcode": "DELETED123", "scraped_at": "..." }{ "status": "invalid_url", "post_url": "https://www.google.com/", "scraped_at": "..." }
Age-restricted and private content
Instagram does not expose age-gated content through its anonymous API. Posts from accounts that require age verification (alcohol brands, adult content) return status: "not_found" rather than producing a data record. Private account posts behave the same way. This is a server-side restriction enforced by Instagram and cannot be bypassed without authenticated session cookies.
| Content type | status | Data returned |
|---|---|---|
| Public post | success | All fields |
| Deleted post | not_found | post_url, shortcode, scraped_at |
| Private post | not_found | post_url, shortcode, scraped_at |
| Age-restricted post | not_found | post_url, shortcode, scraped_at |
| Invalid URL format | invalid_url | post_url, scraped_at |
Use cases
- Competitor content analysis — track post frequency, engagement rates, media types and captions across competitor accounts to benchmark your own content strategy
- Influencer vetting — verify follower counts, engagement ratios and posting history before signing a partnership deal
- Brand monitoring — collect all posts tagged with your brand name or product from any source URL and feed them into a monitoring dashboard
- Social media archiving — create a permanent structured archive of a campaign's posts before content is deleted or made private
- Research and journalism — gather public post metadata at scale for academic studies on misinformation, trends, or audience behaviour
- Content repurposing — extract caption text and media URLs from your own posts to migrate or republish content across platforms
- Hashtag performance tracking — scrape posts sharing a known hashtag to measure reach and identify top-performing formats
- E-commerce product research — collect posts from brand accounts to analyse product launches, promotions and customer response
FAQ
Do I need an Instagram account or cookies to use this actor? No. This actor scrapes public posts anonymously without any login, cookies, or account credentials. No authentication is required for publicly accessible content.
Why do some posts return status: "not_found"?
A not_found status means Instagram's API returned no data for that URL. This happens when a post has been deleted, made private, or when the post is from an age-restricted account (such as alcohol brands or accounts requiring age verification). These are server-side restrictions that cannot be bypassed without an authenticated session.
Why is like_count missing on some posts?
When an author hides their like counts, Instagram does not return the figure in the API response. The record will have likes_hidden: true and like_count will be absent from the output.
How many URLs can I submit in one run? Up to 10,000 URLs per run. The actor automatically uses residential proxies with per-IP session rotation to handle large batches reliably — no proxy configuration is required on your end.
How fresh is the data? Data is scraped live at the time of the run. Post metadata reflects the current state of Instagram at the moment of scraping — including the latest like and comment counts.
What URL formats are supported?
The actor accepts /p/, /reel/, /reels/, and /tv/ links, as well as canonical username/p/shortcode URLs (e.g. https://www.instagram.com/natgeo/p/DUtk38cj2TA/). Duplicate URLs pointing to the same post are automatically de-duplicated.
Is this actor affiliated with Instagram or Meta? No. This is an independent third-party tool that automates interaction with the public Instagram website. It is not endorsed by or affiliated with Meta Platforms, Inc.
Other Instagram Scrapers
Want to get other data from Instagram? Check out our complete suite of Instagram scrapers:
| Actor | Description |
|---|---|
| Instagram Comment Scraper | Scrape comments from any Instagram post or reel |
| Instagram Profile Scraper | Extract profile data, bio, follower counts, and more |
| Instagram Followers & Following Scraper | Scrape followers and following lists from any profile |
| Instagram Tagged Posts Scraper | Collect posts where a user has been tagged |
| Instagram Hashtag Scraper | Scrape posts and profiles by hashtag |
| Instagram Story Downloader | Download stories from Instagram profiles |
| Instagram Downloader API | Download photos, videos, and reels from Instagram |
| Instagram Keyword Scraper | Search and scrape posts by keyword |
| Instagram Keyword Search Scraper | Search Instagram accounts and posts by keyword |
| Instagram Transcript Scraper | Extract transcripts from Instagram video content |