Instagram Post Details Scraper
Pricing
$19.99/month + usage
Instagram Post Details Scraper
๐ธ Instagram Post Details Scraper extracts public post data โ captions, hashtags, mentions, likes, comments, views, location, publish date, media URLs & engagement metrics. ๐ Perfect for social analytics, competitor research & content strategy. CSV/JSON export.
Pricing
$19.99/month + usage
Rating
0.0
(0)
Developer
ScrapeBase
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
3 days ago
Last modified
Categories
Share
Instagram Post Details Scraper
The Instagram Post Details Scraper is a purpose-built Instagram post data scraper that extracts structured metadata from public post, reel, and IGTV URLs at scale. It solves the repetitive, error-prone task of manually collecting post details by acting as an Instagram post details extractor and Instagram post metadata scraper for marketers, developers, data analysts, and researchers. With robust parsing and smart proxy fallback, this IG post details scraper lets you automate social analytics, competitor research, and content strategy workflows at scale.
What data / output can you get?
Below are examples of the structured fields this Instagram post info scraper tool returns for each input URL. Results can be exported to CSV and JSON from the Apify dataset.
| Data field | Description | Example value |
|---|---|---|
| url | The processed Instagram post/reel/tv URL | https://www.instagram.com/p/ABC123/ |
| post_id | Instagram media ID (string) | "314159265358979" |
| short_code | Post shortcode derived from the URL | "ABC123" |
| username | Username associated with the media (or first tagged user) | "natgeo" |
| post_date | ISO-8601 UTC timestamp of publish time | "2024-01-05T12:34:56.000Z" |
| like_count | Total like count on the post | 18342 |
| comment_count | Total comment count | 427 |
| video_view_count | Video view count (if applicable) | 120045 |
| display_url | Original media image URL (largest available) | "https://scontent.cdninstagram.com/media/image.jpg" |
| video_url | Original video file URL (if applicable) | "https://scontent.cdninstagram.com/media/video.mp4" |
| caption | Post caption text | "Exploring the wild. #nature" |
| thumbnail_url | Thumbnail image URL (small candidate) | "https://scontent.cdninstagram.com/media/thumb.jpg" |
Bonus fields also included in each record:
- alt (accessibility caption), location (object), video_duration, video_play_count, sponsor_user (array), upcoming_event, and engagement/metadata booleans such as is_affiliate, is_paid_partnership, is_ad, is_insights_as_brand, is_caption_edited, has_ranked_comments.
- Nested objects: owner (with follower/following/post counts when available) and latest_comments (array of recent comments).
- Exports: CSV and JSON supported from the Apify dataset. A key-value store item named OUTPUT also provides a URL โ postDetails JSON mapping per run.
Key features
-
๐ Robust extraction via embedded payloads
Uses embedded data-sjs JSON payloads to reliably collect post metadata, media URLs, engagement counts, and more โ a resilient Instagram post data extractor approach. -
๐ค Owner enrichment (when available)
Calls the web_profile_info API to enrich the owner object with edge_owner_to_timeline_media.count, edge_followed_by.count, and edge_following.count for deeper Instagram post insights scraper use cases. -
๐ง Smart proxy fallback & resilience
Starts direct to Instagram, then automatically escalates to datacenter, then residential proxies on blocks (401/403/429), with retries and sticky residential for the remainder โ keeping your Instagram post details extractor reliable over long runs. -
๐ฆ Bulk-ready input
Paste many post/reel/tv URLs at once. The actor de-duplicates targets and processes them concurrently for scalable Instagram post URL scraper workflows. -
๐๏ธ Live-saving & clean outputs
Each processed item is immediately pushed to the Apify dataset with a one-row-per-URL shape, and mirrored to the OUTPUT key-value store as a URL โ postDetails mapping. -
๐ No login required
Designed for publicly available data. No cookies, sessions, or account access needed โ ideal for a no-code Instagram post scraper setup. -
๐งฉ Developer-friendly on Apify
Built with the Apify Python SDK for easy automation and API-based runs. Integrate results into pipelines or use from your Instagram post scraper Python scripts via the Apify API. -
โ๏ธ Built-in retry logic
Graceful handling of transient errors with exponential backoff and clear logs that summarize collected metrics (post_id, short_code, likes, comments, date).
How to use Instagram Post Details Scraper - step by step
-
Sign in to Apify
Create or log into your Apify account to access the actor. -
Open the actor
Find โInstagram Post Details Scraperโ in the Apify Store and click Try it. -
Add input URLs
In the urls field, paste one or many Instagram post, reel, or tv URLs (one per line). You can also include shortcode strings in this list โ theyโll be normalized automatically to post URLs. -
(Optional) Set proxyConfiguration
Leave it empty to start without a proxy, or configure the Apify Proxy. The actor automatically escalates to datacenter then residential proxies if needed. -
Start the run
Click Start. The run will list your targets and begin processing, with clear logs showing progress and proxy changes when applicable. -
Monitor results in real time
Each successful result is saved immediately to the dataset (one row per URL). Errors, if any, are also recorded per URL. -
Download your data
Go to the dataset tab to export results in CSV or JSON. A full URL โ postDetails JSON mapping is also saved under the key-value store item named OUTPUT.
Pro Tip: Automate end-to-end by triggering the actor via the Apify API and piping the dataset to your analytics stack. Perfect for recurring social analytics and competitor monitoring.
Use cases
| Use case name | Description |
|---|---|
| Social analytics for marketing teams | Aggregate likes, comments, views, and captions to measure campaign performance across profiles using an Instagram post data scraper. |
| Competitor content benchmarking | Track engagement per post to benchmark competitors and optimize content strategy with an Instagram post details extractor. |
| Content asset management | Collect display_url/video_url and alt text for archiving or reuse in CMS pipelines using an Instagram media and caption scraper. |
| Research & academic studies | Build datasets of public posts (timestamps, captions, owner metrics) for time-series or NLP analysis in a compliant manner. |
| API-driven enrichment | Run the actor via API and merge outputs (owner counts, engagement metrics) into your data lake as part of an Instagram post insights scraper workflow. |
| Comment monitoring | Capture latest_comments to keep an eye on recent audience reactions for moderation or qualitative analysis. |
| No-code dashboards | Combine the dataset CSV/JSON export with BI tools to create a no-code Instagram post scraper dashboard for ongoing reporting. |
Why choose Instagram Post Details Scraper?
Built for precision, automation, and reliability, this Instagram post metadata scraper focuses on clean, structured outputs for production workflows.
- ๐ฏ Accurate by design: Extracts embedded data-sjs payloads for consistent field-level accuracy.
- โก Scales to batches: Processes many URLs concurrently with de-duplication and incremental saves.
- ๐งฉ Developer access: Apify-powered actor for API automation and integration with Python-based data pipelines.
- ๐ Public-data only: No login or cookies required for this Instagram post info scraper tool.
- ๐ง Smart proxy strategy: Automatic direct โ datacenter โ residential fallback for resilient operations.
- ๐พ Clean exports: Download results from the dataset in CSV or JSON without post-processing.
- โ Production-ready: Clear logs, error capture per URL, and a KV โOUTPUTโ mapping for easy downstream use.
In short, itโs a dependable IG post details scraper compared to brittle browser extensions or adโhoc scripts โ purpose-built for automated, repeatable workflows.
Is it legal / ethical to use Instagram Post Details Scraper?
Yes, when used responsibly. This actor is designed to extract publicly available Instagram post data only and does not access private or authenticated content.
Guidelines for compliant use:
- Collect data that is publicly visible on Instagram posts, reels, or IGTV pages.
- Avoid personal or private data; focus on post-level metadata and engagement figures.
- Ensure your use complies with applicable laws (e.g., GDPR, CCPA) and Instagramโs terms.
- Use the data for analysis, research, or reporting โ not for spam or misuse.
- Consult your legal team for edge cases or jurisdiction-specific requirements.
Input parameters & output format
Example JSON input
{"urls": ["https://www.instagram.com/p/Bq-mIJWgUAU/","https://www.instagram.com/reel/CgzOZz5DWaQ/","https://www.instagram.com/tv/CgzOZz5DWaQ"],"proxyConfiguration": {"useApifyProxy": false}}
Parameters
| Field | Type | Description | Default | Required |
|---|---|---|---|---|
| urls | array | Paste Instagram post/reel/TV URLs here. One per line โ bulk input supported. Example: https://www.instagram.com/p/ABC123/ or https://www.instagram.com/reel/XYZ456/ | โ | Yes |
| proxyConfiguration | object | Optional. Default: No proxy โ requests go direct to Instagram. If blocked: fallback to Datacenter, then Residential. If residential fails, retry 3ร. Once on residential, stick with it for all remaining requests. All proxy changes are logged. | {"useApifyProxy": false} | No |
Tip: The urls list accepts both full Instagram links and shortcode strings โ shortcodes are normalized automatically to a standard /p/
Example JSON output (successful row)
{"url": "https://www.instagram.com/p/ABC123/","post_id": "314159265358979","short_code": "ABC123","username": "natgeo","thumbnail_url": "https://scontent.cdninstagram.com/media/thumb.jpg","is_affiliate": false,"is_paid_partnership": false,"is_ad": false,"is_insights_as_brand": false,"is_caption_edited": false,"has_ranked_comments": true,"like_count": 18342,"comment_count": 427,"video_view_count": 120045,"video_play_count": 121003,"post_date": "2024-01-05T12:34:56.000Z","display_url": "https://scontent.cdninstagram.com/media/image.jpg","alt": "A mountain landscape at sunset.","location": {"name": "Yosemite National Park"},"caption": "Exploring the wild. #nature","video_duration": 12.34,"video_url": "https://scontent.cdninstagram.com/media/video.mp4","sponsor_user": [],"upcoming_event": null,"owner": {"id": "987654321","username": "natgeo","is_verified": true,"profile_pic_url": "https://scontent.cdninstagram.com/profile.jpg","blocked_by_viewer": false,"restricted_by_viewer": null,"followed_by_viewer": false,"full_name": "National Geographic","has_blocked_viewer": false,"is_embeds_disabled": false,"is_private": false,"is_unpublished": false,"requested_by_viewer": false,"pass_tiering_recommendation": null,"edge_owner_to_timeline_media": { "count": 20500 },"edge_followed_by": { "count": 28600000 },"edge_following": { "count": 120 }},"latest_comments": [{"id": "c1","text": "Amazing shot!","created_at": 1714938000,"did_report_as_spam": false,"owner": {"id": "u1","is_verified": false,"profile_pic_url": "https://scontent.cdninstagram.com/u1.jpg","username": "traveler_one"},"viewer_has_liked": false,"edge_liked_by": { "count": 4 },"is_restricted_pending": false,"edge_threaded_comments": {"count": 0,"page_info": { "has_next_page": false, "end_cursor": null },"edges": []}},{"id": "c2","text": "Where is this?","created_at": 1714939000,"did_report_as_spam": false,"owner": {"id": "u2","is_verified": false,"profile_pic_url": "https://scontent.cdninstagram.com/u2.jpg","username": "photo_fan"},"viewer_has_liked": false,"edge_liked_by": { "count": 2 },"is_restricted_pending": false,"edge_threaded_comments": {"count": 0,"page_info": { "has_next_page": false, "end_cursor": null },"edges": []}}]}
Example JSON output (failed row)
{"url": "https://www.instagram.com/p/DEF456/","error": "Media item not found in HTML"}
Additionally, an OUTPUT key-value store item is maintained during the run with a URL โ postDetails mapping:
{"https://www.instagram.com/p/ABC123/": { "...": "postDetails object as above" },"https://www.instagram.com/p/DEF456/": null}
Fields such as video_url, video_duration, alt, location, and owner counts may be null or missing when not available on the public page.
FAQ
Do I need to log in to use this scraper?
No. The Instagram Post Details Scraper works on publicly available posts and does not require login or cookies. Itโs designed as a safe, no-code Instagram post scraper for public content only.
What types of data does it extract from posts?
It returns post_id, short_code, username, caption, display_url/video_url, like_count, comment_count, video_view_count, post_date, and more. It also enriches the owner object with follower/following/post counts when available and includes latest_comments for recent audience reactions.
Can I scrape by URL or shortcode?
Both. Paste full post/reel/tv URLs into urls, or include shortcode strings in the same list โ theyโll be normalized automatically to standard post URLs so you can scrape Instagram post details by URL or shortcode.
Does it scrape comments?
It captures latest_comments sourced from preview or the comments connection payload when available. It is not a full historical comments crawler but is effective as an Instagram post comments scraper for recent interactions.
How many posts can I process in one run?
You can paste many URLs into the urls field. The actor de-duplicates inputs and processes them concurrently with built-in retries, making it suitable as a scalable Instagram post data extractor for bulk runs.
What export formats are supported?
CSV and JSON exports are available via the Apify dataset for every run. You can also consume the OUTPUT key-value store mapping directly in downstream workflows.
Is there a free trial?
Yes. The actor includes trial minutes (120) so you can test the workflow before subscribing. Refer to the actorโs Apify Store listing for current pricing.
Can I integrate this with Python or an API?
Yes. As an Apify actor, it can be triggered via the Apify API and integrated with your Instagram post scraper Python scripts or automation pipelines. Results are accessible through the dataset or the OUTPUT key-value store item.
How does proxy handling work?
The scraper starts direct to Instagram, then automatically falls back to datacenter and finally residential proxies if blocked (401/403/429), with retries on residential and sticky usage for the rest of the run. All proxy changes are logged for transparency.
Closing CTA / Final thoughts
The Instagram Post Details Scraper is built for accurate, automated extraction of public Instagram post-level data at scale. It delivers clean JSON records with captions, media URLs, engagement metrics, owner enrichment, and recent comments โ ready for analytics and research.
Whether youโre a marketer, developer, analyst, or researcher, use this Instagram post data scraper to power competitor tracking, content strategy, and reporting. Run it on Apify, export CSV/JSON, or trigger it via API from your Instagram post scraper Python workflows. Start extracting smarter, structured post insights today.