Instagram Public Scraper
Pricing
from $10.00 / 1,000 scraped results
Instagram Public Scraper
Scrape public Instagram profiles, posts, and hashtags without login. Export clean data to CSV/JSON for marketing, research, and analytics.
Pricing
from $10.00 / 1,000 scraped results
Rating
0.0
(0)
Developer

CodePoetry
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
3 days ago
Last modified
Categories
Share
Instagram Public Profile & Post Scraper (No Login) πΈ
Scrape public Instagram profiles, posts, and hashtags without login and export clean, analysis-ready data to CSV or JSON.
This production-ready Apify Actor is designed for marketers, founders, researchers, and analysts who need reliable public Instagram data without dealing with authentication, cookies, or private APIs.
β οΈ PUBLIC DATA ONLY This Actor scrapes only publicly visible information. It does not log in to Instagram, access private profiles, stories, or use internal APIs.
π― Who This Is For
Perfect for:
- π Market & competitor research
- π£ Influencer discovery & analysis
- π§ Content & hashtag research
- π·οΈ Lead generation from public profiles
- π Academic & social media research
β¨ Key Features
- β No login required (public data only)
- π€ Profile scraping (followers, bio, verification, counts)
- πΌοΈ Post scraping (captions, hashtags, media, engagement)
- π Hashtag scraping (public posts)
- π¦ Clean output (CSV & JSON friendly)
- π‘οΈ Anti-blocking measures (sessions, delays, fingerprinting)
- π Proxy support (residential required for hashtags)
- π§ͺ Debug mode for development & troubleshooting
π¨ Limitations (Important)
| Capability | Supported | Notes |
|---|---|---|
| Public profiles | β Yes | Proxy optional |
| Hashtag scraping | β οΈ Yes | Residential proxies required |
| Private profiles | β No | Public data only |
| Stories / Reels (private) | β No | Requires login |
| Login / cookies | β No | By design |
β οΈ Hashtag scraping fails fast without residential proxies to prevent wasted compute.
π₯ Quick Start (30 seconds)
{"profiles": ["instagram", "natgeo"],"maxPostsPerSource": 10,"proxyConfiguration": {"useApifyProxy": true,"apifyProxyGroups": ["RESIDENTIAL"]}}
π What Data You Get
Profile Data
- Username, full name, biography
- Followers & following counts
- Total posts count
- Profile picture URL
- Verified status
- Profile URL
Post Data
- Post URL & ID
- Caption text & extracted hashtags
- Likes & comments counts
- Media type (image / video / carousel)
- Media URLs
- Timestamp (ISO-8601)
- Owner username
π€ Output Schema (Consistent)
All dataset items β including errors β follow one schema:
{"sourceType": "profile | hashtag","sourceValue": "string","scrapedAt": "ISO-8601","data": {"status": "success | error","type": "profile | post","message": "error message (if error)","...": "scraped fields (if success)"}}
This guarantees:
- CSV export compatibility
- BigQuery / Sheets compatibility
- Stable downstream processing
π Proxy Requirements
| Scraping Type | Proxy |
|---|---|
| Profiles | Optional (recommended) |
| Hashtags | Residential required |
Recommended (Hashtags)
{"proxyConfiguration": {"useApifyProxy": true,"apifyProxyGroups": ["RESIDENTIAL"]}}
Datacenter proxies may work for profiles only but usually fail for hashtags.
βοΈ Technical Overview
- Runtime: Node.js 18+
- Crawler: PlaywrightCrawler
- Framework: Apify SDK v3 + Crawlee
- Browser: Headless Chromium
Anti-Blocking Strategy
- Session rotation
- Randomized delays
- Human-like scrolling
- Browser fingerprint randomization
- Automatic retries
βοΈ Legal & Compliance
This Actor:
- Scrapes only public data visible to non-logged-in users
- Does not bypass paywalls, logins, or CAPTCHAs
- Does not store cookies or sessions
You are responsible for:
- Complying with Instagramβs Terms of Service
- Complying with GDPR, CCPA, and local data laws
- Using scraped data ethically and legally
π Troubleshooting
Login wall detected
- Use residential proxies (mandatory for hashtags)
- Reduce
maxPostsPerSource - Increase delays
Actor fails at startup
- Happens when hashtags are requested without residential proxies
Slow performance
- Intentional to avoid blocking
- Increase concurrency at your own risk
π Project Structure
src/βββ main.jsβββ input-schema.jsonβββ helpers/βββ extractProfile.jsβββ extractPosts.jsβββ scrollPage.jsβββ utils.js
π Ready to Use
Push and run:
$apify push
π License
ISC License
π€ Support
Questions, issues, or feature requests? Open an issue in the repository β feedback is welcome.