Instagram Scraper API - Extract Posts, Media & Analytics

Pricing

from $0.00 / result

Extract Instagram posts, media, profiles & engagement data. Search by keywords, hashtags, or users. No API keys required. Results export in real time as JSON or CSV. Use it to monitor social trends, analyze content performance, track influencer data, and gather competitive intelligence. Fast, requests-based scraper with fallback extraction methods.

Rating

0.0

(0)

Developer

Brennan Crawford

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 3
  • Monthly active users: 2
  • Last modified: 2 days ago

Extract Instagram posts, media, profiles, and engagement data without API keys. Search by keywords, hashtags, or users. This scraper uses a mobile-first approach with HTML parsing for maximum compatibility.

Features

  • Zero Authentication - No API keys or login required
  • Mobile-First Approach - Uses mobile user agents for better reliability
  • Real-Time Data - Live posts, videos, stories, and engagement metrics
  • Multiple Scrape Types - User posts, profiles, hashtag feeds
  • Keyword Search - Search posts by keywords in captions
  • Media Extraction - Images, videos, carousels with URLs
  • Engagement Analytics - Likes, comments, views, hashtags
  • Rate Limiting - Built-in 2-5 second delays between requests to reduce the chance of being blocked
  • Production Ready - Proper error handling and fallback methods

⚠️ Important Notes

Instagram's Anti-Bot Protection

Instagram has very strict rate limiting and anti-bot measures:

  • Rate Limited: Requests may be temporarily blocked
  • Login Redirects: High-volume requests trigger login requirements
  • Production Use: Requires specialized infrastructure (proxies, rotation)

Current Status

  • Architecture: Production-ready with proper error handling
  • Data Extraction: Works when Instagram allows access
  • Rate Limiting: Returns an empty array when Instagram blocks a request (intended behavior, not a failure)
  • Enterprise Ready: Can be upgraded with proxy infrastructure

Use Cases

  • Social Media Monitoring - Track Instagram trends and viral content
  • Competitive Intelligence - Analyze competitor posts and engagement
  • Content Research - Find popular topics and hashtags in your niche
  • Influencer Analysis - Track influencer performance and audience engagement
  • Brand Monitoring - Monitor brand mentions and user-generated content
  • Market Research - Analyze consumer behavior and preferences
  • Data Journalism - Extract data for research and reporting

Input Parameters

  • scrapeType: Type of content to scrape (posts, profile, hashtag)
  • username: Instagram username to scrape (for posts/profile)
  • hashtag: Hashtag to scrape (without #)
  • keyword: Search posts by keyword in caption (e.g., "travel", "food", "fashion")
  • maxResults: Number of posts to extract (1-100)
  • minLikes: Filter posts with minimum likes
  • includeVideos: Whether to include video posts in results
  • includeComments: Whether to fetch comments for each post
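
A run input built from the parameters above might be checked like this before a run is started. This is only a sketch: the `validate_input` helper is hypothetical and not part of the actor, and the default values and the rule about which of `username` or `hashtag` is required are assumptions inferred from the parameter descriptions.

```python
ALLOWED_SCRAPE_TYPES = {"posts", "profile", "hashtag"}


def validate_input(run_input: dict) -> dict:
    """Return a normalized copy of run_input or raise ValueError.

    Hypothetical helper: the checks mirror the parameter list above;
    the defaults are assumptions, not documented actor behavior.
    """
    scrape_type = run_input.get("scrapeType", "posts")
    if scrape_type not in ALLOWED_SCRAPE_TYPES:
        raise ValueError(f"unsupported scrapeType: {scrape_type!r}")

    max_results = int(run_input.get("maxResults", 20))
    if not 1 <= max_results <= 100:
        raise ValueError("maxResults must be between 1 and 100")

    # The hashtag is passed without the leading '#'.
    hashtag = run_input.get("hashtag", "").lstrip("#")
    username = run_input.get("username", "").lstrip("@")

    if scrape_type == "hashtag" and not hashtag:
        raise ValueError("hashtag is required when scrapeType is 'hashtag'")
    if scrape_type in {"posts", "profile"} and not username:
        raise ValueError("username is required for posts/profile")

    return {
        "scrapeType": scrape_type,
        "username": username,
        "hashtag": hashtag,
        "keyword": run_input.get("keyword", ""),
        "maxResults": max_results,
        "minLikes": int(run_input.get("minLikes", 0)),
        "includeVideos": bool(run_input.get("includeVideos", True)),
        "includeComments": bool(run_input.get("includeComments", False)),
    }


example = validate_input(
    {"scrapeType": "hashtag", "hashtag": "#travel", "maxResults": 50, "minLikes": 100}
)
```

Here `example["hashtag"]` is normalized to `travel`, and a `maxResults` outside 1-100 raises `ValueError` instead of being passed on to the scraper.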

Output Fields

Each post includes:

  • shortcode: Post ID (the short code that appears in the post's URL)
  • caption: Post caption/text
  • username: Username of poster
  • display_url: Media URL (image/video)
  • is_video: Whether post is video
  • video_url: Direct video URL
  • video_view_count: Video view count
  • likes: Number of likes
  • comments: Number of comments
  • hashtags: Array of hashtags found
  • created_at: Timestamp
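
To make the record shape concrete, here is a made-up post using the fields above, with a hypothetical client-side filter that mirrors the `keyword`, `minLikes`, and `includeVideos` inputs. All values are invented, the ISO 8601 form of `created_at` is an assumption (the listing only says "Timestamp"), and the actor's own filtering may work differently.

```python
import re

HASHTAG_RE = re.compile(r"#(\w+)")

# Invented example record; field names follow the output list above.
sample_post = {
    "shortcode": "ABC123xyz",
    "caption": "Sunset over Lisbon #travel #sunset",
    "username": "example_user",
    "display_url": "https://example.com/media/abc123.jpg",
    "is_video": False,
    "video_url": None,
    "video_view_count": None,
    "likes": 1520,
    "comments": 48,
    "hashtags": ["travel", "sunset"],
    "created_at": "2024-01-15T18:30:00Z",  # format is an assumption
}


def extract_hashtags(caption: str) -> list[str]:
    """Collect hashtags from a caption, without the leading '#'."""
    return HASHTAG_RE.findall(caption or "")


def matches(post: dict, keyword: str = "", min_likes: int = 0,
            include_videos: bool = True) -> bool:
    """Apply the keyword, minimum-likes, and video filters to one post."""
    if post["likes"] < min_likes:
        return False
    if post["is_video"] and not include_videos:
        return False
    if keyword and keyword.lower() not in (post["caption"] or "").lower():
        return False
    return True


popular_travel = [
    p for p in [sample_post] if matches(p, keyword="lisbon", min_likes=1000)
]
```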

Technical Implementation

Mobile-First Strategy

  • User Agents: Rotates between iPhone and Android mobile agents
  • Rate Limiting: 2-5 second delays between requests
  • Headers: Mobile-optimized request headers
  • Fallback Methods: Multiple extraction techniques
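
The rotation and delay described above can be sketched as follows. The user agent strings, the header set, and the helper names are illustrative assumptions; the actor's actual pool and headers are not published in this listing.

```python
import random

# Illustrative mobile user agents (one iPhone, one Android); not the actor's pool.
MOBILE_USER_AGENTS = [
    "Mozilla/5.0 (iPhone; CPU iPhone OS 17_0 like Mac OS X) "
    "AppleWebKit/605.1.15 (KHTML, like Gecko) Version/17.0 Mobile/15E148 Safari/604.1",
    "Mozilla/5.0 (Linux; Android 13; Pixel 7) "
    "AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Mobile Safari/537.36",
]


def build_headers() -> dict:
    """Pick a random mobile user agent and pair it with basic browser headers."""
    return {
        "User-Agent": random.choice(MOBILE_USER_AGENTS),
        "Accept": "text/html,application/xhtml+xml",
        "Accept-Language": "en-US,en;q=0.9",
    }


def next_delay() -> float:
    """Return a random pause of 2-5 seconds to wait before the next request."""
    return random.uniform(2.0, 5.0)
```

A request loop would call `time.sleep(next_delay())` between fetches and pass `build_headers()` to the HTTP client on each request.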

Data Extraction Methods

  1. Primary: Extract from window._sharedData in HTML
  2. Secondary: Parse window.__additionalDataLoaded
  3. Fallback: Extract from JSON-LD script tags
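
The three-step order above can be sketched as a chain that returns the first source that parses. The marker names come from the list; the regular expressions assume a particular way the JSON is embedded in the page, which is illustrative and may not match what Instagram currently serves.

```python
import json
import re

# Tried in order: primary, secondary, fallback.
EXTRACTORS = [
    ("_sharedData", re.compile(
        r"window\._sharedData\s*=\s*(\{.*?\});</script>", re.DOTALL)),
    ("__additionalDataLoaded", re.compile(
        r"window\.__additionalDataLoaded\(\s*'[^']*'\s*,\s*(\{.*?\})\);</script>",
        re.DOTALL)),
    ("json_ld", re.compile(
        r'<script[^>]*type="application/ld\+json"[^>]*>(.*?)</script>',
        re.DOTALL)),
]


def _first_json(pattern: re.Pattern, html: str):
    """Return the decoded JSON captured by pattern, or None if absent or invalid."""
    match = pattern.search(html)
    if not match:
        return None
    try:
        return json.loads(match.group(1))
    except json.JSONDecodeError:
        return None


def extract_page_data(html: str):
    """Return (source_name, data) from the first method that succeeds."""
    for source, pattern in EXTRACTORS:
        data = _first_json(pattern, html)
        if data is not None:
            return source, data
    return None, None


html = '<script>window._sharedData = {"entry_data": {"ProfilePage": []}};</script>'
source, data = extract_page_data(html)
```

A decode error is treated the same as a missing marker, so a malformed primary payload falls through to the next method rather than aborting the run.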

Error Handling

  • Graceful Degradation: Returns empty array when blocked
  • Informative Logs: Clear messages about rate limiting
  • Retry Logic: Automatic retries with exponential backoff
  • Production Behavior: Proper error responses
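
A minimal sketch of the retry and degradation behavior listed above. `BlockedError`, `fetch_with_retries`, the attempt count, and the base delay are all hypothetical; the listing does not state the actor's actual retry settings.

```python
import logging
import time
from typing import Callable

logger = logging.getLogger("instagram_scraper_example")


class BlockedError(Exception):
    """Hypothetical signal for a rate limit or login redirect."""


def fetch_with_retries(fetch: Callable[[], list], max_attempts: int = 3,
                       base_delay: float = 1.0,
                       sleep: Callable[[float], None] = time.sleep) -> list:
    """Call fetch, retrying with exponential backoff while it is blocked.

    Returns an empty list if every attempt is blocked, instead of raising.
    """
    for attempt in range(max_attempts):
        try:
            return fetch()
        except BlockedError as exc:
            logger.warning("Blocked on attempt %d/%d: %s",
                           attempt + 1, max_attempts, exc)
            if attempt < max_attempts - 1:
                # Wait base_delay, then 2x, then 4x, and so on.
                sleep(base_delay * 2 ** attempt)
    return []
```

Passing `sleep` as a parameter keeps the function testable without real waiting; in a real run the default `time.sleep` applies.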

Performance

  • Speed: ~5-8 seconds per request (with rate limiting)
  • Reliability: Returns an empty result instead of failing when Instagram blocks a request
  • Cost: $0.005 per result (compute costs only)
  • Limits: 100 posts per run to prevent abuse

Production Upgrade Options

Option 1: Proxy Infrastructure

  • Residential Proxies: Rotate IP addresses
  • User Agent Rotation: Larger pool of mobile agents
  • Request Scheduling: Distributed timing
  • Success Rate: 80-95% with proper setup

Option 2: Enterprise Solution

  • Dedicated Infrastructure: Custom proxy networks
  • API Integration: Instagram Business API (requires approval)
  • Service Level: 99.9% uptime guarantee
  • Support: 24/7 technical support

Deployment

Built for Apify platform with:

  • Docker containerization
  • Automatic scaling
  • Built-in monitoring
  • Export to JSON/CSV

Pricing Model

Pay-per-result pricing:

  • Base Cost: $0.005 per post extracted
  • Infrastructure: Additional costs for production setup
  • Enterprise: Custom pricing for high-volume needs
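
As a worked example of the base cost, the sketch below estimates how many runs and how much spend a target number of posts implies, using the $0.005 price and the 100-post limit stated in this listing. The `estimate_cost` helper is hypothetical and ignores infrastructure and enterprise pricing.

```python
from decimal import Decimal

PRICE_PER_POST = Decimal("0.005")  # base cost per extracted post, in USD
MAX_POSTS_PER_RUN = 100            # per-run limit stated in this listing


def estimate_cost(posts_wanted: int) -> tuple[int, Decimal]:
    """Return (runs needed, base cost in USD) for a number of posts."""
    runs = -(-posts_wanted // MAX_POSTS_PER_RUN)  # ceiling division
    return runs, PRICE_PER_POST * posts_wanted


runs, cost = estimate_cost(250)  # 3 runs, 1.250 USD
```

A single full run of 100 posts therefore costs $0.50 at the base rate.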

The architecture is production-ready and handles Instagram's protection measures transparently.