Social Media Intelligence Scraper avatar
Social Media Intelligence Scraper
Under maintenance

Pricing

Pay per event

Go to Apify Store
Social Media Intelligence Scraper

Social Media Intelligence Scraper

Under maintenance

Account Overview, All Posts, Audience Metrics, Caption Analysis, Collaboration Strategy, Competitive Intel, Content Breakdown, Engagement Analysis, Hashtag Strategy, Location Strategy, Platform, Posting Patterns, Profile URL, Scraped At, SEO Optimization, Top Performing Posts

Pricing

Pay per event

Rating

5.0

(2)

Developer

personal

personal

Maintained by Community

Actor stats

0

Bookmarked

7

Total users

4

Monthly active users

23 days ago

Last modified

Share

Social Media Intelligence Scraper ๐Ÿš€ A powerful, self-contained social media scraper that extracts deep intelligence from Instagram, TikTok, LinkedIn, and Facebook profiles without relying on external actors or APIs.

โœจ Features ๐ŸŽฏ Comprehensive Data Extraction Account Overview: Username, bio, verification status, followers, following Audience Metrics: Follower counts, engagement ratios, growth indicators Content Analysis: Post types, performance metrics, best/worst performers Hashtag Intelligence: Top hashtags, usage frequency, diversity scores Posting Patterns: Best days/hours, frequency, consistency analysis Engagement Analytics: Likes, comments, shares, views, engagement rates Competitive Intelligence: Strengths, weaknesses, content themes, recommendations ๐Ÿ”ฅ Self-Contained Architecture โœ… No external actor dependencies โœ… Direct web scraping using Playwright โœ… Works independently without third-party APIs โœ… Fully customizable and maintainable ๐Ÿ“Š Supported Platforms Instagram - Full profile + post analysis TikTok - Videos, sounds, engagement metrics LinkedIn - Professional profile data (limited) Facebook - Page analytics (limited) ๐Ÿš€ Quick Start Input Configuration Provide at least one social media profile URL:

json { "instagramUrl": "https://www.instagram.com/cristiano/", "tiktokUrl": "https://www.tiktok.com/@khaby.lame", "linkedinUrl": "https://www.linkedin.com/in/williamhgates/", "facebookUrl": "https://www.facebook.com/nike" } Running the Actor On Apify Platform: Create new actor Upload the code Configure input Run! Locally: bash

Install dependencies

npm install

Set environment variable

export APIFY_TOKEN=your_token_here

Run locally

apify run ๐Ÿ“ฅ Input Schema Field Type Required Description Example instagramUrl string No* Instagram profile URL https://www.instagram.com/username/ tiktokUrl string No* TikTok profile URL https://www.tiktok.com/@username linkedinUrl string No* LinkedIn profile URL https://www.linkedin.com/in/username/ facebookUrl string No* Facebook page URL https://www.facebook.com/pagename *At least one URL is required

๐Ÿ“ค Output Schema Instagram Output json { "platform": "Instagram", "profileUrl": "https://www.instagram.com/cristiano/", "scrapedAt": "2025-10-27T12:00:00.000Z", "accountOverview": { "username": "cristiano", "displayName": "Cristiano Ronaldo", "followers": 500000000, "following": 500, "totalPosts": 3500 }, "engagementAnalysis": { "averageLikesPerPost": 5000000, "averageCommentsPerPost": 50000, "engagementRate": "3.5%", "totalEngagement": 17500000000 }, "hashtagStrategy": { "totalHashtagsUsed": 150, "topPerformingHashtags": [ {"hashtag": "football", "timesUsed": 45}, {"hashtag": "cr7", "timesUsed": 38} ] }, "postingPatterns": { "mostActiveDay": "Monday", "bestPostingHour": "18:00", "postingFrequency": "5.2 posts/week - Regular poster" }, "topPerformers": { "top3Posts": [...] }, "allPosts": [...], "competitiveIntel": { "keyStrengths": [ "Outstanding engagement - Highly engaged audience", "Excellent hashtag strategy" ], "recommendations": [ "Keep up the great work!" ] } } TikTok Output json { "platform": "TikTok", "profileUrl": "https://www.tiktok.com/@khaby.lame", "accountOverview": { "username": "khaby.lame", "followers": 150000000, "totalVideos": 500 }, "performanceMetrics": { "averageViews": 50000000, "averageLikes": 5000000, "engagementRate": "8.5%", "viralityScore": "3.2" }, "soundStrategy": { "topSounds": [ {"sound": "Original Sound", "timesUsed": 300} ] }, "allPosts": [...], "competitiveIntel": {...} } ๐ŸŽฏ Key Metrics Explained Engagement Rate (Total Engagement / Total Followers) ร— 100 Excellent: > 5% Good: 2-5% Average: 1-2% Poor: < 1% Performance Score Calculated based on:

Engagement rate (60% weight) Reach (40% weight) Virality Score (TikTok) Average Views / Followers High: > 2.0 (viral content) Medium: 1.0-2.0 (good reach) Growing: < 1.0 (building audience) Consistency Score High: 20+ posts analyzed Medium: 10-20 posts Low: < 10 posts ๐Ÿ”ง Technical Details Architecture Language: Node.js (ES Modules) Browser Automation: Playwright Framework: Apify SDK + Crawlee Scraping Method: Direct DOM parsing + API extraction How It Works URL Validation: Validates and cleans input URLs Browser Launch: Opens headless Chrome via Playwright Page Navigation: Visits profile pages with realistic user agents Data Extraction: Parses JSON-LD structured data Extracts GraphQL/SIGI_STATE data Scrapes visible DOM elements (fallback) Analysis: Processes raw data through intelligence algorithms Output: Returns comprehensive analytics JSON Rate Limiting 5-second delay between multiple profile scrapes Realistic user agent rotation Headless browser with stealth plugins โš ๏ธ Limitations Instagram โœ… Public profiles fully supported โŒ Private profiles: Basic info only โœ… Up to 50 recent posts analyzed โŒ Stories/Highlights not included TikTok โœ… Public profiles fully supported โŒ Private profiles: Basic info only โœ… Up to 50 recent videos analyzed โš ๏ธ Rate limiting possible on high volume LinkedIn โš ๏ธ Limited without authentication โœ… Basic profile info (name, headline, location) โŒ Posts require login ๐Ÿ’ก Recommendation: Use LinkedIn data export Facebook โš ๏ธ Limited without authentication โœ… Page name, likes, followers โŒ Posts require login ๐Ÿ’ก Recommendation: Use Facebook Business tools ๐Ÿ›ก๏ธ Best Practices

  1. Respect Rate Limits Don't scrape too many profiles at once Use delays between requests Monitor for IP blocks
  2. Handle Errors Gracefully Check error field in output Implement retry logic Log failures for debugging
  3. Data Privacy Only scrape public profiles Don't store personal data unnecessarily Comply with platform ToS
  4. Optimize Performance Scrape during off-peak hours Use residential proxies if needed Cache results when possible ๐Ÿ“Š Use Cases
  5. Influencer Marketing Identify top influencers by engagement Analyze content strategy Compare competitor performance
  6. Brand Monitoring Track brand mentions Analyze hashtag performance Monitor competitor activity
  7. Content Strategy Identify best posting times Analyze top-performing content types Discover trending hashtags
  8. Market Research Understand audience demographics Track industry trends Analyze competitor strategies
  9. Competitive Intelligence Compare engagement rates Identify content gaps Benchmark performance ๐Ÿ› Troubleshooting "Actor failed to start" Check Dockerfile configuration Verify package.json dependencies Ensure Playwright is installed "No data returned" Profile may be private Check URL format Platform may have changed structure "Rate limited" Reduce scraping frequency Add longer delays Use proxy rotation "Timeout errors" Increase timeout in page.goto() Check internet connection Profile may be slow to load ๐Ÿ“ Example Queries Single Profile json { "instagramUrl": "https://www.instagram.com/nike/" } Multiple Platforms json { "instagramUrl": "https://www.instagram.com/redbull/", "tiktokUrl": "https://www.tiktok.com/@redbull", "facebookUrl": "https://www.facebook.com/redbull" } Competitor Analysis json { "instagramUrl": "https://www.instagram.com/nike/", "tiktokUrl": "https://www.tiktok.com/@nike" } ๐Ÿ”„ Future Enhancements Add Twitter/X support Include YouTube analytics Add Pinterest scraping Implement proxy rotation Add sentiment analysis Create visualization dashboard Add historical data tracking Implement ML-based predictions ๐Ÿ“œ License Apache-2.0

๐Ÿค Contributing Contributions welcome! Please:

Fork the repository Create a feature branch Submit a pull request ๐Ÿ“ง Support For issues or questions:

Open a GitHub issue Contact: your-email@example.com Documentation: [Link to docs] โญ Acknowledgments Built with:

Apify SDK Crawlee Playwright Made with โค๏ธ for data-driven marketers and analysts