π€ Instagram Profile Scraper Enhanced
Pricing
from $1.77 / 1,000 results
π€ Instagram Profile Scraper Enhanced
Rich Instagram profile data with contact extraction, engagement scoring, follower bands, and MCP-ready metadata. Desktop+mobile fallback chain. 3 modes. No paid API keys required.
Pricing
from $1.77 / 1,000 results
Rating
0.0
(0)
Developer
Virtual Footprint LLC
Maintained by CommunityActor stats
0
Bookmarked
1
Total users
0
Monthly active users
2 days ago
Last modified
Categories
Share
Instagram Profile Scraper Enhanced
Creator intelligence from Instagram profiles: bio contact extraction, engagement-rate scoring, follower-band classification, and MCP-ready
providerHealthmetadata. Desktop+mobile fallback chain. No login cookies required.
Why This Actor Is Better
This actor is engineered to outperform every Instagram profile scraper on the Apify Store on reliability, creator-intelligence depth, and price.
Competitor comparison
| Feature | This Actor | Apify IG Scraper (top free) | IG API Scraper | Influencer analytics tools |
|---|---|---|---|---|
| No login cookies required | β | β | β | β |
| Desktop+mobile fallback | β | β | β | β |
| Bio email extraction | β | partial | β | β |
| Bio-link website email scrape | β | β | β | β |
| Engagement-rate score | β | β | β | β |
| Follower-band classification | β nano/micro/macro/mega | β | β | β |
| Confidence score (0-1) | β every result | β | β | β |
| Source attribution | β per-field | β | partial | partial |
| MCP-ready metadata | β
providerHealth | β | β | β |
| Optional HypeAuditor fraud check | β user key | β | β | β paid only |
| Price / 1K profiles | $1.77 | free (rate-limited) | ~$2.50 | ~$5-10 |
Key Features
- π‘οΈ Multi-API fallback chain β Instagram desktop (Playwright) primary with automatic mobile-HTML fallback. Never fails silently.
- βοΈ Contact extraction β emails/phones from bio + bio-link website (open-source, no paid API).
- π Engagement scoring β engagement-rate proxy + label (very_high/high/medium/low) on every profile.
- π₯ Follower bands β automatic classification: starter / nano / micro / macro / mega for influencer marketing.
- π― Confidence scoring β every result gets a 0.0β1.0 score so you can filter noise.
- π Source attribution β know exactly which provider contributed each field (
instagram,website_scraper,hypeauditor). - β‘ Cache-first mode β
fast_lookuphits the Apify KVS cache for <800ms responses (1h TTL β profiles change fast). - π€ MCP-ready β every result carries
providerHealth{}so MCP agents can route around failed providers. - π Optional paid enrichment β drop in
HYPEAUDITOR_API_KEYfor fraud detection. Disabled by default. - π° Transparent PPE pricing β pay only for successful profiles, no subscription.
Architecture
flowchart TDA[Input: usernames/URLs + mode] --> B{Cache hit?}B -- yes --> C[Return cached base profile]B -- no --> D[Primary: Instagram desktop Playwright]D -- fails --> E[Fallback: Instagram mobile HTML httpx]D -- ok --> F[Normalize: followers/posts/bio/profile pic]E --> FF --> G[Enrichment layer]G --> G1[Bio email/phone extraction]G --> G2[Bio-link website contact scraper]G --> G3[Engagement-rate + follower-band scoring]G --> G4[Optional: HypeAuditor fraud detection]G1 --> H[Confidence scoring + source attribution]G2 --> HG3 --> HG4 --> HH --> I[Progressive dataset push]I --> J[Webhook + MCP-ready metadata]C --> J
Data flow: Input β Cache check β Primary scraper (fallback on failure) β Profile normalization β Bio + website contact enrichment β Engagement + follower-band scoring β Optional fraud detection β Confidence scoring β Progressive dataset push β Webhook delivery.
Every result includes providerHealth{} tracking per-provider status, latency, and errors β making this actor safe to call from MCP agents that need to route around failures.
Modes
| Mode | Description | Target latency | Use case |
|---|---|---|---|
fast_lookup | Cache-first, base profile data only | <800ms cached | Quick lookups, deduplication |
enrich | Full enrichment: contacts + engagement + bands | ~2-4s/profile | Influencer prospecting, lead gen |
batch | Queue-based, full enrichment, per-item isolation | varies | Large username lists (100+) |
Input
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
mode | string | β | enrich | fast_lookup | enrich | batch |
queries | array | β | ["cristiano"] | Usernames, profile URLs, or search queries |
maxResults | integer | β | 25 | Max profiles per query (1β1000) |
webhookUrl | string | β | β | Webhook for completion notification |
Example input
{"mode": "enrich","queries": ["cristiano", "leomessi", "instagram"],"maxResults": 50,"webhookUrl": "https://your-app.com/webhook"}
Output
Results are stored in the Apify dataset as structured JSON. See .actor/output_schema.json for the canonical schema.
| Field | Type | Description |
|---|---|---|
query | string | Input query that produced this result |
username | string | Instagram handle |
name | string | Display name |
url | string | Profile URL |
bio | string | Bio text |
profilePic | string | Profile picture URL |
followers | string | Follower count (as displayed, e.g. "1.2M") |
following | string | Following count |
posts | string | Post count |
followerCount | integer | Parsed follower count (numeric) |
followerBand | string | starter | nano | micro | macro | mega |
engagementRate | number | Engagement-rate proxy (0-100) |
engagementLabel | string | very_high | high | medium | low | unknown |
recentPosts | array | Recent post URLs (up to 12) |
website | string | null | External link from bio |
emails | array | Extracted email addresses (bio + website) |
phones | array | Extracted phone numbers |
fraudAnalysis | object | null | HypeAuditor data (if key provided) |
confidenceScore | number | 0.0β1.0 reliability score |
sources | array | Provider attribution (instagram, website_scraper, hypeauditor) |
providerHealth | object | Per-provider status/latency/error for MCP routing |
cacheStatus | string | hit | miss | degraded |
mode | string | Execution mode used |
extractedAt | string | ISO timestamp |
Example output
{"query": "cristiano","username": "cristiano","name": "Cristiano Ronaldo","url": "https://instagram.com/cristiano","bio": "...","profilePic": "https://...","followers": "642M","following": "580","posts": "3,600","followerCount": 642000000,"followerBand": "mega","engagementRate": 0.56,"engagementLabel": "low","recentPosts": ["https://instagram.com/p/..."],"website": "https://cr7.com","emails": ["contact@cr7.com"],"phones": [],"confidenceScore": 0.85,"sources": ["instagram", "website_scraper"],"providerHealth": {"instagram_desktop": {"status": "ok", "latency_ms": 2100, "error": null},"website_scraper": {"status": "ok", "latency_ms": 1400, "error": null},"engagement_scorer": {"status": "ok", "latency_ms": 0, "error": null}},"cacheStatus": "miss","mode": "enrich","extractedAt": "2026-06-28T23:45:00.000Z"}
Pricing
Transparent pay-per-event (PPE) pricing. You only pay for successful profiles.
| Plan | Price per 1K profiles | Savings vs. top competitor |
|---|---|---|
| Leading competitors | ~$2.50/1K | β |
| This actor (β€10K/mo) | $1.77/1K | 29% cheaper |
| This actor (10Kβ100K/mo) | $1.50/1K | 40% cheaper |
| This actor (100K+/mo) | $1.25/1K | 50% cheaper |
Volume discounts apply automatically based on monthly usage. A small one-time Actor Start fee ($0.00005) applies per run. Optional enrichment events:
| Event | Price | When charged |
|---|---|---|
contact_found | $0.50/1K | Per profile with an extracted email or phone |
No monthly subscription required. No paid API keys required for core extraction.
Use Cases
- Influencer prospecting β find creators by audience size (micro/macro/mega) with verified contact emails
- Creator outreach β build email lists of Instagram creators for brand partnerships
- Engagement benchmarking β compare engagement rates across a creator shortlist
- Lead generation β extract contact info from business Instagram bios and their websites
- MCP agent workflows β call from LLM agents;
providerHealthlets agents route around failures - Market research β map creator density by niche and follower band
- CRM enrichment β append Instagram follower counts and engagement to existing contact records
- Fraud screening β optional HypeAuditor integration to flag fake-follower accounts before outreach
Integration Examples
Python (Apify SDK)
from apify_client import ApifyClientclient = ApifyClient("YOUR_APIFY_TOKEN")run = client.actor("ayeeyee/instagram-profile-scraper-enhanced").call(run_input={"mode": "enrich","queries": ["cristiano", "leomessi", "instagram"],"maxResults": 50,})for item in client.dataset(run["defaultDatasetId"]).iterate_items():print(f"@{item['username']}: {item['followerBand']} band, "f"engagement {item['engagementLabel']}, emails {item['emails']}")
cURL
curl -X POST "https://api.apify.com/v2/acts/ayeeyee~instagram-profile-scraper-enhanced/runs?token=YOUR_TOKEN" \-H "Content-Type: application/json" \-d '{"mode":"enrich","queries":["cristiano","leomessi"],"maxResults":25}'
Zapier / n8n / Make.com
- Add an HTTP Request or Apify module.
- Use actor ID
ayeeyee/instagram-profile-scraper-enhanced. - Map your trigger output to the
queriesinput array. - Consume the dataset items in the next step.
MCP (Model Context Protocol)
This actor is MCP-ready. Register it via @apify/actors-mcp-server:
$npx -y @apify/actors-mcp-server --tools actors,ayeeyee/instagram-profile-scraper-enhanced
Agents can then call call-actor with the input above and use providerHealth + confidenceScore + followerBand to route around failed providers and filter creator quality.
FAQ
Q: Do I need Instagram login cookies? No. This actor scrapes public profile data via meta tags and JSON-LD β no login or session cookies required. Private profiles return base data only.
Q: How does the desktop+mobile fallback work?
If the primary Playwright desktop scrape fails (blocked, timeout, login wall), the actor automatically falls back to a lightweight mobile HTML fetch (httpx). The providerHealth field on every result shows which providers succeeded/failed and their latency.
Q: Where do the emails come from? Two sources: (1) emails found directly in the Instagram bio, (2) emails scraped from the website linked in the bio. No paid email-finder API is required. For higher match rates, provide a HypeAuditor key.
Q: What is the engagement rate?
A posts-to-follower ratio proxy expressed as a percentage. Real engagement (likes+comments per post) would require scraping each post β left as optional enrichment. The label (very_high/high/medium/low) lets you filter creators quickly.
Q: What are follower bands?
Automatic classification by audience size: starter (<1K), nano (1K-10K), micro (10K-100K), macro (100K-1M), mega (1M+). Standard influencer-marketing tiers.
Q: Can I call this from an LLM agent?
Yes. The actor is MCP-ready. Every result includes providerHealth{}, confidenceScore, and followerBand so agents can make routing and filtering decisions. Register it via @apify/actors-mcp-server.
Q: What is the cache TTL?
1 hour. Cached results in fast_lookup mode return in <800ms. Enrichment results are not cached (always fresh). Profiles change faster than business listings, so the TTL is shorter than directory scrapers.
Q: Is this legal? Yes. This actor scrapes publicly available Instagram profile data. See the Legal & Compliance section below.
Legal & Compliance
This actor scrapes publicly available Instagram profile data via meta tags and JSON-LD. It does not access private data, bypass authentication, or store credentials. Users are responsible for complying with applicable data protection laws (GDPR, CCPA, etc.) and Instagram's Terms of Service.
This actor is intended for legitimate research, analysis, and creator-outreach use cases. It must not be used for spam, harassment, or unlawful activity.
AI-DLC / Data Lifecycle
This actor follows AI-DLC principles for ethical data handling:
- Collection β Public data only; respects robots.txt and rate limits.
- Processing β In-memory normalization; no PII logging.
- Storage β Results are pushed to the user's Apify dataset, not retained by the actor.
- Usage β Designed for analysis, enrichment, and legitimate creator outreach.
- Disposal β No long-term caching of user data between runs (cache TTL 1h for base results only).
Enhancement Roadmap (API / MCP Integrations)
- HypeAuditor fraud-detection MCP β flag fake-follower accounts before outreach (optional, user key)
- Google Vision image tagging MCP β auto-tag profile pictures and recent posts for niche classification
- OpenAI sentiment MCP β analyze bio sentiment and brand-fit scoring
- LangGraph conditional workflow β multi-step: IG profile β website β LinkedIn β AI scoring (see
multi-api-orchestration-spec.md) - Vector store integration β semantic creator deduplication across runs
Changelog
- v3.0 β Multi-API orchestration edition: added desktop+mobile fallback chain, MCP-ready
providerHealthmetadata, engagement-rate + follower-band scoring, optional HypeAuditor fraud detection, expanded FAQ, integration examples, volume pricing tiers. - v2.0 β Premium README, full output schema, AI-DLC documentation, confidence scoring, source attribution, bio-link website enrichment.
- v1.0 β Initial release with core Playwright scraping and structured output.
Links
- Apify Store: https://apify.com/ayeeyee/instagram-profile-scraper-enhanced
- Actor ID:
2NiTjgPTRTmO5aYiI - Documentation: See this README
- Support: Open an issue on the actor's Apify Store page