πŸ‘€ Instagram Profile Scraper Enhanced avatar

πŸ‘€ Instagram Profile Scraper Enhanced

Pricing

from $1.77 / 1,000 results

Go to Apify Store
πŸ‘€ Instagram Profile Scraper Enhanced

πŸ‘€ Instagram Profile Scraper Enhanced

Rich Instagram profile data with contact extraction, engagement scoring, follower bands, and MCP-ready metadata. Desktop+mobile fallback chain. 3 modes. No paid API keys required.

Pricing

from $1.77 / 1,000 results

Rating

0.0

(0)

Developer

Virtual Footprint LLC

Virtual Footprint LLC

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

2 days ago

Last modified

Share

Instagram Profile Scraper Enhanced

Apify Version Pricing Platform Modes MCP-ready

Creator intelligence from Instagram profiles: bio contact extraction, engagement-rate scoring, follower-band classification, and MCP-ready providerHealth metadata. Desktop+mobile fallback chain. No login cookies required.


Why This Actor Is Better

This actor is engineered to outperform every Instagram profile scraper on the Apify Store on reliability, creator-intelligence depth, and price.

Competitor comparison

FeatureThis ActorApify IG Scraper (top free)IG API ScraperInfluencer analytics tools
No login cookies requiredβœ…βŒβŒβŒ
Desktop+mobile fallbackβœ…βŒβŒβŒ
Bio email extractionβœ…partialβŒβœ…
Bio-link website email scrapeβœ…βŒβŒβœ…
Engagement-rate scoreβœ…βŒβŒβœ…
Follower-band classificationβœ… nano/micro/macro/megaβŒβŒβœ…
Confidence score (0-1)βœ… every result❌❌❌
Source attributionβœ… per-field❌partialpartial
MCP-ready metadataβœ… providerHealth❌❌❌
Optional HypeAuditor fraud checkβœ… user keyβŒβŒβœ… paid only
Price / 1K profiles$1.77free (rate-limited)~$2.50~$5-10

Key Features

  • πŸ›‘οΈ Multi-API fallback chain β€” Instagram desktop (Playwright) primary with automatic mobile-HTML fallback. Never fails silently.
  • βœ‰οΈ Contact extraction β€” emails/phones from bio + bio-link website (open-source, no paid API).
  • πŸ“Š Engagement scoring β€” engagement-rate proxy + label (very_high/high/medium/low) on every profile.
  • πŸ‘₯ Follower bands β€” automatic classification: starter / nano / micro / macro / mega for influencer marketing.
  • 🎯 Confidence scoring β€” every result gets a 0.0–1.0 score so you can filter noise.
  • πŸ”— Source attribution β€” know exactly which provider contributed each field (instagram, website_scraper, hypeauditor).
  • ⚑ Cache-first mode β€” fast_lookup hits the Apify KVS cache for <800ms responses (1h TTL β€” profiles change fast).
  • πŸ€– MCP-ready β€” every result carries providerHealth{} so MCP agents can route around failed providers.
  • πŸ”Œ Optional paid enrichment β€” drop in HYPEAUDITOR_API_KEY for fraud detection. Disabled by default.
  • πŸ’° Transparent PPE pricing β€” pay only for successful profiles, no subscription.

Architecture

flowchart TD
A[Input: usernames/URLs + mode] --> B{Cache hit?}
B -- yes --> C[Return cached base profile]
B -- no --> D[Primary: Instagram desktop Playwright]
D -- fails --> E[Fallback: Instagram mobile HTML httpx]
D -- ok --> F[Normalize: followers/posts/bio/profile pic]
E --> F
F --> G[Enrichment layer]
G --> G1[Bio email/phone extraction]
G --> G2[Bio-link website contact scraper]
G --> G3[Engagement-rate + follower-band scoring]
G --> G4[Optional: HypeAuditor fraud detection]
G1 --> H[Confidence scoring + source attribution]
G2 --> H
G3 --> H
G4 --> H
H --> I[Progressive dataset push]
I --> J[Webhook + MCP-ready metadata]
C --> J

Data flow: Input β†’ Cache check β†’ Primary scraper (fallback on failure) β†’ Profile normalization β†’ Bio + website contact enrichment β†’ Engagement + follower-band scoring β†’ Optional fraud detection β†’ Confidence scoring β†’ Progressive dataset push β†’ Webhook delivery.

Every result includes providerHealth{} tracking per-provider status, latency, and errors β€” making this actor safe to call from MCP agents that need to route around failures.


Modes

ModeDescriptionTarget latencyUse case
fast_lookupCache-first, base profile data only<800ms cachedQuick lookups, deduplication
enrichFull enrichment: contacts + engagement + bands~2-4s/profileInfluencer prospecting, lead gen
batchQueue-based, full enrichment, per-item isolationvariesLarge username lists (100+)

Input

ParameterTypeRequiredDefaultDescription
modestringβ€”enrichfast_lookup | enrich | batch
queriesarrayβœ…["cristiano"]Usernames, profile URLs, or search queries
maxResultsintegerβ€”25Max profiles per query (1–1000)
webhookUrlstringβ€”β€”Webhook for completion notification

Example input

{
"mode": "enrich",
"queries": ["cristiano", "leomessi", "instagram"],
"maxResults": 50,
"webhookUrl": "https://your-app.com/webhook"
}

Output

Results are stored in the Apify dataset as structured JSON. See .actor/output_schema.json for the canonical schema.

FieldTypeDescription
querystringInput query that produced this result
usernamestringInstagram handle
namestringDisplay name
urlstringProfile URL
biostringBio text
profilePicstringProfile picture URL
followersstringFollower count (as displayed, e.g. "1.2M")
followingstringFollowing count
postsstringPost count
followerCountintegerParsed follower count (numeric)
followerBandstringstarter | nano | micro | macro | mega
engagementRatenumberEngagement-rate proxy (0-100)
engagementLabelstringvery_high | high | medium | low | unknown
recentPostsarrayRecent post URLs (up to 12)
websitestring | nullExternal link from bio
emailsarrayExtracted email addresses (bio + website)
phonesarrayExtracted phone numbers
fraudAnalysisobject | nullHypeAuditor data (if key provided)
confidenceScorenumber0.0–1.0 reliability score
sourcesarrayProvider attribution (instagram, website_scraper, hypeauditor)
providerHealthobjectPer-provider status/latency/error for MCP routing
cacheStatusstringhit | miss | degraded
modestringExecution mode used
extractedAtstringISO timestamp

Example output

{
"query": "cristiano",
"username": "cristiano",
"name": "Cristiano Ronaldo",
"url": "https://instagram.com/cristiano",
"bio": "...",
"profilePic": "https://...",
"followers": "642M",
"following": "580",
"posts": "3,600",
"followerCount": 642000000,
"followerBand": "mega",
"engagementRate": 0.56,
"engagementLabel": "low",
"recentPosts": ["https://instagram.com/p/..."],
"website": "https://cr7.com",
"emails": ["contact@cr7.com"],
"phones": [],
"confidenceScore": 0.85,
"sources": ["instagram", "website_scraper"],
"providerHealth": {
"instagram_desktop": {"status": "ok", "latency_ms": 2100, "error": null},
"website_scraper": {"status": "ok", "latency_ms": 1400, "error": null},
"engagement_scorer": {"status": "ok", "latency_ms": 0, "error": null}
},
"cacheStatus": "miss",
"mode": "enrich",
"extractedAt": "2026-06-28T23:45:00.000Z"
}

Pricing

Transparent pay-per-event (PPE) pricing. You only pay for successful profiles.

PlanPrice per 1K profilesSavings vs. top competitor
Leading competitors~$2.50/1Kβ€”
This actor (≀10K/mo)$1.77/1K29% cheaper
This actor (10K–100K/mo)$1.50/1K40% cheaper
This actor (100K+/mo)$1.25/1K50% cheaper

Volume discounts apply automatically based on monthly usage. A small one-time Actor Start fee ($0.00005) applies per run. Optional enrichment events:

EventPriceWhen charged
contact_found$0.50/1KPer profile with an extracted email or phone

No monthly subscription required. No paid API keys required for core extraction.


Use Cases

  • Influencer prospecting β€” find creators by audience size (micro/macro/mega) with verified contact emails
  • Creator outreach β€” build email lists of Instagram creators for brand partnerships
  • Engagement benchmarking β€” compare engagement rates across a creator shortlist
  • Lead generation β€” extract contact info from business Instagram bios and their websites
  • MCP agent workflows β€” call from LLM agents; providerHealth lets agents route around failures
  • Market research β€” map creator density by niche and follower band
  • CRM enrichment β€” append Instagram follower counts and engagement to existing contact records
  • Fraud screening β€” optional HypeAuditor integration to flag fake-follower accounts before outreach

Integration Examples

Python (Apify SDK)

from apify_client import ApifyClient
client = ApifyClient("YOUR_APIFY_TOKEN")
run = client.actor("ayeeyee/instagram-profile-scraper-enhanced").call(run_input={
"mode": "enrich",
"queries": ["cristiano", "leomessi", "instagram"],
"maxResults": 50,
})
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(f"@{item['username']}: {item['followerBand']} band, "
f"engagement {item['engagementLabel']}, emails {item['emails']}")

cURL

curl -X POST "https://api.apify.com/v2/acts/ayeeyee~instagram-profile-scraper-enhanced/runs?token=YOUR_TOKEN" \
-H "Content-Type: application/json" \
-d '{"mode":"enrich","queries":["cristiano","leomessi"],"maxResults":25}'

Zapier / n8n / Make.com

  1. Add an HTTP Request or Apify module.
  2. Use actor ID ayeeyee/instagram-profile-scraper-enhanced.
  3. Map your trigger output to the queries input array.
  4. Consume the dataset items in the next step.

MCP (Model Context Protocol)

This actor is MCP-ready. Register it via @apify/actors-mcp-server:

$npx -y @apify/actors-mcp-server --tools actors,ayeeyee/instagram-profile-scraper-enhanced

Agents can then call call-actor with the input above and use providerHealth + confidenceScore + followerBand to route around failed providers and filter creator quality.


FAQ

Q: Do I need Instagram login cookies? No. This actor scrapes public profile data via meta tags and JSON-LD β€” no login or session cookies required. Private profiles return base data only.

Q: How does the desktop+mobile fallback work? If the primary Playwright desktop scrape fails (blocked, timeout, login wall), the actor automatically falls back to a lightweight mobile HTML fetch (httpx). The providerHealth field on every result shows which providers succeeded/failed and their latency.

Q: Where do the emails come from? Two sources: (1) emails found directly in the Instagram bio, (2) emails scraped from the website linked in the bio. No paid email-finder API is required. For higher match rates, provide a HypeAuditor key.

Q: What is the engagement rate? A posts-to-follower ratio proxy expressed as a percentage. Real engagement (likes+comments per post) would require scraping each post β€” left as optional enrichment. The label (very_high/high/medium/low) lets you filter creators quickly.

Q: What are follower bands? Automatic classification by audience size: starter (<1K), nano (1K-10K), micro (10K-100K), macro (100K-1M), mega (1M+). Standard influencer-marketing tiers.

Q: Can I call this from an LLM agent? Yes. The actor is MCP-ready. Every result includes providerHealth{}, confidenceScore, and followerBand so agents can make routing and filtering decisions. Register it via @apify/actors-mcp-server.

Q: What is the cache TTL? 1 hour. Cached results in fast_lookup mode return in <800ms. Enrichment results are not cached (always fresh). Profiles change faster than business listings, so the TTL is shorter than directory scrapers.

Q: Is this legal? Yes. This actor scrapes publicly available Instagram profile data. See the Legal & Compliance section below.


This actor scrapes publicly available Instagram profile data via meta tags and JSON-LD. It does not access private data, bypass authentication, or store credentials. Users are responsible for complying with applicable data protection laws (GDPR, CCPA, etc.) and Instagram's Terms of Service.

This actor is intended for legitimate research, analysis, and creator-outreach use cases. It must not be used for spam, harassment, or unlawful activity.


AI-DLC / Data Lifecycle

This actor follows AI-DLC principles for ethical data handling:

  • Collection β€” Public data only; respects robots.txt and rate limits.
  • Processing β€” In-memory normalization; no PII logging.
  • Storage β€” Results are pushed to the user's Apify dataset, not retained by the actor.
  • Usage β€” Designed for analysis, enrichment, and legitimate creator outreach.
  • Disposal β€” No long-term caching of user data between runs (cache TTL 1h for base results only).

Enhancement Roadmap (API / MCP Integrations)

  • HypeAuditor fraud-detection MCP β€” flag fake-follower accounts before outreach (optional, user key)
  • Google Vision image tagging MCP β€” auto-tag profile pictures and recent posts for niche classification
  • OpenAI sentiment MCP β€” analyze bio sentiment and brand-fit scoring
  • LangGraph conditional workflow β€” multi-step: IG profile β†’ website β†’ LinkedIn β†’ AI scoring (see multi-api-orchestration-spec.md)
  • Vector store integration β€” semantic creator deduplication across runs

Changelog

  • v3.0 β€” Multi-API orchestration edition: added desktop+mobile fallback chain, MCP-ready providerHealth metadata, engagement-rate + follower-band scoring, optional HypeAuditor fraud detection, expanded FAQ, integration examples, volume pricing tiers.
  • v2.0 β€” Premium README, full output schema, AI-DLC documentation, confidence scoring, source attribution, bio-link website enrichment.
  • v1.0 β€” Initial release with core Playwright scraping and structured output.