Chrome Web Store Scraper avatar

Chrome Web Store Scraper

Pricing

from $3.99 / 1,000 results

Go to Apify Store
Chrome Web Store Scraper

Chrome Web Store Scraper

Pricing

from $3.99 / 1,000 results

Rating

0.0

(0)

Developer

ScrapeFlow

ScrapeFlow

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

🧩 Chrome Web Store Scraper

πŸš€ Pull every public Chrome Web Store extension into clean, structured JSON β€” ratings, install counts, manifest, developer, screenshots, badges, and more. Built for speed, designed to survive blocks, ready for API + MCP.


✨ Why choose this actor?

  • πŸ” Two run modes β€” paste raw listing URLs, or just type a keyword and let the actor build the search URL for you.
  • 🎯 Search filters built in β€” Featured, Established Publisher, Extension / Theme, minimum star rating.
  • πŸ“¦ Full manifest, optionally included β€” get permissions, version, manifest_version, host permissions, content scripts, etc.
  • πŸ›‘οΈ Automatic proxy ladder β€” starts direct, falls back to datacenter, then residential with 3 retries, and sticks with whatever first works.
  • 🧠 Resilient parser β€” no minified-key hardcoding. Survives Google rotating internal RPC / WIZ_global_data identifiers.
  • πŸ’Ύ Live dataset push β€” every extension lands in the dataset the moment it's parsed; crashes mid-run never lose work.
  • πŸ’³ Pay-per-event ready β€” one billable event per scraped extension.

πŸŽ›οΈ Key features

πŸ”Ž Search keyword + filtersitem type, Featured, Established Publisher, min star rating
πŸ”— Bulk URL inputany Chrome Web Store category, collection, or search-result URL
πŸ›‘οΈ Smart proxy fallbackdirect β†’ datacenter β†’ residential (sticky, with retries)
⚑ Concurrent detail scrapingtune detailConcurrency (1–32, default 8)
🧱 Cap output volumemaxItems ceiling (1–5000)
πŸ“¦ Manifest toggleinclude or strip the full Chrome manifest dict
πŸ“Š Multi-view datasetoverview, developer, media, technical tabs in Apify Console

πŸ“₯ Input

{
"mode": "search",
"searchQuery": "ai",
"itemTypes": ["EXTENSION"],
"filterBy": ["featured"],
"minimalRating": 4,
"maxItems": 100,
"extractManifest": false,
"detailConcurrency": 8,
"proxyConfiguration": { "useApifyProxy": false }
}

Field reference

FieldTypePurpose
modestring "url" / "search"Pick which set of inputs the run uses.
startUrlsarray of URLs(URL mode) Chrome Web Store listing URLs to scrape.
searchQuerystring(Search mode) Keyword.
itemTypesarray(Search mode) Subset of EXTENSION, THEME. Empty = no filter.
filterByarray(Search mode) Subset of featured, establishedPublisher. Empty = no filter.
minimalRatinginteger 0–5(Search mode) Minimum star rating; 0 disables.
maxItemsintegerHard cap on collected extensions.
extractManifestbooleanInclude the full manifest dict in each record.
detailConcurrencyinteger 1–32Parallel detail-page workers.
proxyConfigurationproxy objectOverride the starting tier. Default = direct.

πŸ“€ Output

Each record looks like this (full manifest omitted for brevity):

{
"extensionId": "difoiogjjojoaoomphldepapgpbgkhkb",
"iconUrl": "https://lh3.googleusercontent.com/.../wTNe...",
"promoTileUrl": "https://.../irg6...",
"marqueeTileUrl": "https://.../eQr...",
"name": "Sider: Chat with all AI: GPT-5, Claude, DeepSeek, Gemini, Grok",
"rating": 4.918653426548515,
"ratingCount": 112285,
"users": 5000000,
"version": "5.25.12",
"category": "productivity/tools",
"shortDescription": "ChatGPT, DeepSeek, Gemini, Claude, Grok all in one AI sidebar…",
"websiteUrl": "sider.ai",
"permissions": ["storage", "cookies", "scripting", "activeTab", "tabs"],
"manifestVersion": 3,
"manifest": { "...": "(only when extractManifest = true)" },
"author": "care@sider.ai",
"url": "https://chromewebstore.google.com/detail/difoiogjjojoaoomphldepapgpbgkhkb",
"scrapedAt": "2026-05-17T11:29:57.813Z",
"description": "ChatGPT, DeepSeek, Gemini, Claude, Grok all in one AI sidebar…",
"screenshots": ["https://lh3.googleusercontent.com/.../jEZ..."],
"developer": {
"email": "care@sider.ai",
"displayName": "ChatGPT Sidebar",
"address": "335 Huntington Ave APT 35 Boston, MA 02115 US",
"phone": "+18577560822",
"legalEntity": "Vidline Inc.",
"registrationNumber": "106977314",
"developerId": "u7940399ce3ad5f805172c7a395b8b66b"
},
"developerWebsite": "https://sider.ai",
"privacyPolicyUrl": "https://sider.ai/privacy.html",
"isFeatured": true,
"isEstablishedPublisher": true
}

The dataset offers four pre-configured views in Apify Console: πŸ“Š Overview, πŸ‘€ Developer, πŸ–ΌοΈ Media & content, and βš™οΈ Technical.


πŸš€ How to use (Apify Console)

  1. πŸ” Log in at console.apify.com β†’ Actors.
  2. πŸ”Ž Open this actor (chrome-web-store-scraper).
  3. πŸŽ›οΈ Pick a run mode and fill the inputs.
  4. ▢️ Click Start.
  5. πŸ“œ Watch the log β€” you'll see every extension as it's discovered and scraped, with proxy events called out live.
  6. πŸ“€ Open Output when the run finishes β€” export to JSON / CSV / XLSX.

πŸ€– Use via API

curl -X POST "https://api.apify.com/v2/acts/USER~chrome-web-store-scraper/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"mode": "search",
"searchQuery": "password manager",
"itemTypes": ["EXTENSION"],
"minimalRating": 4,
"maxItems": 50
}'

🎯 Best use cases

  • 🧠 Market research β€” competitor catalogues, install counts, rating trends.
  • πŸ›’ Store intelligence β€” track Featured and Established Publisher badges across categories.
  • πŸ” Permission audits β€” bulk-collect manifests and surface risky permission patterns.
  • πŸ“ˆ Lead generation β€” pull developer contact + website fields straight from the store.
  • πŸ“Š Trend tracking β€” schedule the actor on a cron and diff rating + user counts over time.

πŸ’° Pricing

This actor uses Apify's Pay-Per-Event model. You only pay for results β€” not for compute, proxies, or storage.

EventWhat it bills forPrice
🧩 row_resultOne Chrome Web Store extension successfully written to the dataset (full record β€” rating, users, version, permissions, developer, screenshots, badges).$0.005

That works out to $5 per 1,000 extensions scraped. Failed extensions (page errors, blocks that survive the proxy ladder) are not charged β€” only successful rows count.

Runs stop gracefully when your spend limit (ACTOR_MAX_TOTAL_CHARGE_USD) is reached β€” partial results stay in the dataset, no failed-run penalty.


❓ FAQ

Does this need a Google account or login? No. Everything is fetched anonymously from public Chrome Web Store pages.

Will it work if I leave proxy on "direct"? Usually yes for small runs. If the store rate-limits, the actor escalates to datacenter, then residential, automatically. Large runs benefit from setting proxyConfiguration.useApifyProxy = true up front.

Why is the manifest field sometimes missing? You disabled extractManifest to keep records small. Summary fields (version, permissions, manifestVersion) are kept either way.

What happens if the Chrome Web Store HTML layout changes? The parser discovers RPC IDs, request templates, and WIZ_global_data keys at runtime, so most layout / minification changes are absorbed automatically. Issues are tracked in the Issues tab.


  • 🌐 Only publicly available Chrome Web Store data is collected.
  • βœ… The Chrome Web Store does not require authentication for the pages this actor reads.
  • πŸ§‘β€βš–οΈ You are responsible for compliance with the Chrome Web Store Terms of Service, as well as any GDPR / CCPA / anti-spam obligations when you use the output (especially the developer.email / developer.phone / developer.address fields).
  • 🀝 The actor honors reasonable rate limits and politely backs off when the store pushes back.

πŸ’¬ Support & feedback

Open an issue on the actor's Issues tab or use the Feedback button in Apify Console. PRs and suggestions welcome.