Chrome Web Store Scraper
Pricing
from $3.99 / 1,000 results
Developer
ScrapeFlow
Maintained by Community
Chrome Web Store Scraper
Pull every public Chrome Web Store extension into clean, structured JSON: ratings, install counts, manifest, developer, screenshots, badges, and more. Built for speed, designed to survive blocks, ready for API + MCP.
Why choose this actor?
- Two run modes: paste raw listing URLs, or just type a keyword and let the actor build the search URL for you.
- Search filters built in: Featured, Established Publisher, Extension / Theme, minimum star rating.
- Full manifest, optionally included: get `permissions`, `version`, `manifest_version`, host permissions, content scripts, etc.
- Automatic proxy ladder: starts direct, falls back to datacenter, then residential with 3 retries, and sticks with whatever first works.
- Resilient parser: no minified-key hardcoding. Survives Google rotating internal RPC / `WIZ_global_data` identifiers.
- Live dataset push: every extension lands in the dataset the moment it's parsed, so a crash mid-run does not lose work already done.
- Pay-per-event ready: one billable event per scraped extension.
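The proxy ladder described above can be pictured with a short Python sketch. This is not the actor's source code: the `ProxyLadder` class, the `fetch(url, tier)` callable, and the single attempt on the direct and datacenter tiers are assumptions made for illustration (the text only states 3 retries for the residential tier).

```python
from typing import Callable, Optional

# Tier names follow the feature list above. Attempts per tier are an
# assumption, except the 3 residential retries that the list mentions.
TIERS = [("direct", 1), ("datacenter", 1), ("residential", 3)]


class ProxyLadder:
    """Escalate through proxy tiers and stick with the first one that works."""

    def __init__(self, fetch: Callable[[str, str], Optional[str]]):
        # Hypothetical fetcher: returns the page body, or None when blocked.
        self.fetch = fetch
        self.sticky_index = 0  # lowest tier that is still worth trying

    def get(self, url: str) -> str:
        for index in range(self.sticky_index, len(TIERS)):
            tier, attempts = TIERS[index]
            for _ in range(attempts):
                body = self.fetch(url, tier)
                if body is not None:
                    self.sticky_index = index  # later calls start on this tier
                    return body
        raise RuntimeError(f"all proxy tiers failed for {url}")
```

Once a tier succeeds, later calls skip the tiers below it, which is one way to read "sticks with whatever first works".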
Key features
| Feature | Details |
|---|---|
| Search keyword + filters | item type, Featured, Established Publisher, min star rating |
| Bulk URL input | any Chrome Web Store category, collection, or search-result URL |
| Smart proxy fallback | direct → datacenter → residential (sticky, with retries) |
| Concurrent detail scraping | tune `detailConcurrency` (1–32, default 8) |
| Cap output volume | `maxItems` ceiling (1–5000) |
| Manifest toggle | include or strip the full Chrome manifest dict |
| Multi-view dataset | overview, developer, media, technical tabs in Apify Console |
Input
    {
      "mode": "search",
      "searchQuery": "ai",
      "itemTypes": ["EXTENSION"],
      "filterBy": ["featured"],
      "minimalRating": 4,
      "maxItems": 100,
      "extractManifest": false,
      "detailConcurrency": 8,
      "proxyConfiguration": { "useApifyProxy": false }
    }
Field reference
| Field | Type | Purpose |
|---|---|---|
| `mode` | string, `"url"` or `"search"` | Pick which set of inputs the run uses. |
| `startUrls` | array of URLs | (URL mode) Chrome Web Store listing URLs to scrape. |
| `searchQuery` | string | (Search mode) Keyword. |
| `itemTypes` | array | (Search mode) Subset of `EXTENSION`, `THEME`. Empty = no filter. |
| `filterBy` | array | (Search mode) Subset of `featured`, `establishedPublisher`. Empty = no filter. |
| `minimalRating` | integer 0–5 | (Search mode) Minimum star rating; 0 disables. |
| `maxItems` | integer 1–5000 | Hard cap on collected extensions. |
| `extractManifest` | boolean | Include the full manifest dict in each record. |
| `detailConcurrency` | integer 1–32 | Parallel detail-page workers. |
| `proxyConfiguration` | proxy object | Override the starting tier. Default = direct. |
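If you generate inputs programmatically, a small helper can catch out-of-range values before a run starts. The sketch below is only an illustration: `build_search_input` is a hypothetical helper, not part of the actor, and its range checks simply mirror the limits stated on this page (rating 0–5, `maxItems` 1–5000, `detailConcurrency` 1–32).

```python
ITEM_TYPES = {"EXTENSION", "THEME"}
FILTERS = {"featured", "establishedPublisher"}


def build_search_input(search_query, item_types=(), filter_by=(), minimal_rating=0,
                       max_items=100, extract_manifest=False, detail_concurrency=8):
    """Return a search-mode input dict, raising ValueError on invalid values."""
    if not search_query.strip():
        raise ValueError("searchQuery must not be empty in search mode")
    if set(item_types) - ITEM_TYPES:
        raise ValueError(f"itemTypes must be a subset of {sorted(ITEM_TYPES)}")
    if set(filter_by) - FILTERS:
        raise ValueError(f"filterBy must be a subset of {sorted(FILTERS)}")
    if not 0 <= minimal_rating <= 5:
        raise ValueError("minimalRating must be between 0 and 5")
    if not 1 <= max_items <= 5000:
        raise ValueError("maxItems must be between 1 and 5000")
    if not 1 <= detail_concurrency <= 32:
        raise ValueError("detailConcurrency must be between 1 and 32")
    return {
        "mode": "search",
        "searchQuery": search_query,
        "itemTypes": list(item_types),
        "filterBy": list(filter_by),
        "minimalRating": minimal_rating,
        "maxItems": max_items,
        "extractManifest": extract_manifest,
        "detailConcurrency": detail_concurrency,
    }
```

For example, `build_search_input("ai", item_types=["EXTENSION"], filter_by=["featured"], minimal_rating=4)` produces the sample input shown earlier, minus `proxyConfiguration`.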
Output
Each record looks like this (full manifest omitted for brevity):
    {
      "extensionId": "difoiogjjojoaoomphldepapgpbgkhkb",
      "iconUrl": "https://lh3.googleusercontent.com/.../wTNe...",
      "promoTileUrl": "https://.../irg6...",
      "marqueeTileUrl": "https://.../eQr...",
      "name": "Sider: Chat with all AI: GPT-5, Claude, DeepSeek, Gemini, Grok",
      "rating": 4.918653426548515,
      "ratingCount": 112285,
      "users": 5000000,
      "version": "5.25.12",
      "category": "productivity/tools",
      "shortDescription": "ChatGPT, DeepSeek, Gemini, Claude, Grok all in one AI sidebar…",
      "websiteUrl": "sider.ai",
      "permissions": ["storage", "cookies", "scripting", "activeTab", "tabs"],
      "manifestVersion": 3,
      "manifest": { "...": "(only when extractManifest = true)" },
      "author": "care@sider.ai",
      "url": "https://chromewebstore.google.com/detail/difoiogjjojoaoomphldepapgpbgkhkb",
      "scrapedAt": "2026-05-17T11:29:57.813Z",
      "description": "ChatGPT, DeepSeek, Gemini, Claude, Grok all in one AI sidebar…",
      "screenshots": ["https://lh3.googleusercontent.com/.../jEZ..."],
      "developer": {
        "email": "care@sider.ai",
        "displayName": "ChatGPT Sidebar",
        "address": "335 Huntington Ave APT 35 Boston, MA 02115 US",
        "phone": "+18577560822",
        "legalEntity": "Vidline Inc.",
        "registrationNumber": "106977314",
        "developerId": "u7940399ce3ad5f805172c7a395b8b66b"
      },
      "developerWebsite": "https://sider.ai",
      "privacyPolicyUrl": "https://sider.ai/privacy.html",
      "isFeatured": true,
      "isEstablishedPublisher": true
    }
The dataset offers four pre-configured views in Apify Console: Overview, Developer, Media & content, and Technical.
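Records contain a nested `developer` object and list fields such as `permissions`, so downstream tools that expect flat rows need a flattening step. The following is a minimal sketch of one way to do that locally. The `flatten` and `to_csv` helpers and the dotted column names are assumptions for illustration and say nothing about how the Console views or the built-in exports are produced.

```python
import csv
import io


def flatten(record, prefix=""):
    """Flatten nested dicts into dotted keys and join lists into one string."""
    flat = {}
    for key, value in record.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            flat.update(flatten(value, prefix=f"{name}."))
        elif isinstance(value, list):
            flat[name] = "; ".join(str(item) for item in value)
        else:
            flat[name] = value
    return flat


def to_csv(records):
    """Render records as CSV text, using the union of all flattened keys."""
    rows = [flatten(record) for record in records]
    columns = []
    for row in rows:
        for key in row:
            if key not in columns:
                columns.append(key)  # keep first-seen column order
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, restval="")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

With this layout, `developer.email` and `developer.phone` become ordinary columns, which makes it easy to drop them before sharing a file.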
How to use (Apify Console)
- Log in at console.apify.com → Actors.
- Open this actor (`chrome-web-store-scraper`).
- Pick a run mode and fill the inputs.
- Click Start.
- Watch the log: you'll see every extension as it's discovered and scraped, with proxy events called out live.
- Open Output when the run finishes and export to JSON / CSV / XLSX.
Use via API
    curl -X POST "https://api.apify.com/v2/acts/USER~chrome-web-store-scraper/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{
        "mode": "search",
        "searchQuery": "password manager",
        "itemTypes": ["EXTENSION"],
        "minimalRating": 4,
        "maxItems": 50
      }'
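The same request can be made from Python with only the standard library. This is a sketch that reuses the endpoint and placeholder actor ID (`USER~chrome-web-store-scraper`) from the curl command above. The helper names `build_run_request` and `run_actor` and the 300-second timeout are assumptions, not part of any official client.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2/acts"


def build_run_request(actor_id, token, run_input):
    """Build the POST request for the run-sync-get-dataset-items endpoint."""
    query = urllib.parse.urlencode({"token": token})
    url = f"{API_BASE}/{actor_id}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_actor(actor_id, token, run_input, timeout=300):
    """Send the request and return the dataset items as parsed JSON."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Example call (needs a real token and performs a network request):
# items = run_actor("USER~chrome-web-store-scraper", os.environ["APIFY_TOKEN"],
#                   {"mode": "search", "searchQuery": "password manager", "maxItems": 50})
```

Keeping request construction separate from sending makes the URL and payload easy to inspect without spending credits.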
Best use cases
- Market research: competitor catalogues, install counts, rating trends.
- Store intelligence: track Featured and Established Publisher badges across categories.
- Permission audits: bulk-collect manifests and surface risky permission patterns.
- Lead generation: pull developer contact + website fields straight from the store.
- Trend tracking: schedule the actor on a cron and diff rating + user counts over time.
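As an illustration of the permission-audit use case, the sketch below scans exported records for a watch-list of permissions. The `RISKY` set is an arbitrary example, not a vetted security policy, and `audit_permissions` is a hypothetical helper. It relies only on the `permissions`, `extensionId`, `name`, and `users` fields shown in the sample record.

```python
# Example watch-list only; what counts as "risky" depends on your own policy.
RISKY = {"cookies", "tabs", "scripting", "webRequest", "history", "<all_urls>"}


def audit_permissions(records, risky=RISKY):
    """Return extensions that request any watched permission, largest user base first."""
    findings = []
    for record in records:
        hits = sorted(set(record.get("permissions") or []) & set(risky))
        if hits:
            findings.append({
                "extensionId": record.get("extensionId"),
                "name": record.get("name"),
                "users": record.get("users") or 0,
                "risky": hits,
            })
    findings.sort(key=lambda finding: finding["users"], reverse=True)
    return findings
```

Applied to the sample record from the Output section, this would flag `cookies`, `scripting`, and `tabs`.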
Pricing
This actor uses Apify's Pay-Per-Event model. You only pay for results, not for compute, proxies, or storage.
| Event | What it bills for | Price |
|---|---|---|
| `row_result` | One Chrome Web Store extension successfully written to the dataset (full record: rating, users, version, permissions, developer, screenshots, badges). | $0.005 |
That works out to $5 per 1,000 extensions scraped. Failed extensions (page errors, blocks that survive the proxy ladder) are not charged; only successful rows count.
Runs stop gracefully when your spend limit (`ACTOR_MAX_TOTAL_CHARGE_USD`) is reached; partial results stay in the dataset, with no failed-run penalty.
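The arithmetic behind these numbers is simple enough to check in a few lines. The sketch assumes the flat $0.005 per `row_result` from the table above, so any plan-specific discounts are ignored. The function names are illustrative.

```python
from decimal import Decimal

PRICE_PER_RESULT = Decimal("0.005")  # USD per row_result, from the pricing table


def estimated_cost(successful_rows):
    """Cost in USD for a number of successfully scraped extensions."""
    return PRICE_PER_RESULT * successful_rows


def rows_within_budget(max_charge_usd):
    """How many rows fit under a spend limit such as ACTOR_MAX_TOTAL_CHARGE_USD."""
    return int(Decimal(str(max_charge_usd)) // PRICE_PER_RESULT)
```

Under that assumption, `estimated_cost(1000)` equals 5 USD, and a spend limit of 2 USD allows `rows_within_budget(2)`, which is 400 rows, before the run stops.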
FAQ
Does this need a Google account or login? No. Everything is fetched anonymously from public Chrome Web Store pages.
Will it work if I leave proxy on "direct"? Usually yes for small runs. If the store rate-limits, the actor escalates to datacenter, then residential, automatically. Large runs benefit from setting proxyConfiguration.useApifyProxy = true up front.
Why is the manifest field sometimes missing? The full manifest is only included when `extractManifest` is true; leaving it off keeps records small. Summary fields (`version`, `permissions`, `manifestVersion`) are kept either way.
What happens if the Chrome Web Store HTML layout changes? The parser discovers RPC IDs, request templates, and WIZ_global_data keys at runtime, so most layout / minification changes are absorbed automatically. Issues are tracked in the Issues tab.
Cautions & legal
- Only publicly available Chrome Web Store data is collected.
- The Chrome Web Store does not require authentication for the pages this actor reads.
- You are responsible for compliance with the Chrome Web Store Terms of Service, as well as any GDPR / CCPA / anti-spam obligations when you use the output (especially the `developer.email` / `developer.phone` / `developer.address` fields).
- The actor honors reasonable rate limits and politely backs off when the store pushes back.
Support & feedback
Open an issue on the actor's Issues tab or use the Feedback button in Apify Console. PRs and suggestions welcome.