Cratejoy Scraper — Subscription Box Data & Intelligence

Pricing

from $10.00 / 1,000 successfully scraped results

Scrape Cratejoy subscription box data — price, rating, niche, shipping, cancel policy, eco flags, coupon codes, affiliate program, and social handles. Auto-pagination, checkpoint resume. 30+ fields per box. $10 per 1,000 results. Pay per result.

Developer: Scrape Pilot (Maintained by Community)

📦 Cratejoy Scraper — Subscription Box Data & Market Intelligence

The most complete Cratejoy Scraper on Apify. Extract full subscription box data from Cratejoy.com — box name, price, rating, review count, niche, shipping details, cancel policy, eco/cruelty-free flags, coupon codes, affiliate program availability, Instagram and TikTok handles, pros, cons, and full description. Auto-discovers products from any Cratejoy category or collection URL. Pay only for results.

🔍 What Is This Actor?

Cratejoy Scraper is a production-ready Apify actor that extracts comprehensive subscription box data from Cratejoy.com — the largest subscription box marketplace in the world.

Provide any Cratejoy category, collection, or search results URL and receive back a clean, structured record for every subscription box found: name, price, rating, review count, niche, category, shipping details, cancellation policy, skip and pause options, gift availability, trial offers, coupon codes, eco-friendly and cruelty-free flags, affiliate program availability, social media handles, availability status, inferred pros and cons, and full description.

This Cratejoy box data scraper auto-discovers product links from the start URL, paginates through all result pages, and scrapes each box detail page with concurrent fetching — delivering the most complete subscription box intelligence dataset available on Apify.


🚀 Why Use This Cratejoy Scraper?

| Feature | This Actor | Manual Research | Other Scrapers |
| --- | --- | --- | --- |
| Full subscription box data | ✅ 30+ fields | ❌ Slow | ⚠️ Basic |
| Rating & review count | ✅ | | ⚠️ |
| Cancel / skip / pause policy | ✅ | | |
| Eco / cruelty-free flags | ✅ Auto-detected | | |
| Coupon code extraction | ✅ | | |
| Affiliate program flag | ✅ | | |
| Instagram & TikTok handles | ✅ | | |
| Inferred pros & cons | ✅ Auto-generated | | |
| Auto-pagination | ✅ | | ⚠️ |
| Checkpoint resume | ✅ | N/A | |

Bottom line: This subscription box scraper is the only actor that extracts cancellation policy, eco flags, coupon codes, affiliate program availability, and social handles — alongside ratings, pricing, and full descriptions — making it the most complete Cratejoy box data tool on Apify.


🎯 Use Cases

📊 Subscription Box Market Research

  • Scrape Cratejoy box data across niches to analyze pricing, rating distributions, and shipping policies
  • Map the competitive landscape of subscription boxes in any category — beauty, food, gaming, books, pets
  • Identify trending niches by comparing review counts, availability flags, and growth signals across categories

💼 Competitor & Brand Intelligence

  • Monitor competitor subscription boxes for price changes, new coupon codes, and policy updates
  • Track eco-friendly and cruelty-free positioning across competing brands in your niche
  • Identify which subscription boxes offer affiliate programs for partnership outreach

🤖 AI & E-Commerce Datasets

  • Build structured datasets of Cratejoy box data for product recommendation or pricing models
  • Collect subscription box descriptions and metadata for NLP and content classification research
  • Use rating and review data for e-commerce sentiment analysis and purchase prediction models

🛒 Affiliate & Influencer Research

  • Find subscription boxes with active affiliate programs using the affiliate_program flag
  • Discover boxes with coupon codes for affiliate promotions
  • Extract Instagram and TikTok handles to identify social-first subscription brands
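As a rough sketch of the affiliate workflow above, the snippet below filters already-exported records for boxes that report an affiliate program and keeps their coupon codes and social handles. The `affiliate_leads` helper and the sample `records` list are illustrative assumptions; only the field names come from the output schema documented on this page.

```python
from typing import Any


def affiliate_leads(records: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Return a compact lead list for boxes whose affiliate_program flag is true."""
    leads = []
    for rec in records:
        if not rec.get("affiliate_program"):
            continue  # skip boxes without a detected affiliate program
        leads.append(
            {
                "box_name": rec.get("box_name"),
                "coupon_code": rec.get("coupon_code"),
                "instagram_handle": rec.get("instagram_handle"),
                "tiktok_handle": rec.get("tiktok_handle"),
                "source_url": rec.get("source_url"),
            }
        )
    return leads


# Hypothetical sample records shaped like the actor's output.
records = [
    {"box_name": "BarkBox Monthly Dog Treats", "affiliate_program": True,
     "coupon_code": None, "instagram_handle": "barkbox", "tiktok_handle": None,
     "source_url": "https://www.cratejoy.com/subscription-box/barkbox/"},
    {"box_name": "Example Tea Box", "affiliate_program": False,
     "coupon_code": "SAVE10", "instagram_handle": None, "tiktok_handle": None,
     "source_url": None},
]

leads = affiliate_leads(records)
print([lead["box_name"] for lead in leads])
```

Because `affiliate_program` is detected from page text, treat the resulting list as a starting point for outreach rather than a confirmed partner list.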

🏪 Subscription Box Launch & Positioning

  • Research existing boxes in your target niche before launching your own subscription product
  • Benchmark pricing, cancel policies, and trial offers across direct competitors
  • Identify gaps — niches with high demand but low review volume or limited eco-friendly options

🎓 Academic & Consumer Research

  • Build datasets of subscription box offerings for consumer behavior or e-commerce studies
  • Analyse policy flexibility (skip, pause, cancel anytime) as a driver of subscription satisfaction
  • Study the relationship between eco-friendly positioning and pricing across subscription products

⚙️ Input Parameters

{
  "startUrl": "https://www.cratejoy.com/collections/top-trending-boxes",
  "maxItems": 100,
  "maxPages": 25,
  "concurrency": 4,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| startUrl | string | Trending boxes page | Any Cratejoy category, collection, or search results URL |
| maxItems | integer | 100 | Maximum subscription boxes to scrape |
| maxPages | integer | 25 | Maximum result pages to paginate through |
| concurrency | integer | 4 | Number of product pages fetched in parallel (max 6) |
| proxyConfiguration | object | Optional | Apify proxy config; residential proxy recommended |

Tip: Use any Cratejoy collection, category, or search URL as startUrl. The actor auto-discovers product links and paginates automatically — no manual page navigation needed.
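If you build the input programmatically, a small helper can apply the documented defaults and the concurrency cap before a run is started. The sketch below is one possible approach: `build_run_input` is a hypothetical helper, and both the URL prefix check and the lower bound of 1 are assumptions, not documented validation rules of the actor.

```python
from typing import Any

MAX_CONCURRENCY = 6  # upper bound stated in the parameter table


def build_run_input(
    start_url: str,
    max_items: int = 100,
    max_pages: int = 25,
    concurrency: int = 4,
    use_residential_proxy: bool = True,
) -> dict[str, Any]:
    """Build an input object using the documented parameter names and defaults."""
    # Assumption: start URLs live under www.cratejoy.com, as in the examples here.
    if not start_url.startswith("https://www.cratejoy.com/"):
        raise ValueError("startUrl should be a Cratejoy category, collection, or search URL")
    if max_items < 1 or max_pages < 1:
        raise ValueError("maxItems and maxPages must be positive")

    run_input: dict[str, Any] = {
        "startUrl": start_url,
        "maxItems": max_items,
        "maxPages": max_pages,
        # Clamp to 1..6 parallel product pages (the lower bound is an assumption).
        "concurrency": max(1, min(concurrency, MAX_CONCURRENCY)),
    }
    if use_residential_proxy:
        run_input["proxyConfiguration"] = {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        }
    return run_input


run_input = build_run_input(
    "https://www.cratejoy.com/collections/top-trending-boxes",
    max_items=5,
    concurrency=10,
)
print(run_input)
```

The resulting dictionary has the same shape as the JSON input above, so it can be pasted into the Apify Console or passed as the run input when starting the actor through an API client.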


📋 Output Fields

Every record from this Cratejoy scraper includes complete Cratejoy box data:

📦 Core Box Fields

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| box_name | string | Subscription box name | "BarkBox Monthly Dog Treats" |
| niche | string | Box niche or sub-category | "Dog Toys & Treats" |
| category | string | Broad category from breadcrumb | "Pet Subscription Boxes" |
| price_monthly_quarterly_annual | string | Listed price | "23.00" |
| price_currency | string | Price currency symbol | "$" |
| rating | string | Average customer rating | "4.8" |
| review_count | integer | Total number of reviews | 342 |
| availability | string | Stock status | "In Stock", "Sold Out" |
| description | string | Full box description | "A monthly box of..." |
| source_url | string | Cratejoy product page URL | "https://www.cratejoy.com/..." |

🚚 Shipping & Policy Fields

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| shipping_free | boolean | Free shipping offered | true |
| shipping_countries | string | Shipping destination | "Worldwide", "US / UK" |
| cancel_policy | string | Cancellation policy | "Cancel Anytime", "Flexible Policy" |
| skip_allowed | boolean | Skip a month option available | true |
| pause_allowed | boolean | Pause subscription option | false |
| gift_option | boolean | Gift subscription available | true |
| trial_offer | boolean | Free trial or first box offer | false |
| coupon_code | string | Promo code found on page | "SAVE10" |

🌱 Brand & Values Fields

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| eco_friendly | boolean | Eco-friendly or sustainable packaging | true |
| cruelty_free | boolean | Cruelty-free products | true |
| certified_organic | boolean | Certified organic or 100% natural | false |
| affiliate_program | boolean | Affiliate or ambassador program available | true |
| is_new | boolean | Recently launched box | false |
| is_trending | boolean | Trending status | true |
| growth_signal | string | Demand signal from page content | "High Demand", "Stable Growth" |

📱 Social & Content Fields

| Field | Type | Description | Example |
| --- | --- | --- | --- |
| instagram_handle | string | Instagram username found on page | "barkbox" |
| tiktok_handle | string | TikTok username found on page | "barkbox" |
| items_per_box | string | Number of items per delivery | "5-7 items" |
| box_value_claimed | string | Claimed value of box contents | "$50+ value" |
| pros | string | Auto-inferred pros based on data | "Strong customer ratings; Free shipping" |
| cons | string | Auto-inferred cons based on data | "Shipping costs may apply" |

Note on Subscribers_count and Top_positive_review: These fields are included in the schema but are never populated — Cratejoy does not publicly display subscriber counts or individual review text in a consistent extractable format. They will always be null in your output.
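Since `price_monthly_quarterly_annual` and `rating` are delivered as strings and two fields are always null, a light post-processing step is often useful before analysis. The following is a minimal sketch under those assumptions; `normalize_record`, `to_float`, and the added `price_value` / `rating_value` keys are hypothetical names, not part of the actor's output.

```python
from typing import Any, Optional

# Fields documented above as always null; dropped here to keep rows compact.
ALWAYS_NULL_FIELDS = ("Subscribers_count", "Top_positive_review")


def to_float(value: Any) -> Optional[float]:
    """Convert a string such as "23.00" or "4.8" to float; None if not parseable."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def normalize_record(record: dict[str, Any]) -> dict[str, Any]:
    """Drop always-null fields and add numeric copies of price and rating."""
    cleaned = {k: v for k, v in record.items() if k not in ALWAYS_NULL_FIELDS}
    cleaned["price_value"] = to_float(record.get("price_monthly_quarterly_annual"))
    cleaned["rating_value"] = to_float(record.get("rating"))
    return cleaned


# Hypothetical record shaped like the documented output.
sample = {
    "box_name": "BarkBox Monthly Dog Treats",
    "price_monthly_quarterly_annual": "23.00",
    "rating": "4.8",
    "review_count": 342,
    "Subscribers_count": None,
    "Top_positive_review": None,
}

normalized = normalize_record(sample)
print(normalized)
```

The original string fields are left untouched, so the numeric copies can be recomputed or discarded without losing what the actor returned.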


📦 Example Input & Output

Input:

{
  "startUrl": "https://www.cratejoy.com/collections/top-trending-boxes",
  "maxItems": 5
}

Output (one record):

{
  "box_name": "BarkBox Monthly Dog Treats",
  "niche": "Dog Toys & Treats",
  "category": "Pet Subscription Boxes",
  "price_monthly_quarterly_annual": "23.00",
  "price_currency": "$",
  "rating": "4.8",
  "review_count": 342,
  "availability": "In Stock",
  "shipping_free": true,
  "shipping_countries": "US / UK",
  "cancel_policy": "Cancel Anytime",
  "skip_allowed": false,
  "pause_allowed": false,
  "gift_option": true,
  "trial_offer": false,
  "coupon_code": null,
  "eco_friendly": false,
  "cruelty_free": false,
  "certified_organic": false,
  "affiliate_program": true,
  "is_new": false,
  "is_trending": true,
  "growth_signal": "High Demand",
  "instagram_handle": "barkbox",
  "tiktok_handle": null,
  "items_per_box": "5-7 items",
  "box_value_claimed": "$40+ value",
  "pros": "Strong customer ratings; Free shipping; Gift-friendly",
  "cons": "Item mix may vary by month",
  "description": "A monthly box of dog toys and treats customized to your dog's size...",
  "source_url": "https://www.cratejoy.com/subscription-box/barkbox/"
}

💰 Pricing

This actor uses pay-per-event pricing: each run incurs a fixed start fee, and beyond that you pay only for subscription boxes successfully scraped.

| Event | Price |
| --- | --- |
| Actor start fee | $0.50 per run |
| Per box successfully scraped | $0.01 per result ($10.00 per 1,000 boxes) |

How billing works:

  • ✅ The $0.50 start fee applies once per run regardless of results
  • ✅ Each box record pushed to the dataset is charged at $0.01
  • ✅ Pages that fail to load or return no data are not charged
  • ✅ The actor stops automatically when your Apify account charge limit is reached
  • ✅ Checkpoint saves progress, so restarting after a spending limit is reached charges per-box fees only for boxes not already scraped

Example: Scrape 200 boxes = $0.50 (start) + $2.00 (200 × $0.01) = $2.50 total
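The same arithmetic can be wrapped in a small estimator for budgeting. This is a sketch based only on the two prices listed above; the function names are hypothetical, and amounts are handled in whole cents to avoid floating-point rounding surprises.

```python
START_FEE_CENTS = 50  # $0.50 per run, from the pricing table
PER_BOX_CENTS = 1     # $0.01 per box pushed to the dataset


def estimate_cost_usd(boxes: int, runs: int = 1) -> float:
    """Estimated charge in USD for scraping `boxes` results across `runs` runs."""
    if boxes < 0 or runs < 1:
        raise ValueError("boxes must be >= 0 and runs must be >= 1")
    total_cents = runs * START_FEE_CENTS + boxes * PER_BOX_CENTS
    return total_cents / 100


def max_boxes_for_budget(budget_usd: float, runs: int = 1) -> int:
    """Largest number of boxes that fits in the budget after the start fee(s)."""
    budget_cents = round(budget_usd * 100)
    remaining = budget_cents - runs * START_FEE_CENTS
    return max(0, remaining // PER_BOX_CENTS)


print(estimate_cost_usd(200))      # 200 boxes in a single run
print(max_boxes_for_budget(5.00))  # boxes affordable with a $5.00 budget
```

A value from `max_boxes_for_budget` can be used directly as `maxItems` to keep a run within a known budget.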

2-hour free trial available — click Try for free at the top of this page.


⚡ Performance & Limits

| Boxes | Estimated Time |
| --- | --- |
| 50 | ~4–8 minutes |
| 100 | ~8–15 minutes |
| 500 | ~35–60 minutes |
| 1,000 | ~70–120 minutes |

  • Product pages fetched in parallel (up to 6 concurrent) per listing page
  • Results pushed to the dataset immediately after each box is scraped
  • Checkpoint saved after every box — restart resumes from the last completed page
  • The actor stops cleanly when maxItems or spending limit is reached
  • Images, media, fonts, and stylesheets are blocked during browsing for maximum speed

❓ FAQ

Q: What should I use as startUrl?
A: Any Cratejoy category, collection, or search results page URL. Examples: https://www.cratejoy.com/collections/top-trending-boxes, https://www.cratejoy.com/subscription-boxes/beauty/, or a custom search URL. The actor discovers product links automatically.

Q: How are pros and cons generated?
A: Pros and cons are automatically inferred from the box data, not scraped from page text. pros is built from high ratings, free shipping, gift options, trial offers, and review volume. cons reflects sold-out status, lack of free shipping, or low review count.
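To make the inference described in this answer concrete, here is a rough sketch of how such rules could look. It is not the actor's actual implementation: the `infer_pros_cons` function, the rating and review thresholds, and most of the phrase wording are assumptions chosen for illustration.

```python
from typing import Any


def infer_pros_cons(
    record: dict[str, Any],
    high_rating: float = 4.5,  # assumed threshold, not documented
    min_reviews: int = 50,     # assumed threshold, not documented
) -> tuple[str, str]:
    """Derive semicolon-separated pros and cons strings from structured fields."""
    pros: list[str] = []
    cons: list[str] = []

    try:
        rating = float(record.get("rating"))
    except (TypeError, ValueError):
        rating = None
    if rating is not None and rating >= high_rating:
        pros.append("Strong customer ratings")

    if record.get("shipping_free"):
        pros.append("Free shipping")
    else:
        cons.append("Shipping costs may apply")

    if record.get("gift_option"):
        pros.append("Gift-friendly")
    if record.get("trial_offer"):
        pros.append("Trial offer available")

    if (record.get("review_count") or 0) >= min_reviews:
        pros.append("Established review volume")
    else:
        cons.append("Limited review volume")

    if record.get("availability") == "Sold Out":
        cons.append("Currently sold out")

    return "; ".join(pros), "; ".join(cons)


# Hypothetical record using the documented field names.
pros, cons = infer_pros_cons(
    {"rating": "4.8", "shipping_free": True, "gift_option": True,
     "trial_offer": False, "review_count": 342, "availability": "In Stock"}
)
print(pros)
print(cons)
```

Because both strings are derived from other fields in the same record, they add convenience rather than new information; filter on the underlying fields when precision matters.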

Q: Why are Subscribers_count and Top_positive_review always null?
A: Cratejoy does not display subscriber counts or structured review text in a consistent extractable format on public listing pages. These fields are always null and should be ignored in your workflow.

Q: How are eco-friendly, cruelty-free, and organic flags detected?
A: These boolean flags are detected by scanning the full page text for specific keywords such as "eco-friendly", "sustainable", "cruelty-free", "not tested on animals", and "certified organic". They reflect what the seller claims on their Cratejoy listing.
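Keyword scanning of this kind can be sketched in a few lines. The example below uses only the keywords quoted in this answer; the actor's full keyword list is not published here, and `detect_flags` and `FLAG_KEYWORDS` are hypothetical names.

```python
import re

# Keywords quoted in the FAQ answer; the real list may be longer.
FLAG_KEYWORDS = {
    "eco_friendly": ("eco-friendly", "sustainable"),
    "cruelty_free": ("cruelty-free", "not tested on animals"),
    "certified_organic": ("certified organic",),
}


def detect_flags(page_text: str) -> dict[str, bool]:
    """Return one boolean per flag based on case-insensitive substring matches."""
    # Lowercase and collapse whitespace so phrases split across lines still match.
    text = re.sub(r"\s+", " ", page_text.lower())
    return {
        flag: any(keyword in text for keyword in keywords)
        for flag, keywords in FLAG_KEYWORDS.items()
    }


flags = detect_flags(
    "Shipped in Eco-Friendly packaging. Our products are not tested\non animals."
)
print(flags)
```

Plain substring matching is deliberately simple and has known blind spots: it would also match "unsustainable", and it cannot tell a claim from a negation. This is one reason to treat the flags as seller claims rather than verified attributes.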

Q: Can I scrape a specific niche or category?
A: Yes. Navigate to that category on Cratejoy, copy the URL, and use it as startUrl. The actor will only scrape boxes from that category.

Q: What happens if a box page fails to load?
A: The actor logs the failure, skips that URL, and continues with the remaining boxes. No charge is applied for failed pages.

Q: Can I export results to Excel or CSV?
A: Yes. All results are pushed to the Apify dataset, which can be exported to JSON, CSV, Excel, and more directly from the Apify Console after each run.
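If you download results as JSON and want to produce a CSV yourself, for example after filtering records, the standard library is enough. The sketch below assumes the records are already loaded as a list of dictionaries; `records_to_csv` is a hypothetical helper.

```python
import csv
import io
from typing import Any


def records_to_csv(records: list[dict[str, Any]]) -> str:
    """Serialize records to CSV text, using the union of keys as the header."""
    # Collect column names in first-seen order so the header is stable.
    fieldnames: list[str] = []
    for rec in records:
        for key in rec:
            if key not in fieldnames:
                fieldnames.append(key)

    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)  # missing keys and None values become empty cells
    return buffer.getvalue()


# Hypothetical records with slightly different key sets.
csv_text = records_to_csv(
    [
        {"box_name": "A", "rating": "4.8"},
        {"box_name": "B", "coupon_code": "SAVE10"},
    ]
)
print(csv_text)
```

Writing `csv_text` to a file with `newline=""` produces a file that opens directly in Excel or any spreadsheet tool.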


📜 Changelog

v1.0.0 (Current)

  • ✅ Full subscription box data — 30+ fields per record
  • ✅ Auto-discovery of product links from any Cratejoy category or collection URL
  • ✅ Automatic pagination up to maxPages
  • ✅ Concurrent product page fetching (up to 6 parallel)
  • ✅ Cancel, skip, and pause policy detection
  • ✅ Eco-friendly, cruelty-free, and organic flag detection
  • ✅ Coupon code and affiliate program extraction
  • ✅ Instagram and TikTok handle extraction
  • ✅ Auto-inferred pros and cons from structured data
  • ✅ Checkpoint/resume — saves progress after every box
  • ✅ Pay-per-event billing — charged per successfully scraped box
  • ✅ Spending limit respect — stops cleanly when charge limit reached

🏷️ Tags

cratejoy scraper, subscription box scraper, cratejoy box data, subscription box data, cratejoy data extractor, subscription marketplace scraper, box niche research, subscription pricing data, cratejoy product scraper, e-commerce subscription data, subscription box market research, cratejoy intelligence


This actor accesses publicly visible subscription box listing data on Cratejoy.com in the same way a regular user browses the platform.

Please note:

  • Use extracted Cratejoy box data only for lawful purposes — market research, competitive analysis, affiliate research, and academic study are common legitimate uses
  • Do not republish Cratejoy listing content in a way that violates Cratejoy's Terms of Service
  • Box descriptions, images, and metadata belong to the respective subscription box brands
  • The actor developer is not responsible for how extracted data is used

🤝 Support & Feedback

  • Bug report? Contact us via the Apify actor page
  • Feature request? Post in the Apify Community forum
  • Loving it? Please leave a ⭐ review — it helps other users find this actor!

Built with ❤️ on Apify
The most complete Cratejoy Scraper — 30+ fields, auto-pagination, pay per result

💰 $0.50 per run + $10.00 per 1,000 boxes · Pay only for results