Barnes & Noble Intelligence Scraper
Pricing
from $5.00 / 1,000 results
Barnes & Noble Intelligence Scraper
Scrape Barnes & Noble search, category, and product pages with pagination. Extract book/product metadata, pricing, ratings, availability, and commerce intelligence signals.
Pricing
from $5.00 / 1,000 results
Rating
0.0
(0)
Developer
Ian Dikhtiar
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
7 days ago
Last modified
Categories
Share
Scrape Barnes & Noble catalog data with pagination-aware discovery, book metadata, product URLs, ratings/prices when page access allows them, and commerce intelligence.
This actor is built for publishing, retail analytics, book-market research, catalog enrichment, and product URL discovery. It uses structured data when Barnes & Noble pages are accessible, falls back to visible product cards, follows pagination, and includes a public sitemap fallback so protected pages do not turn into empty runs.
What It Extracts
- Product title and URL
- Author / contributor
- ISBN-13, ISBN-10, UPC, SKU when available
- Format, publisher, and publication date
- Price, list price, currency, discount percent
- Availability / stock text
- Rating and review count
- Image URL
- Product description / overview
- Search rank and source query
- Buyer signals and an intelligence score
Intelligence Fields
Every row is enriched with:
- discountPercent: detected markdown from list price to current price
- intelligenceScore: 0-100 score based on metadata completeness, price, availability, rating, and reviews
- buyerSignals: tags such as high-rated, review-heavy, discounted, preorder, availability-risk, and book-format-detected
- opportunitySummary: compact readable summary for dashboards or alerts
The actor also writes SUMMARY and OUTPUT key-value-store records with product count, price stats, format breakdown, availability breakdown, top authors, best enriched rows, and warnings.
Pagination
Pagination is enabled by default. The actor discovers next-page links from search/category pages and respects maxPages so runs remain predictable.
For broad market snapshots, increase maxItems and maxPages. Increase requestDelayMs if the site starts pushing back.
Input
- queries: Barnes & Noble search terms, for example atomic habits, romantasy paperback, manga box set
- startUrls: search, category, or product URLs from barnesandnoble.com
- maxItems: maximum unique products to save
- maxPages: maximum pages to visit across search/category/detail/pagination requests
- includeProductDetails: opens detail pages for richer metadata
- includePagination: follows next-page links
- minIntelligenceScore: optional post-filter
- requestDelayMs: delay between requests
- useSitemapFallback: returns matching products from the public Barnes & Noble product sitemap when search/product pages are blocked
- maxSitemapFiles: maximum sitemap files to scan in fallback mode
- proxyConfiguration: Apify proxy configuration
Example:
{ "queries": ["atomic habits", "manga box set"], "maxItems": 100, "maxPages": 10, "includeProductDetails": true, "includePagination": true, "useSitemapFallback": true }
Output Example
{ "title": "Atomic Habits", "author": "James Clear", "isbn13": "9780735211292", "price": 16.99, "currency": "USD", "availability": "InStock", "rating": 4.8, "reviewCount": 12345, "url": "https://www.barnesandnoble.com/w/atomic-habits-james-clear/1129201155", "discountPercent": 0, "intelligenceScore": 86, "buyerSignals": ["high-rated", "review-heavy"], "opportunitySummary": "by James Clear | $16.99 | 4.8 rating | high-rated, review-heavy" }
Reliability Notes
Barnes & Noble is protected by Akamai and can deny search, category, and product HTML. This actor includes a browser fallback and a sitemap fallback. Sitemap fallback rows prioritize reliable product discovery and usually include title, inferred author, Barnes & Noble product URL, SKU, last modified date, and intelligence fields; live price/rating/availability require page access.
For best reliability:
- Keep includeProductDetails on for smaller, richer runs
- Increase requestDelayMs for large catalog runs
- Use Apify Residential Proxy
- Keep sitemap fallback enabled for smoke tests and QA runs
- Keep maxPages bounded