Barnes & Noble Intelligence Scraper

Pricing

from $5.00 / 1,000 results

Scrape Barnes & Noble search, category, and product pages with pagination. Extract book/product metadata, pricing, ratings, availability, and commerce intelligence signals.

Developer

Ian Dikhtiar

Maintained by Community

Scrape Barnes & Noble catalog data with pagination-aware discovery, book metadata, product URLs, ratings/prices when page access allows them, and commerce intelligence.

This actor is built for publishing, retail analytics, book-market research, catalog enrichment, and product URL discovery. It uses structured data when Barnes & Noble pages are accessible, falls back to visible product cards, follows pagination, and includes a public sitemap fallback so protected pages do not turn into empty runs.

What It Extracts

  • Product title and URL
  • Author / contributor
  • ISBN-13, ISBN-10, UPC, SKU when available
  • Format, publisher, and publication date
  • Price, list price, currency, discount percent
  • Availability / stock text
  • Rating and review count
  • Image URL
  • Product description / overview
  • Search rank and source query
  • Buyer signals and an intelligence score

Intelligence Fields

Every row is enriched with:

  • discountPercent: detected markdown from list price to current price
  • intelligenceScore: 0-100 score based on metadata completeness, price, availability, rating, and reviews
  • buyerSignals: tags such as high-rated, review-heavy, discounted, preorder, availability-risk, and book-format-detected
  • opportunitySummary: compact readable summary for dashboards or alerts

The actor also writes SUMMARY and OUTPUT key-value-store records with product count, price stats, format breakdown, availability breakdown, top authors, best enriched rows, and warnings.
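The enrichment fields above can be approximated offline, for example to re-score rows after merging them with another data source. The following is a minimal sketch, not the actor's actual logic: the `listPrice` field name and the rating and review thresholds are assumptions chosen for illustration.

```python
from typing import Optional


def discount_percent(list_price: Optional[float], price: Optional[float]) -> float:
    """Markdown from list price to current price, in percent (0.0 if unknown)."""
    if not list_price or price is None or price >= list_price:
        return 0.0
    return round((list_price - price) / list_price * 100, 1)


def buyer_signals(row: dict) -> list:
    """Derive a few of the documented tags from a row.

    The cutoffs (4.5 rating, 1000 reviews) are illustrative assumptions,
    as is the "listPrice" key.
    """
    signals = []
    if (row.get("rating") or 0) >= 4.5:
        signals.append("high-rated")
    if (row.get("reviewCount") or 0) >= 1000:
        signals.append("review-heavy")
    if discount_percent(row.get("listPrice"), row.get("price")) > 0:
        signals.append("discounted")
    return signals


row = {"rating": 4.8, "reviewCount": 12345, "price": 16.99}
print(discount_percent(20.0, 15.0))
print(buyer_signals(row))
```

Rows without a list price simply get a discount of 0.0 and no `discounted` tag, which matches how the output example reports `"discountPercent": 0`.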

Pagination

Pagination is enabled by default. The actor discovers next-page links from search/category pages and respects maxPages so runs remain predictable.

For broad market snapshots, increase maxItems and maxPages. Increase requestDelayMs if the site starts throttling or blocking requests.
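To show why a page budget keeps runs predictable, here is a minimal sketch of a bounded crawl over next-page links. It is not the actor's implementation: `crawl`, `fetch_links`, and the toy `site` mapping are hypothetical names used only for this example.

```python
from collections import deque
from typing import Callable, Iterable, List


def crawl(start_urls: List[str],
          fetch_links: Callable[[str], Iterable[str]],
          max_pages: int) -> List[str]:
    """Visit pages breadth-first and stop once max_pages pages were visited."""
    queue = deque(start_urls)
    seen = set(start_urls)
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        visited.append(url)
        # Enqueue each newly discovered next-page link exactly once.
        for nxt in fetch_links(url):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return visited


# Toy stand-in for a paginated search: each page links to the next one.
site = {"p1": ["p2"], "p2": ["p3"], "p3": ["p4"], "p4": []}
pages = crawl(["p1"], lambda u: site.get(u, []), max_pages=3)
print(pages)
```

With a budget of three pages the crawl stops before `p4`, no matter how long the pagination chain is.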

Input

  • queries: Barnes & Noble search terms, for example "atomic habits", "romantasy paperback", or "manga box set"
  • startUrls: search, category, or product URLs from barnesandnoble.com
  • maxItems: maximum unique products to save
  • maxPages: maximum pages to visit across search/category/detail/pagination requests
  • includeProductDetails: opens detail pages for richer metadata
  • includePagination: follows next-page links
  • minIntelligenceScore: optional post-filter
  • requestDelayMs: delay between requests
  • useSitemapFallback: returns matching products from the public Barnes & Noble product sitemap when search/product pages are blocked
  • maxSitemapFiles: maximum sitemap files to scan in fallback mode
  • proxyConfiguration: Apify proxy configuration

Example:

{
  "queries": ["atomic habits", "manga box set"],
  "maxItems": 100,
  "maxPages": 10,
  "includeProductDetails": true,
  "includePagination": true,
  "useSitemapFallback": true
}
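The same input can be built and submitted from Python. This is a sketch under stated assumptions: it relies on the `apify-client` package, and `actor_id` is a placeholder you must replace with this actor's real identifier from the Apify Store. `build_run_input` and `run_actor` are hypothetical helper names.

```python
import json


def build_run_input(queries, max_items=100, max_pages=10):
    """Assemble an input object using the field names documented above."""
    return {
        "queries": list(queries),
        "maxItems": max_items,
        "maxPages": max_pages,
        "includeProductDetails": True,
        "includePagination": True,
        "useSitemapFallback": True,
    }


def run_actor(token, actor_id, run_input):
    """Start a run and return its dataset rows (needs network and a token)."""
    # Imported lazily so the rest of the sketch works without the package.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


run_input = build_run_input(["atomic habits", "manga box set"])
print(json.dumps(run_input, indent=2))
```

Calling `run_actor(token, actor_id, run_input)` with your API token and the actor's ID should return the saved rows as a list of dictionaries.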

Output Example

{
  "title": "Atomic Habits",
  "author": "James Clear",
  "isbn13": "9780735211292",
  "price": 16.99,
  "currency": "USD",
  "availability": "InStock",
  "rating": 4.8,
  "reviewCount": 12345,
  "url": "https://www.barnesandnoble.com/w/atomic-habits-james-clear/1129201155",
  "discountPercent": 0,
  "intelligenceScore": 86,
  "buyerSignals": ["high-rated", "review-heavy"],
  "opportunitySummary": "by James Clear | $16.99 | 4.8 rating | high-rated, review-heavy"
}
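Rows in this shape are easy to post-filter on the client side, for instance to build an alert list. The sketch below assumes only the `intelligenceScore` and `buyerSignals` fields shown in the example; `shortlist` and the second sample row are hypothetical.

```python
def shortlist(rows, min_score=80, required_signal=None):
    """Keep rows at or above min_score, optionally requiring one signal tag."""
    picked = [
        r for r in rows
        if (r.get("intelligenceScore") or 0) >= min_score
        and (required_signal is None
             or required_signal in (r.get("buyerSignals") or []))
    ]
    # Highest-scoring rows first.
    return sorted(picked, key=lambda r: r.get("intelligenceScore") or 0, reverse=True)


rows = [
    {"title": "Atomic Habits", "intelligenceScore": 86,
     "buyerSignals": ["high-rated", "review-heavy"]},
    # Made-up row standing in for a sparse result.
    {"title": "Hypothetical Sparse Row", "intelligenceScore": 40,
     "buyerSignals": []},
]
top = shortlist(rows, min_score=80, required_signal="high-rated")
print([r["title"] for r in top])
```

The minIntelligenceScore input applies a score filter inside the actor; a client-side filter like this one is useful when you want to keep the full dataset and derive several views from it.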

Reliability Notes

Barnes & Noble is protected by Akamai and can deny requests for search, category, and product HTML. This actor includes a browser fallback and a sitemap fallback. Sitemap fallback rows prioritize reliable product discovery and usually include title, inferred author, Barnes & Noble product URL, SKU, last modified date, and intelligence fields. Live price, rating, and availability require page access.

For best reliability:

  • Keep includeProductDetails on for smaller, richer runs
  • Increase requestDelayMs for large catalog runs
  • Use Apify Residential Proxy
  • Keep sitemap fallback enabled for smoke tests and QA runs
  • Keep maxPages bounded
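Because sitemap fallback rows lack live price, rating, and availability, it can help to separate them from fully scraped rows before analysis. The sketch below uses a simple heuristic, which is an assumption: a row counts as live if any of those three fields is present. The actor may expose a more direct marker, and `split_rows` and the sample rows are hypothetical.

```python
LIVE_FIELDS = ("price", "rating", "availability")


def has_live_data(row):
    """True if the row carries at least one field that needs page access."""
    return any(row.get(field) is not None for field in LIVE_FIELDS)


def split_rows(rows):
    """Return (live_rows, fallback_rows) based on the heuristic above."""
    live = [r for r in rows if has_live_data(r)]
    fallback = [r for r in rows if not has_live_data(r)]
    return live, fallback


sample = [
    {"title": "Atomic Habits", "price": 16.99, "rating": 4.8},
    {"title": "Hypothetical Sitemap Row", "url": "https://www.barnesandnoble.com/"},
]
live, fallback = split_rows(sample)
print(len(live), len(fallback))
```

A high share of fallback rows in a run is a hint to raise requestDelayMs or switch proxy settings, as suggested in the list above.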