๐ŸŽฌ IMDB Scraper โ€” Movies, Ratings & Box Office avatar

๐ŸŽฌ IMDB Scraper โ€” Movies, Ratings & Box Office

Pricing

from $20.00 / 1,000 results

Go to Apify Store
๐ŸŽฌ IMDB Scraper โ€” Movies, Ratings & Box Office

๐ŸŽฌ IMDB Scraper โ€” Movies, Ratings & Box Office

Extract movies & TV shows from IMDb โ€” ratings, cast, crew, box office, plot summaries. TMDB, OMDb & JustWatch alternative for streaming apps, recommendation engines and entertainment data pipelines. Pay per title โ€” no API key needed.

Pricing

from $20.00 / 1,000 results

Rating

0.0

(0)

Developer

NexGenData

NexGenData

Maintained by Community

Actor stats

0

Bookmarked

9

Total users

2

Monthly active users

4 days ago

Last modified

Share

๐ŸŽฌ IMDb Scraper โ€” Movies, TV, Cast, Ratings, Box Office & Reviews at Scale

Extract structured movie + TV data from IMDb โ€” titles, year, runtime, genre, cast, crew, ratings, box-office, plot, reviews, parental guide. A drop-in alternative to IMDb's non-commercial datasets, TMDB API, OMDb API, and JustWatch โ€” with full reviews + box-office data and no daily call quota.

Why IMDb Scraper Beats IMDb Datasets, TMDB, OMDb & JustWatch

FeatureNexGenData IMDb ScraperIMDb non-commercial datasetsTMDB APIOMDb APIJustWatch
Cost$3 per 1,000 results, pay-per-eventFree but flat-file onlyFree + paid tiers$1+ / month with limitsPartner-only
Commercial useYesNon-commercial onlyWith API keyWith API keyPartner contract
Box-office dataYes (opening, gross, budget)NoLimitedLimitedNo
Full reviewsYesNoUser-submittedNoNo
Cast + crewFull creditsYesYesLimitedLimited
RatingsIMDb rating + Metacritic + Rotten Tomatoes where shownIMDb onlyTMDB ratingIMDb + MetacriticYes
Parental guideYesNoLimitedNoNo
Bulk exportCSV / JSON / Excel / APIFlat file onlyAPIAPIPartner contract
AuthApify tokenNoneAPI keyAPI keyPartner contract
Time-to-first-row< 60 secondsDownload GB-size dumpsSignupSignupSales cycle

Most media analytics + recommendation-engine teams pick this scraper because it is commercial-use friendly (unlike the IMDb non-commercial dumps), broader than OMDb's title-search response, and richer than TMDB's missing box-office numbers. JSON drops straight into your recommendation pipeline.

What You Get

Per title returned, the scraper includes:

  • IMDb ID + canonical IMDb URL
  • Title + original title + year + content type (movie, tv series, short, miniseries, video game)
  • Genre โ€” array of genres
  • Runtime (minutes) + content rating (PG, PG-13, R, etc.)
  • Plot summary + full synopsis where available
  • IMDb rating (1โ€“10) + vote count
  • Metacritic score + Rotten Tomatoes score where shown
  • Release date(s) โ€” worldwide + per-country
  • Cast โ€” array of name + role + actor IMDb ID
  • Crew โ€” director, writer, producer, cinematographer, composer
  • Production companies + countries + languages
  • Box office โ€” opening weekend, domestic gross, worldwide gross, budget
  • Reviews โ€” array of user reviews (author, rating, date, text), configurable depth
  • Awards โ€” wins + nominations summary
  • Parental guide โ€” sex, violence, profanity, alcohol, frightening
  • Where to watch flags where shown
  • Photos + posters โ€” image URLs
  • Scrape timestamp

Output streams to a clean Apify dataset โ€” CSV, JSON, Excel, RSS, or API.

Use Cases

  • Streaming services + content acquisition โ€” build a structured catalog of every title under consideration for licensing
  • Recommendation engines โ€” train collaborative-filtering models on rating + genre + cast vectors
  • Media analytics + journalism โ€” benchmark box-office performance across studios, years, genres
  • Trivia + entertainment apps โ€” power the database behind a movie-trivia mobile app
  • Academic research โ€” film studies, sociology of media, cinematic-language NLP
  • Fan sites + verticals โ€” build a niche horror-genre database with full reviews
  • Marketing intelligence โ€” monitor the review-sentiment swing of a launching title in week one

Quick Start

from apify_client import ApifyClient
client = ApifyClient("YOUR_APIFY_TOKEN")
run_input = {
"searchQueries": ["Christopher Nolan films"],
"maxResults": 50,
"includeReviews": True,
"maxReviewsPerTitle": 5,
}
run = client.actor("nexgendata/imdb-scraper").call(run_input=run_input)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item.get("title"), item.get("year"), item.get("imdbRating"))

Pricing

Pay-per-event. $3 per 1,000 results plus a small Actor Start fee. No monthly subscription. Compare to OMDb at $1+ per month per limit-tier or TMDB's paid plans โ€” this actor stays flat-cost regardless of volume.

Use caseActor
YouTube video metadata + downloadYT-DLP Video Tools
Detect YouTube channel monetization signalYouTube Channel Monetization Detector
Wikipedia structured-knowledge scraperWikipedia Scraper
Hacker News stories + comments scraperHacker News Scraper
Reddit subreddit + post trend trackerReddit Subreddit Trends
AI sentiment analysis on reviewsAI Sentiment Analyzer
Google Scholar paper searchGoogle Scholar Scraper
Bilibili Chinese video searchBilibili Video Search

FAQ

Is this legal? This actor reads publicly displayed IMDb pages. IMDb publishes non-commercial flat-file datasets for free; commercial use of those is prohibited. This actor reads the live HTML the same way a browser would and is suitable for commercial use under your own jurisdiction's rules โ€” we recommend checking IMDb's terms for your use case.

How fresh is the data? Live on every run. Each invocation pulls the current IMDb page at runtime โ€” critical for box-office numbers that change weekly.

Does it include reviews? Yes โ€” set includeReviews: true and maxReviewsPerTitle: N to attach up to N user reviews per title with author / rating / date / full text.

Does it include cast and crew? Yes โ€” every result includes the full top-billed cast and key crew with their IMDb IDs so you can resolve actor-graph relationships.

TV series support? Yes โ€” series + season + episode metadata are all supported.

Can I schedule it weekly? Yes โ€” native Apify scheduling. Common pattern: refresh every Friday for box-office update.

Output formats? JSON by default; CSV / Excel / RSS via the dataset API.

About NexGenData

NexGenData publishes 260+ buyer-intent actors covering SEC filings, YC alumni, lead generation, competitive intelligence, stock fundamentals across 30+ exchanges, and more. All pay-per-result. Browse the full catalog at https://apify.com/nexgendata?fpr=2ayu9b


How NexGenData Pricing Works

Every NexGenData actor uses pay-per-event pricing โ€” you only pay for results that actually land in your dataset. No monthly minimum, no seat fees, no surprise overage bills.

  • Actor Start: a single-event charge each time you spin the actor up (scaled to memory size)
  • Result: charged per item written to the default dataset
  • No charge for retries, internal proxy rotation, or failed sub-requests โ€” those are absorbed by the platform

If you only need the data once a quarter, you only pay once a quarter. If you scale to millions of records, the unit cost stays the same.

Apify Platform Bonus

New to Apify? Sign up with the NexGenData referral link โ€” you get free platform credits on signup (enough for several thousand free results) and you help fund the maintenance of this actor fleet.

Integration Surface

Every actor in the NexGenData catalog can be triggered from:

  • Apify console โ€” point-and-click run
  • Apify API โ€” REST + webhooks
  • Apify Python / JS SDKs โ€” programmatic batch
  • Zapier, Make.com, n8n โ€” official integrations
  • MCP โ€” many actors are exposed as MCP tools for Claude / ChatGPT / Cursor agents
  • Schedules โ€” built-in cron for daily / weekly / monthly runs
  • Webhooks โ€” POST results to any HTTPS endpoint on dataset write

Support

NexGenData maintains 260+ Apify actors and ships updates regularly. Bug reports via the Apify console issues tab get a response within 24 hours. Roadmap requests are welcome โ€” high-demand features ship in the next version.

Home: thenextgennexus.com Full catalog: apify.com/nexgendata