TMDB Movie & TV Metadata Scraper avatar

TMDB Movie & TV Metadata Scraper

Pricing

Pay per event

Go to Apify Store
TMDB Movie & TV Metadata Scraper

TMDB Movie & TV Metadata Scraper

Scrape rich metadata for movies and TV shows from The Movie Database (TMDB) — no API key required. Discovers titles from public browse pages and extracts full detail records including cast, directors, genres, keywords, ratings, runtime, and production companies.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

17 hours ago

Last modified

Share

Scrape rich metadata for movies and TV shows from The Movie Database (TMDB)no API key required. Discovers titles from TMDB's public discover/browse pages and extracts full detail records including cast, directors, genres, keywords, ratings, runtime, original language, and production companies.

What you get

Each record in the output dataset contains:

FieldDescription
tmdb_idNumeric TMDB ID
titleMovie or TV show title
media_typemovie or tv
tmdb_urlCanonical TMDB page URL
original_titleTitle in the original language
release_dateFirst release or air date (YYYY-MM-DD)
vote_averageAggregate rating (0–10 scale)
vote_countNumber of votes
user_score_percentUser score percentage (0–100)
overviewPlot summary or show description
genresComma-separated genre names
runtime_minutesRuntime in minutes
original_languageOriginal language
production_companiesComma-separated production company names
imdb_idIMDb ID (e.g. tt0137523) — when listed on the TMDB page
cast_topComma-separated top-billed cast names
directorsComma-separated director names
keywordsComma-separated TMDB keyword tags

Why use this instead of the TMDB API?

Most TMDB scrapers on the Apify Store are thin wrappers around the TMDB REST API — they require you to register for and manage your own API key and stay within TMDB's per-account rate limits. This actor scrapes TMDB's public web pages directly, so:

  • No API key registration or management
  • No per-account rate limits to worry about
  • Both movies and TV shows in one unified output schema
  • Includes fields not always easily queryable via API (IMDb cross-ID, keyword tags, top cast)

Inputs

InputTypeDefaultDescription
maxItemsinteger15Maximum number of records to return. Set to 0 for no limit.
mediaTypestringbothWhich media type to scrape: movie, tv, or both.
startPageinteger1Discover page to start from (each page has ~20 titles).

Example use cases

  • Media server catalogs: Build or enrich metadata catalogs for Plex, Jellyfin, or Kodi libraries without managing API credentials.
  • Recommendation engines: Feed movie/TV metadata into ML pipelines — genres, keywords, cast, and ratings in one schema.
  • Cross-referencing: Use imdb_id to join TMDB data with IMDb datasets for enriched analytics.
  • Market research: Track ratings and popularity trends across the TMDB catalog over time.

How it works

  1. Discover: Crawls paginated TMDB browse pages (/movie?language=en-US&page=N, /tv?...) — 20 titles per page, up to 500 pages per type.
  2. Detail: For each title, fetches the detail page and extracts:
    • JSON-LD (schema.org Movie / TVSeries): name, description, rating, genres, runtime, release date
    • DOM: user score chart, directors, cast, keywords, original title, language, production companies

Notes

  • TMDB's discover pages order titles by popularity (most popular first). Use startPage to offset into the catalog.
  • The imdb_id field is populated only when TMDB links to IMDb on the detail page — this is common for well-known titles but may be absent for obscure entries.
  • Runtime is in minutes for movies. For TV shows, TMDB typically reports the average episode length.
  • The language=en-US parameter is appended to all requests to ensure English metadata in the output.