TheMoviedb Scraper avatar
TheMoviedb Scraper

Pricing

Pay per usage

Go to Apify Store
TheMoviedb Scraper

TheMoviedb Scraper

Introducing TheMoviedb Scraper! This versatile actor extracts rich movie, TV, and celebrity data from TheMovieDB. It offers two modes: use the official API for fast, stable results, or scrape directly without a key. Your complete media data solution.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

1

Monthly active users

13 hours ago

Last modified

Share

TMDb Scraper

Extract comprehensive data from The Movie Database (TMDb) effortlessly. Scrape movies, TV shows, and celebrity information with ease.

Apify Ready Build Status Runs

📖 Description

TMDb Scraper is a powerful and flexible tool designed for extracting detailed information from The Movie Database (TMDb). Whether you're building a personal project, conducting academic research, or developing a commercial application, this scraper offers a reliable and user-friendly solution to gather data on movies, TV shows, and people.

This actor prioritizes the official TMDb API for fast and accurate data retrieval, with a built-in fallback to web scraping methods when necessary. It supports customizable search parameters, allowing you to filter by genre, year, popularity, and more.

Key Benefits

  • Comprehensive Data Collection: Gather rich metadata including ratings, reviews, cast, crew, images, and keywords.
  • Flexible Search Capabilities: Search by keywords, genres, release years, and specific queries.
  • High-Quality Output: Structured JSON data ready for integration into your applications.
  • Reliable Performance: Optimized for speed and efficiency with configurable delays to respect rate limits.

✨ Features

  • Multi-Content Support: Scrape data for movies, TV shows, and people.
  • Advanced Filtering: Use genre IDs, year ranges, and sorting options to refine results.
  • Rich Metadata Extraction: Collect reviews, images, keywords, and collection details.
  • API-First Approach: Leverages TMDb's official API for superior data quality and speed.
  • Customizable Limits: Control the number of results, pages, and items per content.
  • Delay Management: Built-in delays to ensure compliance with TMDb's terms of service.

🔧 Input Configuration

Configure the scraper using the following input parameters to tailor the data extraction to your needs.

FieldTypeDescriptionDefault
apiKeyStringYour TMDb API key for enhanced reliability and speed. Obtain one from TMDb.YOUR_TMDB_API_KEY_HERE
useApiFirstBooleanPrioritize the TMDb API over web scraping.true
contentTypeStringType of content to scrape: movie, tv, or person.tv
searchQueriesStringComma-separated list of search terms for content queries.""
genreIdsStringComma-separated genre IDs to filter results (e.g., "28" for Action).""
yearFromIntegerStarting year for release date filtering.
yearToIntegerEnding year for release date filtering.
resultsWantedIntegerMaximum number of main results to return.5
maxPagesIntegerMaximum pages to scrape from search results.3
sortByStringSorting criteria (e.g., popularity.desc, vote_average.desc).popularity.desc
collectPeopleBooleanInclude cast and crew information.true
collectReviewsBooleanGather user reviews and ratings.true
collectKeywordsBooleanExtract associated keywords and tags.true
collectImagesBooleanDownload high-resolution images.true
collectCollectionsBooleanFor movies, include collection details.false
maxReviewsPerContentIntegerLimit reviews collected per item.25
maxImagesPerContentIntegerLimit images collected per item.20
minDelayMsIntegerMinimum delay between requests (ms).1000
maxDelayMsIntegerMaximum delay between requests (ms).3000
peopleQueryStringSearch term for people (when contentType is person).
peopleResultsWantedIntegerMaximum people results to return.3

📤 Output Data

The scraper outputs structured JSON data for each scraped item. Below are examples of the data formats for different content types.

Movie Data

{
"tmdb_id": 27205,
"title": "Inception",
"overview": "Cobb, a skilled thief who commits corporate espionage by infiltrating the subconscious of his targets, is offered a chance to regain his old life as payment for a task considered to be impossible: 'inception', the implantation of another person's idea into a target's subconscious.",
"release_date": "2010-07-15",
"vote_average": 8.4,
"popularity": 102.6,
"content_type": "movie",
"genres": ["Action", "Science Fiction", "Thriller"],
"cast": [...],
"crew": [...],
"reviews": [...],
"images": [...],
"keywords": [...]
}

TV Show Data

{
"tmdb_id": 1396,
"title": "Breaking Bad",
"overview": "When Walter White, a New Mexico chemistry teacher, is diagnosed with Stage III cancer and given a prognosis of only two years left to live. He breaks bad.",
"first_air_date": "2008-01-20",
"vote_average": 8.8,
"popularity": 297.4,
"content_type": "tv",
"genres": ["Crime", "Drama", "Thriller"],
"cast": [...],
"crew": [...],
"reviews": [...],
"images": [...],
"keywords": [...]
}

Person Data

{
"person_id": 6193,
"name": "Leonardo DiCaprio",
"biography": "Leonardo Wilhelm DiCaprio is an American actor, producer, and environmentalist. He has often played unconventional roles, particularly in biopics and period films.",
"birthday": "1974-11-11",
"known_for_department": "Acting",
"popularity": 35.1,
"filmography": [...]
}

🚀 Usage Examples

{
"contentType": "movie",
"yearFrom": 2024,
"yearTo": 2024,
"sortBy": "popularity.desc",
"resultsWanted": 5,
"collectReviews": true,
"collectImages": true
}

Example 2: Get Details of Specific TV Shows

{
"contentType": "tv",
"searchQueries": ["Breaking Bad", "Game of Thrones", "Stranger Things"],
"collectPeople": true,
"collectReviews": true,
"maxReviewsPerContent": 10
}

Example 3: Find Celebrities by Name

{
"contentType": "person",
"peopleQuery": "Leonardo DiCaprio",
"peopleResultsWanted": 3
}

Example 4: Scrape Action Movies with High Ratings

{
"contentType": "movie",
"genreIds": "28",
"sortBy": "vote_average.desc",
"resultsWanted": 5,
"collectKeywords": true
}

⚙️ How to Use

  1. Set Up Your API Key: Provide your TMDb API key in the apiKey field for optimal performance.
  2. Configure Inputs: Use the input parameters to specify your search criteria and data collection preferences.
  3. Run the Actor: Execute the scraper on the Apify platform.
  4. Download Results: Access and download the scraped data in JSON format.

📊 Limits and Considerations

  • Rate Limiting: Respects TMDb API limits with configurable delays.
  • Data Volume: Large result sets may take longer to process.
  • API Key Requirements: Using your own API key is recommended for better reliability.
  • Content Availability: Data availability depends on TMDb's database.

📝 Disclaimer

This scraper is intended for personal, educational, and research purposes. Users are responsible for complying with TMDb's terms of service and applicable laws. The developers are not liable for any misuse of this tool.