IMDB Advanced Scraper avatar

IMDB Advanced Scraper

Try for free

3 days trial then $35.00/month - No credit card required now

Go to Store
IMDB Advanced Scraper

IMDB Advanced Scraper

epctex/imdb-advanced-scraper
Try for free

3 days trial then $35.00/month - No credit card required now

Uncover comprehensive data on millions of movies, artists, and more with our advanced scraping tool. Extract TV episode details, descriptions, casting, artists, ratings, and more. Customize your search with filters and mappings for precise results.

Actor - IMDB Advanced Scraper

IMDB Advanced scraper

Since IMDB doesn't provide a good and free API, this actor should help you to retrieve data from it. This actor can provide you with very detailed output from IMDB.

The IMDB Advanced data scraper supports the following features:

  • Advanced search - Search anything using the advanced search capability.

  • Search any keyword - You can search any keyword you would like to have and get the results.

  • Scrape lists - Scrape any list that you'd like to get from IMDB

  • Scrape any search result - You can search for keywords, artists, movies, companies, and many more!

  • Scrape genres - If you want to get the most read books on a certain category or anything related to the genres, just type the URL.

  • Scrape movie detail - Scrape very detailed information for each of the movies that you'd like to get.

  • Scrape artist detail - Scrape any artist and their information.

What are the advantages of this actor compared to others?

This actor is extremely optimized, very fast, and provides at least 5x more data than the other actors or projects that you might experience.

Bugs, fixes, updates, and changelog

This scraper is under active development. If you have any feature requests you can create an issue from here.

Input Parameters

The input of this scraper should be JSON containing the list of pages on IMDB that should be visited. Possible fields are:

  • search: (Optional) (String) Keyword that you want to search on IMDB.

  • mode: (Optional) (String) Mode of search. This attribute has to be initiated when a search keyword is provided. The options are: all, tt (Titles), ep (TV Episodes),nm (Names),co (Companies),kw (Keywords).

  • startUrls: (Optional) (Array) List of IMDB URLs. You should only provide a list, title, name, search and find URLs.

  • endPage: (Optional) (Number) Final number of page that you want to scrape. The default is Infinite. This applies to all search requests and startUrls individually.

  • maxItems: (Optional) (Number) You can limit scraped items. This should be useful when you search through the big lists or search results.

  • proxy: (Required) (Proxy Object) Proxy configuration.

  • extendOutputFunction: (Optional) (String) Function that takes a JQuery handle ($) as an argument and returns an object with data.

  • customMapFunction: (Optional) (String) Function that takes each object's handle as an argument and returns the object with executing the function.

This solution requires the use of Proxy servers, either your own proxy servers or you can use Apify Proxy.

Tip

When you want to have a scrape over a specific item URL, just copy and paste the link as one of the startUrl.

If you would like to scrape only the first page of a list then put the link for the page and have the endPage as 1.

With the last approach that is explained above you can also fetch any interval of pages. If you provide the 5th page of a list and define the endPage parameter as 6 then you'll have the 5th and 6th pages only.

Compute Unit Consumption

The actor is optimized to run blazing fast and scrape many items as possible. Therefore, it forefronts all item detail requests. If the actor doesn't block very often it'll scrape 100 items in 1 minute with ~0.03-0.06 compute units.

IMDB Advanced Scraper Input example

1{
2  "searchUrls":[
3    "https://www.imdb.com/title/tt0056234/?ref_=adv_li_tt",
4    "https://www.imdb.com/name/nm0912504/?ref_=fn_al_nm_1",
5    "https://www.imdb.com/list/ls016522954/?ref_=nv_tvv_dvd",
6    "https://www.imdb.com/find?s=kw&q=warn&ref_=nv_sr_sm",
7		"https://www.imdb.com/search/title/?title_type=feature&release_date=2021-01-01,2023-07-01"
8  ],
9  "proxy":{
10    "useApifyProxy":true
11  },
12  "search":"warner bro",
13  "mode": "all",
14  "maxItems":20
15}

During the Run

During the run, the actor will output messages letting you know what is going on. Each message always contains a short label specifying which page from the provided list is currently specified. When items are loaded from the page, you should see a message about this event with a loaded item count and total item count for each page.

If you provide incorrect input to the actor, it will immediately stop with a failure state and output an explanation of what is wrong.

IMDB Export

During the run, the actor stores results into a dataset. Each item is a separate item in the dataset.

You can manage the results in any language (Python, PHP, Node JS/NPM). See the FAQ or our API reference to learn more about getting results from this IMDB Advanced actor.

Scraped IMDB Properties

The structure of each item in IMDB looks like this:

Title Item Detail

1{
2	"scrapedType": "title",
3	"id": "tt2172049",
4	"productionStatus": {
5		"currentProductionStage": {
6			"id": "released",
7			"text": "Released"
8		},
9		"productionStatusHistory": [
10			{
11				"status": {
12					"id": "post_production",
13					"text": "Post-production"
14				}
15			},
16			{
17				"status": {
18					"id": "released",
19					"text": "Released"
20				}
21			}
22		],
23		"restriction": null
24	},
25	"canHaveEpisodes": false,
26	"series": null,
27	"titleText": {
28		"text": "Sant Barbara"
29	},
30	"titleType": {
31		"id": "tvMovie",
32		"canHaveEpisodes": false
33	},
34	"originalTitleText": {
35		"text": "Santa Barbara"
36	},
37	"certificate": {
38		"rating": "TV-G"
39	},
40	"releaseYear": {
41		"year": 2012
42	},
43	"releaseDate": {
44		"day": 4,
45		"month": 12,
46		"year": 2012,
47		"country": {
48			"id": "IT",
49			"text": "Italy"
50		}
51	},
52	"runtime": {
53		"seconds": 6000
54	},
55	"canRate": {
56		"isRatable": true
57	},
58	"ratingsSummary": {
59		"aggregateRating": 6.7,
60		"voteCount": 165,
61		"topRanking": null
62	},
63	"meterRanking": null,
64	"primaryImage": {
65		"id": "rm1824601856",
66		"width": 547,
67		"height": 777,
68		"url": "https://m.media-amazon.com/images/M/MV5BYjkwYzFiZmItZTI5ZS00NzVkLTkwYWQtMGExMmEzNWI2ZGJhXkEyXkFqcGdeQXVyMTQ3Njg3MQ@@._V1_.jpg",
69		"caption": {
70			"plainText": "Sant Barbara (2012)"
71		}
72	},
73	"images": {
74		"total": 3
75	},
76	"videos": {
77		"total": 0
78	},
79	"primaryVideos": [],
80	"externalLinks": {
81		"total": 1
82	},
83	"metacritic": null,
84	"keywords": [
85		{
86			"text": "female whipping"
87		}
88	],
89	"genres": {
90		"genres": [
91			{
92				"text": "Drama",
93				"id": "Drama"
94			}
95		]
96	},
97	"plot": {
98		"plotText": {
99			"plainText": "Dioscuro, governor of Scandriglia, is willing to do anything to get to the Roman government. His daughter Barbara, instead she is disinterested in everything and continued her studies in astronomy from Polycarp and along with her friends: Giuliana, Tito, Crio and the soldier Claudio. With them she will starts a spiritual journey that will totally change her life, also marked by the death of her mother, making the Christian faith its raison d'etre. All this will bring it into conflict with the government, including the prefect Marciano and especially with the father who does not share her choice and that she had not predicted this future course."
100		},
101		"language": {
102			"id": "en-US"
103		}
104	},
105	"plotContributionLink": {
106		"url": "https://contribute.imdb.com/updates?update=tt2172049:outlines.add.1.locale~en-US"
107	},
108	"credits": {
109		"total": 54
110	},
111	"principalCredits": [
112		{
113			"totalCredits": 1,
114			"category": {
115				"text": "Director",
116				"id": "director"
117			},
118			"credits": [
119				{
120					"name": {
121						"nameText": {
122							"text": "Carmine Elia"
123						},
124						"id": "nm0253416"
125					},
126					"attributes": null
127				}
128			]
129		},
130		{
131			"totalCredits": 2,
132			"category": {
133				"text": "Writers",
134				"id": "writer"
135			},
136			"credits": [
137				{
138					"name": {
139						"nameText": {
140							"text": "Maura Nuccetelli"
141						},
142						"id": "nm0637707"
143					},
144					"attributes": null
145				},
146				{
147					"name": {
148						"nameText": {
149							"text": "Mario Ruggeri"
150						},
151						"id": "nm1365120"
152					},
153					"attributes": null
154				}
155			]
156		},
157		{
158			"totalCredits": 12,
159			"category": {
160				"text": "Stars",
161				"id": "cast"
162			},
163			"credits": [
164				{
165					"name": {
166						"nameText": {
167							"text": "Vanessa Hessler"
168						},
169						"id": "nm2057064"
170					},
171					"attributes": null
172				},
173				{
174					"name": {
175						"nameText": {
176							"text": "Thomas Trabacchi"
177						},
178						"id": "nm0870384"
179					},
180					"attributes": null
181				},
182				{
183					"name": {
184						"nameText": {
185							"text": "Massimo Wertmüller"
186						},
187						"id": "nm0921632"
188					},
189					"attributes": null
190				}
191			]
192		}
193	],
194	"reviews": {
195		"total": 2
196	},
197	"criticReviewsTotal": {
198		"total": 0
199	},
200	"triviaTotal": {
201		"total": 0
202	},
203	"engagementStatistics": null,
204	"subNavCredits": {
205		"total": 54
206	},
207	"subNavReviews": {
208		"total": 2
209	},
210	"subNavTrivia": {
211		"total": 0
212	},
213	"subNavFaqs": {
214		"total": 0
215	},
216	"subNavTopQuestions": {
217		"total": 0
218	},
219	"titleGenres": {
220		"genres": [
221			{
222				"genre": {
223					"text": "Drama"
224				}
225			}
226		]
227	},
228	"meta": {
229		"canonicalId": "tt2172049",
230		"publicationStatus": "PUBLISHED"
231	},
232	"castPageTitle": [
233		{
234			"name": {
235				"nameText": {
236					"text": "Vanessa Hessler"
237				}
238			}
239		},
240		{
241			"name": {
242				"nameText": {
243					"text": "Thomas Trabacchi"
244				}
245			}
246		},
247		{
248			"name": {
249				"nameText": {
250					"text": "Massimo Wertmüller"
251				}
252			}
253		},
254		{
255			"name": {
256				"nameText": {
257					"text": "Simone Montedoro"
258				}
259			}
260		}
261	],
262	"creatorsPageTitle": [],
263	"directorsPageTitle": [
264		{
265			"credits": [
266				{
267					"name": {
268						"nameText": {
269							"text": "Carmine Elia"
270						}
271					}
272				}
273			]
274		}
275	],
276	"countriesOfOrigin": {
277		"countries": [
278			{
279				"id": "IT",
280				"text": "Italy"
281			}
282		]
283	},
284	"production": [
285		{
286			"company": {
287				"id": "co0064546",
288				"companyText": {
289					"text": "Lux Vide"
290				}
291			}
292		}
293	],
294	"featuredReviews": [
295		{
296			"id": "rw8978231",
297			"author": {
298				"nickName": "ma-cortes",
299				"userId": "ur3270789"
300			},
301			"summary": {
302				"originalText": "Brilliant and enjoyable TV movie with plenty of love and Christian joy"
303			},
304			"text": {
305				"originalText": {
306					"plaidHtml": "Expensively mounted religious production with mostly Italian cast and spectacularly filmed in Italy. Dioscuro (Wertmüller) , governor of Scandriglia, is willing to do anything to get to the Roman government. His daughter Barbara (really gorgeous fashion model Vanessa Kessler of Asterix at the Olympic Games) , instead she is disinterested in everything and continued her studies in astronomy from Polycarp (Luciano Virgilio) and along with her friends: Giuliana (Laura Roman) , Tito (Albelli) , Crio (Anagni) and the soldier Claudio (Thomas Trabacchi) . With them she will begin a spiritual journey that will totally change her life, also marked by the death of her mother, making the Christian faith its raison d&#39;etre. All this will bring it into conflict with the government, including the prefect Marciano (Thomas Trabacchi) and her father, counsellor of emperor Dioclecianus.<br/><br/>Interesting retelling at the Santa Barbara life and her teachings filled with historical and religious view point . Words that relate to early Christian attitude but they are clearly reflected throughout the film made with honest effort and great faith both from the crew and the cast. I have seen many religious historical epics ; however , I have to admit that seldom have I seen such a purely faithful movie where almost each moment is an experience. As Santa Barbara gets in trouble when she takes the Catholic faith and especially with her father and the demanding Prefect who do not share her choice and that she had not predicted this future course. Breathtaking story set in Dioclecianus times , during the bloody Christian pursuits with emphasis in religious motives , lavish costumes and adequate production design . This is an impressive story of the life of Santa Barbara filmed on a relatively grand scale , being efficiently directed by Carmine Elia . However , not completely reverential at some characters . Life of Santa Barbara is intelligently told , including glowing cinematography by Federico Schlatter , an attractive visual style , memorable but repetitive soundtrack by means of synthesizer and decent cast giving larger-than-life interpretation .<br/><br/>This TV movie is accurately based on Santa Barbara : She is often portrayed with miniature chains and a tower. As one of the Fourteen Holy Helpers, Barbara is a popular saint, perhaps best known as the patron of armourers, artillerymen, military engineers, miners and others who work with explosives because of her legend&#39;s association with lightning, and also of mathematicians. A 15th-century French version of her story credits her with thirteen miracles, many of which reflect the security she offered that her devotees would not die before getting to make confession and receiving extreme unction. Despite the legends detailing her story, the earliest references to her supposed 3rd-century life do not appear until the 7th century, and veneration of her was common, especially in the East, from the 9th century. According to the hagiographies, Barbara, the daughter of a rich pagan named Dioscorus, was carefully guarded by her father who kept her locked up in a tower in order to preserve her from the outside world. Having secretly become a Christian, she rejected an offer of marriage that she received through her father. She was born either in Heliopolis or in Nicomedia. Before going on a journey, her father commanded that a private bath-house be erected for her use near her dwelling, and during his absence, Barbara had three windows put in it, as a symbol of the Holy Trinity, instead of the two originally intended. When her father returned, she acknowledged herself to be a Christian. Dragged before the prefect of the province, Martinianus, who had her cruelly tortured, Barbara held true to her Christian faith. During the night, the dark prison was bathed in light and new miracles occurred. Every morning, her wounds were healed. Torches that were to be used to burn her went out as soon as they came near her. Finally, she was condemned to death by beheading. Her father himself carried out the death sentence. However, as punishment, he was struck by lightning on the way home and his body was consumed by flame. Barbara was buried by a Christian, Valentinus, and her tomb became the site of miracles. Saint Barbara is venerated by Catholics who face the danger of sudden and violent death at work. She is invoked against thunder and lightning and all accidents arising from explosions of gunpowder. She became the patron saint of artillerymen, armourers, military engineers, gunsmiths, and anyone else who worked with cannon and explosives. The name of the barbiturate family of pharmaceutical drugs is believed to derive from the suggestion by an artilleryman commemorating the feast of Saint Barbara in 1864, whom the chemist Adolf von Baeyer encountered at a local tavern whilst celebrating his recent discovery of the parent compound. Saint Barbara is mentioned in Federico García Lorca&#39;s play The House of Bernarda Alba (1936)."
307				}
308			},
309			"authorRating": 6,
310			"submissionDate": "2023-04-08",
311			"helpfulness": {
312				"upVotes": 2,
313				"downVotes": 0
314			}
315		}
316	],
317	"wins": {
318		"total": 0
319	},
320	"nominations": {
321		"total": 0
322	},
323	"prestigiousAwardSummary": null,
324	"episodes": null,
325	"videoStrip": [],
326	"titleMainImages": [
327		{
328			"id": "rm3290246912",
329			"url": "https://m.media-amazon.com/images/M/MV5BYjQ2NzI0YWMtM2IyYi00YTcyLWI5MmEtMDQyNTQxODcxZDEzXkEyXkFqcGdeQXVyNzMzMjU5NDY@._V1_.jpg",
330			"caption": {
331				"plainText": "Sant Barbara (2012)"
332			},
333			"height": 799,
334			"width": 607
335		},
336		{
337			"id": "rm1824601856",
338			"url": "https://m.media-amazon.com/images/M/MV5BYjkwYzFiZmItZTI5ZS00NzVkLTkwYWQtMGExMmEzNWI2ZGJhXkEyXkFqcGdeQXVyMTQ3Njg3MQ@@._V1_.jpg",
339			"caption": {
340				"plainText": "Sant Barbara (2012)"
341			},
342			"height": 777,
343			"width": 547
344		},
345		{
346			"id": "rm3474796288",
347			"url": "https://m.media-amazon.com/images/M/MV5BZDI1NTY0YjQtYjgzOC00YWEwLTlmMmEtMWQ2OTg0MmIzNzU3XkEyXkFqcGdeQXVyNzMzMjU5NDY@._V1_.jpg",
348			"caption": {
349				"plainText": "Sant Barbara (2012)"
350			},
351			"height": 669,
352			"width": 718
353		}
354	],
355	"imageUploadLink": {
356		"url": "https://contribute.imdb.com/image/tt2172049/add?bus=imdb&return_url=https%3A%2F%2Fwww.imdb.com%2Fclose_me&site=web"
357	},
358	"cast": [
359		{
360			"name": {
361				"id": "nm2057064",
362				"nameText": {
363					"text": "Vanessa Hessler"
364				},
365				"primaryImage": {
366					"url": "https://m.media-amazon.com/images/M/MV5BMTU5NDQwNTUyMF5BMl5BanBnXkFtZTcwMjgxNTcxOA@@._V1_.jpg",
367					"width": 1365,
368					"height": 2048
369				}
370			},
371			"attributes": null,
372			"category": {
373				"id": "actress"
374			},
375			"characters": [
376				{
377					"name": "Barbara"
378				}
379			],
380			"episodeCredits": {
381				"total": 0,
382				"yearRange": null
383			}
384		},
385	],
386	"writers": [
387		{
388			"totalCredits": 2,
389			"category": {
390				"text": "Writers"
391			},
392			"credits": [
393				{
394					"name": {
395						"id": "nm0637707",
396						"nameText": {
397							"text": "Maura Nuccetelli"
398						}
399					},
400					"attributes": null
401				},
402				{
403					"name": {
404						"id": "nm1365120",
405						"nameText": {
406							"text": "Mario Ruggeri"
407						}
408					},
409					"attributes": null
410				}
411			]
412		}
413	],
414	"isAdult": false,
415	"moreLikeThisTitles": [
416		{
417			"id": "tt1327890",
418			"titleText": {
419				"text": "Sant'Agostino"
420			},
421			"titleType": {
422				"id": "tvMovie",
423				"text": "TV Movie",
424				"canHaveEpisodes": false,
425				"displayableProperty": {
426					"value": {
427						"plainText": "TV Movie"
428					}
429				}
430			},
431			"originalTitleText": {
432				"text": "Sant'Agostino"
433			},
434			"primaryImage": {
435				"id": "rm1889999104",
436				"width": 800,
437				"height": 1130,
438				"url": "https://m.media-amazon.com/images/M/MV5BMjJkYTYzNDctNmUyMS00MjgyLWIyNGMtNGZkMGFiZWU4YWRjXkEyXkFqcGdeQXVyNDA5NDY4MzU@._V1_.jpg",
439				"caption": {
440					"plainText": "Sant'Agostino (2010)"
441				}
442			},
443			"releaseYear": {
444				"year": 2010,
445				"endYear": null
446			},
447			"ratingsSummary": {
448				"aggregateRating": 6.8,
449				"voteCount": 424
450			},
451			"runtime": {
452				"seconds": 12000
453			},
454			"certificate": null,
455			"canRate": {
456				"isRatable": true
457			},
458			"titleGenres": {
459				"genres": [
460					{
461						"genre": {
462							"text": "Biography"
463						}
464					},
465					{
466						"genre": {
467							"text": "Drama"
468						}
469					},
470					{
471						"genre": {
472							"text": "History"
473						}
474					}
475				]
476			},
477			"canHaveEpisodes": false
478		},
479	],
480	"trivia": [],
481	"goofsTotal": {
482		"total": 0
483	},
484	"goofs": [],
485	"quotesTotal": {
486		"total": 0
487	},
488	"quotes": [],
489	"crazyCredits": [],
490	"alternateVersions": [],
491	"connections": [],
492	"soundtrack": [],
493	"iframeAddReviewLink": {
494		"url": "https://contribute.imdb.com/review/tt2172049/add?bus=imdb&return_url=https%3A%2F%2Fwww.imdb.com%2Fclose_me&site=web"
495	},
496	"topQuestions": [],
497	"faqs": [],
498	"detailsExternalLinks": [],
499	"spokenLanguages": {
500		"spokenLanguages": [
501			{
502				"id": "it",
503				"text": "Italian"
504			}
505		]
506	},
507	"akas": [
508		{
509			"text": "Santa Bárbara"
510		}
511	],
512	"filmingLocations": [
513		{
514			"text": "Tunisia",
515			"location": "Tunisia",
516			"attributes": []
517		}
518	],
519	"companies": {
520		"total": 5
521	},
522	"productionBudget": {
523		"budget": {
524			"amount": 2000000,
525			"currency": "EUR"
526		}
527	},
528	"lifetimeGross": null,
529	"openingWeekendGross": null,
530	"worldwideGross": null,
531	"technicalSpecifications": {
532		"soundMixes": {
533			"items": []
534		},
535		"aspectRatios": {
536			"items": [
537				{
538					"aspectRatio": "16:9 HD",
539					"attributes": []
540				}
541			]
542		},
543		"colorations": {
544			"items": [
545				{
546					"conceptId": "color",
547					"text": "Color",
548					"attributes": []
549				}
550			]
551		}
552	},
553	"contributionQuestions": [
554		{
555			"entity": {
556				"primaryImage": {
557					"url": "https://m.media-amazon.com/images/M/MV5BYjkwYzFiZmItZTI5ZS00NzVkLTkwYWQtMGExMmEzNWI2ZGJhXkEyXkFqcGdeQXVyMTQ3Njg3MQ@@._V1_.jpg",
558					"width": 547,
559					"height": 777,
560					"caption": {
561						"plainText": "Sant Barbara (2012)"
562					}
563				}
564			},
565			"questionId": "tt2172049.plot_outline.en-US",
566			"questionText": {
567				"plainText": "What is the English language plot outline for Sant Barbara (2012)?"
568			},
569			"contributionLink": {
570				"url": "https://contribute.imdb.com/answers?pinnedQuestion=tt2172049.plot_outline.en-US"
571			}
572		}
573	]
574}

Contact

Please visit us through epctex.com to see all the products that are available for you. If you are looking for any custom integration or so, please reach out to us through the chat box in epctex.com. In need of support? devops@epctex.com is at your service.

Developer
Maintained by Community

Actor Metrics

  • 20 monthly users

  • 2 stars

  • >99% runs succeeded

  • 16 hours response time

  • Created in May 2022

  • Modified 18 hours ago

Categories