User

Jakub Balada

jakubbalada

I'm a co-founder of Apify, father of 2 kids, web hacker and beer lover.

All
Popularity
Actor

Content Checker

jakubbalada/content-checker

You can use this actor to monitor any page's content and get a notification when content changes. Technically it extracts text by a given selector and compares it with the previous run. If there is any change, it runs another act...

avatarjakubbalada
110star
FEATURED
Crawler

Email and Social handlers extractor

Get emails and social handlers (Twitter, LinkedIn, Instagram) from page/domain/web. Just change the Start url and define the scope.

avatarjakubbalada
666cloud_download
Crawler

transfermarkt.com

Get info about your favorite soccer players from transfermarkt.com

avatarjakubbalada
416cloud_download
Crawler

Booking - hotel details

Get info about hotels on booking.com based on search query.

avatarjakubbalada
317cloud_download
Crawler

Complete HTML

Crawls entire site (www subdomain) and extracts complete HTML content for every page

avatarjakubbalada
272cloud_download
Crawler

Booking - hotel prices

Get prices for your favorite hotel on booking.com. Scrapes all available rooms with description and prices for given hotel and dates.

avatarjakubbalada
228cloud_download
Crawler

Google Play store - app reviews

Get app reviews from Google Play store (max. 4000 reviews). Uses internal AJAX call which returns 40 reviews in html code.

avatarjakubbalada
143cloud_download
Crawler

yellowpages.com

Scrapes basic info for given keyword and location (from a list)

avatarjakubbalada
95cloud_download
Crawler

yelp.com with reviews

Get basic info and all reviews from Yelp

avatarjakubbalada
79cloud_download
Crawler

Louis Vuitton

Get product data from e-commerce site

avatarjakubbalada
76cloud_download
Crawler

IMDB.com

Get info about movies from IMDB (from detail page)

avatarjakubbalada
67cloud_download
Crawler

Basic SEO

Crawler for basic SEO analysis.

avatarjakubbalada
59cloud_download
Crawler

6pm.com - JS variable

Get products information from e-commerce fashion site using JavaScript variable available on a page

avatarjakubbalada
56cloud_download
Crawler

yelp.com reviews from JSON-LD

Crawler takes biz id from customData attribute and scrapes all reviews using JSON Linked data. If the page is not loaded (proxy can be banned), it is enqueued again.

avatarjakubbalada
56cloud_download
Crawler

XML parser

Crawler gets all categories as a set from given xml feed with products.

avatarjakubbalada
35cloud_download
Crawler

booli.se

Get all real estate offers from booli.se using internal JS variable

avatarjakubbalada
31cloud_download
Crawler

prisjakt.nu

Get product prices from prisjakt.nu

avatarjakubbalada
31cloud_download
Crawler

Hubspot.com

Get prospects from Hubspot.com behind your login which is handled by submitting login form.

avatarjakubbalada
29cloud_download
Crawler

login to comicsdb.cz

Login example (comicsdb.cz). Simple POST data in Start url doesn't work, Pseudo-url has to be used to handle new request after login.

avatarjakubbalada
26cloud_download
Crawler

Hacker News

Get top HN submissions (data is taken from the list on the frontpage)

avatarjakubbalada
22cloud_download
Crawler

topshop.com using XHRs

Get all products from topshop.com category using their internal XHRs

avatarjakubbalada
22cloud_download
Crawler

Readability

Get text from a page using readability.js

avatarjakubbalada
20cloud_download
Crawler

Audience Demographics from Alexa.com

Get audience demographics for given site from Alexa. Data are extracted from bars using their width attribute

avatarjakubbalada
19cloud_download
Crawler

StartupJobs.cz offers

Get all job offers from startupjobs.cz (data taken from a list)

avatarjakubbalada
16cloud_download
Crawler

SFO Flights

Get departures and arrivals at SFO. Pagination is handled in Page function using click()

avatarjakubbalada
15cloud_download
Crawler

Startups from Geekwire v1

Get all startups based in the Pacific Northwest from Geekwire collection. Crawler navigates through pagination in one page function using JS click().

avatarjakubbalada
14cloud_download
Crawler

kirnazabete.com

Get all new products from e-commerce site. Uses internal JS variable for some attributes.

avatarjakubbalada
14cloud_download
Crawler

Techcrunch.com

Outputs links to articles containing specific keyword (and a few words around the keyword)

avatarjakubbalada
14cloud_download
Crawler

blibli.com

Get product reviews from Blibli.com using internal JS variable and AJAX calls

avatarjakubbalada
13cloud_download
Crawler

Google scholar

Get related articles for the first result of a given search query on Google scholar

avatarjakubbalada
12cloud_download