You can use this actor to monitor any page's content and get a notification when content changes. Technically it extracts text by a given selector and compares it with the previous run. If there is any change, it runs another act...
Email and Social handlers extractor
Get emails and social handlers (Twitter, LinkedIn, Instagram) from page/domain/web. Just change the Start url and define the scope.
Get info about your favorite soccer players from transfermarkt.com
Booking - hotel details
Get info about hotels on booking.com based on search query.
Crawls entire site (www subdomain) and extracts complete HTML content for every page
Booking - hotel prices
Get prices for your favorite hotel on booking.com. Scrapes all available rooms with description and prices for given hotel and dates.
Google Play store - app reviews
Get app reviews from Google Play store (max. 4000 reviews). Uses internal AJAX call which returns 40 reviews in html code.
Scrapes basic info for given keyword and location (from a list)
yelp.com with reviews
Get basic info and all reviews from Yelp
Get product data from e-commerce site
Get info about movies from IMDB (from detail page)
Crawler for basic SEO analysis.
6pm.com - JS variable
yelp.com reviews from JSON-LD
Crawler takes biz id from customData attribute and scrapes all reviews using JSON Linked data. If the page is not loaded (proxy can be banned), it is enqueued again.
Crawler gets all categories as a set from given xml feed with products.
Get all real estate offers from booli.se using internal JS variable
Get product prices from prisjakt.nu
Get prospects from Hubspot.com behind your login which is handled by submitting login form.
Get top HN submissions (data is taken from the list on the frontpage)
login to comicsdb.cz
Login example (comicsdb.cz). Simple POST data in Start url doesn't work, Pseudo-url has to be used to handle new request after login.
topshop.com using XHRs
Get all products from topshop.com category using their internal XHRs
Get text from a page using readability.js
Audience Demographics from Alexa.com
Get audience demographics for given site from Alexa. Data are extracted from bars using their width attribute
Get all job offers from startupjobs.cz (data taken from a list)
Get departures and arrivals at SFO. Pagination is handled in Page function using click()
Startups from Geekwire v1
Get all startups based in the Pacific Northwest from Geekwire collection. Crawler navigates through pagination in one page function using JS click().
Get all new products from e-commerce site. Uses internal JS variable for some attributes.
Outputs links to articles containing specific keyword (and a few words around the keyword)
Get product reviews from Blibli.com using internal JS variable and AJAX calls
Get related articles for the first result of a given search query on Google scholar