fffilm.name movie reviews by Frantisek Fuka
Get movie reviews from fffilm.name
Get text from a page using readability.js
yelp.com with reviews
Get basic info and all reviews from Yelp
Get product data from e-commerce site
Crawler for basic SEO analysis.
Crawls through articles and checks if these is a special e-commerce widget (actually returns number of occurrences).
Startups from Geekwire v2
Get all startups based in the Pacific Northwest from Geekwire collection. Crawler changes option in a dropdown to show all startups and then scrapes data.
Get product reviews from Blibli.com using internal JS variable and AJAX calls
Get events for given category from spain-eventos.es. Pages are being enqueued in a Page function to avoid clicking on "Load more" button
Google Play store - app reviews
Get app reviews from Google Play store (max. 4000 reviews). Uses internal AJAX call which returns 40 reviews in html code.
Outputs links to articles containing specific keyword (and a few words around the keyword)
Get all companies from jobs.dou.ua. Uses internal AJAX calls with CSRF token taken from JS variable on a page.
Czech legislative elections 2017
Get data about Czech legislative elections 2017 in time
topshop.com using XHRs
Get all products from topshop.com category using their internal XHRs
Get related articles for the first result of a given search query on Google scholar
6pm.com - JS variable
Get information of all cases for given day from http://probate.franklincountyohio.gov/
Get all sellers' details from mintmarket.cz
Get info about companies from czech Yellow pages
login to comicsdb.cz
Login example (comicsdb.cz). Simple POST data in Start url doesn't work, Pseudo-url has to be used to handle new request after login.
Scrapes basic info for given keyword and location (from a list)
Get basic product data from one page at customwheeloffset.com
Get basic info about campaign on Patreon
Audience Demographics from Alexa.com
Get audience demographics for given site from Alexa. Data are extracted from bars using their width attribute
Joke from stackoverflow.com for HackPrague
Get random joke from Stack Overflow. Made for HackPrague
Get agency profiles from adforum.com using internal AJAX calls
Get news from dotesports.com
Get product prices from prisjakt.nu
Startups from Geekwire v1
Get all startups based in the Pacific Northwest from Geekwire collection. Crawler navigates through pagination in one page function using JS click().
Get last movie reviews from csfd.cz