Web Designer News Scraper
This Actor is unavailable because the developer has decided to deprecate it. Would you like to try a similar Actor instead?
See alternative ActorsWeb Designer News Scraper
Scraper for www.webdesignernews.com
Web Designer News - Scraper
Scraper for www.webdesignernews.com, using Apify.
Input settings
Only startUrl
required, for example this could be the main page: https://www.webdesignernews.com
, some of the pagination pages: https://www.webdesignernews.com/page/2
or some specific section: https://www.webdesignernews.com/category/apps
or https://www.webdesignernews.com/category/apps/page/3
wayToScrape
can be old
or new
, by default old
. If, for example, the starting link is on the third page (https://webdesignernews.com/page/3
for example), then scraper will scrape third page, fourth page, fifth and so on, until pagination or maxRequestsPerCrawl
limit ends. But if the wayToScrape
is new
- third, second and first pages will be scraped. So you decide which way actor will work.
Local use
It is not necessary to use the apify.com service to use actor. Copy this repository and install node dependencies, for example, this way:
1git clone git@github.com:Ganevru/actor-webdesignernews-scraper.git 2cd actor-webdesignernews-scraper 3npm install
Create in the root of project: apify_storage/key_value_stores/default/INPUT.json
In INPUT.json
, write your settings, for example:
1{ 2 "startUrl": "https://webdesignernews.com/page/3", 3 "wayToScrape": "old", 4 "maxRequestsPerCrawl": 3, 5 "maxRequestRetries": 3, 6 "maxConcurrency": 3, 7 "liveView": true, 8 "proxyConfiguration": { 9 "useApifyProxy": false 10 } 11}
Start scrapping with command:
npm run local-start
Or, if you need a clean start:
npm run local-start-fresh
This will remove the results of all previous launches!
The result of the launch will be in apify_storage/datasets/default
It will be something like this:
1{ 2 "title": "AMP Pages Fundamentally Changing How the Web Works", 3 "vote": 47, 4 "thumb": "https://webdesignernews.s3.amazonaws.com/post_images/2657750/vlcsnap-2019-04-17-00h58m00s057-728x407-1d95-140x112.jpg", 5 "link": "https://www.webdesignernews.com/redirect/id/2657750", 6 "source": "androidpolice.com", 7 "date": "2 days ago", 8 "requestUrl": "https://webdesignernews.com/page/3" 9}