CBS Local Scraper

  • lukass/cbs-local-scraper
  • Modified
  • Users 2
  • Runs 492
  • Created by Author's avatarLukáš Širhal

Scrape news data from cbslocal.com with this unofficial API. Extract articles, monitor their popularity and performance and automate the fight against fake news. Filter the results by authors, topics, categories, or publication dates. Preview or download the results in your preferred format.

Free trial for 7 days

Then $20.00/month

No credit card required now

CBS Local Scraper

Free trial for 7 days

Then $20.00/month

Start URLs

startUrls

Required

array

Can be main page URL or any category URLs. Article pages are found and enqueued from these.

Maximum number of articles

maxArticlesPerCrawl

Optional

integer

Maximum number of valid articles scraped. The crawler will stop automatically after reaching this number.

Date from

dateFrom

Optional

string

Only articles from this day to present will be scraped. If empty, all articles will be scraped. Format is YYYY-MM-DD, e.g. 2019-12-31, or Number type e.g. 1 week or 20 days

Only new articles

onlyNewArticlesPerDomain

Optional

boolean

If true, will scrape only new articles each time you run it. All URLs you scraped are compared with saved ones. Scraped articles are saved in one dataset per each domain, datasets are named 'ARTICLES-SCRAPED-domain'