article-scrapper
Pricing
$3.00 / 1,000 result_sets
article-scrapper
A flexible and powerful Apify Actor for scraping articles from tech news websites. This scraper can work with any tech news site - either from predefined presets or custom URLs
article-scrapper
Pricing
$3.00 / 1,000 result_sets
A flexible and powerful Apify Actor for scraping articles from tech news websites. This scraper can work with any tech news site - either from predefined presets or custom URLs
Use predefined news sites or provide custom URLs
Select from predefined tech news sources (only used if 'Use Preset Sites' is checked)
[ "verge", "techcrunch", "wired"]Enter custom URLs to scrape (one per line). Can be homepages or article listing pages. Used when 'Use Preset Sites' is unchecked.
Maximum number of articles to scrape from each source
Maximum number of listing pages to check per source
Extract full article text (may increase scraping time)
Filter articles by keywords. Only articles containing ANY of these keywords in title or description will be included. Leave empty to scrape all articles.
'any' — article passes if it matches at least one filter (OR logic, recommended). 'all' — article must pass every filter simultaneously (AND logic, very strict).
Only scrape articles where the title contains this text (case-insensitive). Leave empty to scrape all articles.
Only scrape articles where the summary/description contains this text (case-insensitive). Leave empty to scrape all articles.
Only include articles published on or after this date. Format: YYYY-MM-DD or ISO datetime (e.g. 2025-01-15). Leave empty for no restriction.
Only include articles published on or before this date. Format: YYYY-MM-DD or ISO datetime (e.g. 2025-03-31). Leave empty for no restriction.
Only include articles published within this time window. Examples: '24h', '7d', '30d', '2w'. Takes precedence over 'Published After' when both are set.
For preset sources, use their RSS/Atom feed instead of HTML scraping. Feeds are faster, more reliable, and return cleaner data.
Analyze each article's sentiment (positive / neutral / negative) based on its title and summary. Adds a 'sentiment' field to the output.