Scrape all articles from any news website. Extract full text, metadata, keywords, and summaries. Ideal for content analysis, research, and news aggregation.
Filter articles containing specific keywords. Leave blank to get all articles. Supports boolean operators: AND, OR, NOT with parentheses (e.g., 'climate AND (change OR crisis) NOT politics').
Search in Article Titles
searchInTitlebooleanOptional
Apply keyword search to article titles.
Default value of this property is true
Search in Article Content
searchInContentbooleanOptional
Apply keyword search to article text content.
Default value of this property is true
Case Sensitive Search
caseSensitivebooleanOptional
Make keyword search case sensitive.
Default value of this property is false
Minimum Word Count
minWordCountintegerOptional
Skip articles that have fewer words than this number. Use 0 for no minimum.
Default value of this property is 0
Extract Summary & Keywords
extractSummarybooleanOptional
Generate article summaries and keywords using Natural Language Processing (NLP). Note: This increases processing time.
Default value of this property is true
Include Images
includeImagesbooleanOptional
Extract and include image URLs from articles.
Default value of this property is true
Request Timeout (seconds)
requestTimeoutintegerOptional
Timeout for each article request in seconds.
Default value of this property is 7
Concurrent Requests
concurrencyintegerOptional
Number of articles to process simultaneously. Higher values = faster crawling but may trigger rate limits.
Default value of this property is 5
Proxy Configuration
proxyConfigurationobjectOptional
Configure proxy settings for reliable scraping. Apify Proxy is recommended for best results.
Default value of this property is {"useApifyProxy":false}