Pricing

Pay per usage

News Article Scraper

Developer

Donny Nguyen

Last modified: 13 hours ago

What does News Article Scraper do?

News Article Scraper extracts full article content from news websites such as TechCrunch, CNN, and BBC, as well as other online publications. For each article it collects the headline, full body text, author name, publication date, image URL, and source URL from the news sites you point it at. Typical uses are media monitoring, content aggregation, and building news datasets for research.

Why use News Article Scraper?

- Full article extraction -- Goes beyond headlines to capture the complete body text of each article.
- Multi-site support -- Works across a wide range of news sites and blogs thanks to intelligent content detection.
- Configurable crawl depth -- Follow links from a homepage to discover and scrape articles across multiple pages.
- Proxy-powered reliability -- Uses Apify Proxy to handle rate limiting and access regional content.
- API integration -- Trigger scraping runs and retrieve results programmatically via the Apify API.
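To make "crawl depth" concrete, here is a minimal sketch of depth-limited, breadth-first link discovery. It is an illustration only, not the actor's actual implementation: the `discover` function, the `SITE` link map, and the example.com URLs are all hypothetical, and the in-memory map stands in for real page fetches.

```python
from collections import deque


def discover(start_url, links, max_depth=1, max_articles=50):
    """Return pages reachable within max_depth link hops of start_url.

    `links` is a hypothetical map of page URL -> outgoing URLs that
    replaces real HTTP requests in this sketch.
    """
    seen = {start_url}
    queue = deque([(start_url, 0)])
    found = []
    while queue and len(found) < max_articles:
        url, depth = queue.popleft()
        found.append(url)
        if depth == max_depth:
            continue  # do not follow links beyond the configured depth
        for nxt in links.get(url, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, depth + 1))
    return found


# Hypothetical site: the homepage links to one article and one category page.
SITE = {
    "https://example.com/": ["https://example.com/a", "https://example.com/tech"],
    "https://example.com/tech": ["https://example.com/b"],
}

shallow = discover("https://example.com/", SITE, max_depth=1)
deep = discover("https://example.com/", SITE, max_depth=2)
```

In this toy graph, a depth of 1 reaches only pages linked from the homepage, while a depth of 2 also reaches the article linked from the category page.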

How to use News Article Scraper

1. Visit the Apify Store and find News Article Scraper.
2. Click Try for free to open the actor in the Apify Console.
3. Enter one or more news site URLs or direct article links in the News Site URLs field.
4. Set the Max Articles limit to control the total number of articles collected.
5. Adjust the Crawl Depth if you want the actor to follow links deeper into the site (default is 1 level).
6. Click Start and download the results from the Dataset tab when the run completes.
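The same run can be started outside the Console through the Apify API. The sketch below uses only the Python standard library and the API's `run-sync-get-dataset-items` endpoint. The actor ID is an assumption, since this page does not state the exact slug, and the helper names `build_run_request` and `run_actor` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
# Assumed actor ID ("username~actor-name"); check the actor's API tab for the real one.
ACTOR_ID = "consummate_mandala~news-article-scraper"


def build_run_request(token, urls, max_articles=50, crawl_depth=1):
    """Build (but do not send) a POST request that runs the actor synchronously."""
    run_input = {
        "urls": list(urls),
        "maxArticles": max_articles,
        "crawlDepth": crawl_depth,
    }
    query = urllib.parse.urlencode({"token": token})
    endpoint = f"{API_BASE}/acts/{ACTOR_ID}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        endpoint,
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_actor(token, urls, **options):
    """Send the request and return the dataset items (needs network and a valid token)."""
    with urllib.request.urlopen(build_run_request(token, urls, **options)) as resp:
        return json.load(resp)
```

Calling `run_actor("<YOUR_API_TOKEN>", ["https://techcrunch.com"], max_articles=10)` should return a list of article records, provided the assumed actor ID is correct.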

Input configuration

| Field | Type | Description | Default |
| --- | --- | --- | --- |
| `urls` | Array of strings | URLs of news sites or articles to scrape | `["https://techcrunch.com"]` |
| `maxArticles` | Integer | Maximum number of articles to scrape | `50` |
| `crawlDepth` | Integer | How many levels deep to follow links | `1` |
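A small helper can assemble this input and catch obvious mistakes before a run is started. This is a sketch: the field names and defaults come from the table above, but the validation rules (http(s) URLs only, `maxArticles` at least 1, non-negative `crawlDepth`) are assumptions and not documented limits of the actor.

```python
DEFAULTS = {"urls": ["https://techcrunch.com"], "maxArticles": 50, "crawlDepth": 1}


def build_input(urls=None, max_articles=None, crawl_depth=None):
    """Return an input dict, filling unspecified fields from the documented defaults."""
    run_input = {
        "urls": list(urls) if urls is not None else list(DEFAULTS["urls"]),
        "maxArticles": DEFAULTS["maxArticles"] if max_articles is None else max_articles,
        "crawlDepth": DEFAULTS["crawlDepth"] if crawl_depth is None else crawl_depth,
    }
    # The checks below are assumed sanity rules, not limits published by the actor.
    if not run_input["urls"]:
        raise ValueError("urls must contain at least one URL")
    for url in run_input["urls"]:
        if not isinstance(url, str) or not url.startswith(("http://", "https://")):
            raise ValueError(f"not an http(s) URL: {url!r}")
    if not isinstance(run_input["maxArticles"], int) or run_input["maxArticles"] < 1:
        raise ValueError("maxArticles must be a positive integer")
    if not isinstance(run_input["crawlDepth"], int) or run_input["crawlDepth"] < 0:
        raise ValueError("crawlDepth must be a non-negative integer")
    return run_input
```

For example, `build_input(urls=["https://www.bbc.com/news"], crawl_depth=2)` keeps the default `maxArticles` of 50 and overrides the other two fields.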

Output data

Each article is stored as a separate record in the dataset. Below is an example output:

{
  "headline": "OpenAI Announces New Partnership with Major Cloud Provider",
  "bodyText": "OpenAI revealed today that it has entered into a strategic partnership with a leading cloud infrastructure provider. The deal, reportedly valued at over $2 billion, will expand access to...",
  "author": "Jane Doe",
  "publishDate": "2025-11-20T14:30:00Z",
  "imageUrl": "https://techcrunch.com/wp-content/uploads/2025/11/openai-partnership.jpg",
  "sourceUrl": "https://techcrunch.com/2025/11/20/openai-cloud-partnership/"
}
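Downstream code often wants typed records instead of raw dictionaries. The sketch below maps one dataset item onto a dataclass and parses `publishDate` into a timezone-aware `datetime`. The `Article` class and `parse_article` helper are hypothetical, and treating every field except `headline`, `bodyText`, and `sourceUrl` as optional is an assumption about what the actor may omit.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class Article:
    headline: str
    body_text: str
    source_url: str
    author: Optional[str] = None
    published: Optional[datetime] = None
    image_url: Optional[str] = None


def parse_article(record):
    """Convert one dataset item (a dict) into an Article."""
    raw_date = record.get("publishDate")
    # Replace the trailing "Z" so fromisoformat also works on Python < 3.11.
    published = datetime.fromisoformat(raw_date.replace("Z", "+00:00")) if raw_date else None
    return Article(
        headline=record["headline"],
        body_text=record["bodyText"],
        source_url=record["sourceUrl"],
        author=record.get("author"),
        published=published,
        image_url=record.get("imageUrl"),
    )


# Abbreviated copy of the example record above.
SAMPLE = {
    "headline": "OpenAI Announces New Partnership with Major Cloud Provider",
    "bodyText": "OpenAI revealed today that it has entered into a strategic partnership...",
    "author": "Jane Doe",
    "publishDate": "2025-11-20T14:30:00Z",
    "imageUrl": "https://techcrunch.com/wp-content/uploads/2025/11/openai-partnership.jpg",
    "sourceUrl": "https://techcrunch.com/2025/11/20/openai-cloud-partnership/",
}

article = parse_article(SAMPLE)
```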

Cost of usage

News Article Scraper uses Pay-Per-Event (PPE) pricing at the Mid tier:

| Tier | Cost per 1,000 events | Free tier (approx.) |
| --- | --- | --- |
| Mid | $0.75 | ~6,600 events/month |

One event corresponds to one article scraped. Collecting 50 articles from a single news site would therefore cost approximately $0.038 (50 × $0.75 / 1,000). The free tier allocation covers roughly 6,600 articles per month at no charge.
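The arithmetic is simple enough to script when budgeting larger runs. The sketch below uses the $0.75 per 1,000 events rate from the table. The function names are hypothetical, and the $5 budget in the last line is an assumed figure chosen because it lands close to the ~6,600 free-tier estimate; this page does not state the credit amount.

```python
PRICE_PER_1000_EVENTS = 0.75  # USD, Mid tier rate from the table above


def estimate_cost(articles, price_per_1000=PRICE_PER_1000_EVENTS):
    """Cost in USD of scraping `articles` articles (one event per article)."""
    return articles * price_per_1000 / 1000


def articles_for_budget(budget_usd, price_per_1000=PRICE_PER_1000_EVENTS):
    """Whole number of articles that fit within `budget_usd`."""
    return int(budget_usd * 1000 / price_per_1000)


cost_50 = estimate_cost(50)           # 0.0375 USD, i.e. roughly $0.038
covered = articles_for_budget(5.0)    # assumed $5 credit, not stated on this page
```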

Tips and advanced usage

- Media monitoring -- Set up scheduled runs to scrape your target publications daily and track coverage on specific topics.
- Build training datasets -- Collect thousands of articles for NLP and machine learning projects such as text classification or summarization.
- Track multiple sources -- Pass several news site URLs in a single run to aggregate content across publications.
- Increase crawl depth for archives -- Set `crawlDepth` to 2 or 3 to discover articles linked from category or archive pages.
- Combine with keyword filtering -- Export the dataset and filter by headline or body text to isolate articles on specific topics.
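The keyword filtering tip can be done in a few lines once the dataset is exported. This sketch assumes the export is a list of dictionaries with the `headline` and `bodyText` fields shown in the example output; the `filter_by_keywords` helper and the sample records are hypothetical.

```python
def filter_by_keywords(articles, keywords):
    """Keep articles whose headline or body text contains any keyword (case-insensitive)."""
    needles = [k.lower() for k in keywords]

    def matches(article):
        # Missing or null fields are treated as empty text.
        text = f"{article.get('headline') or ''} {article.get('bodyText') or ''}".lower()
        return any(needle in text for needle in needles)

    return [a for a in articles if matches(a)]


# Hypothetical records in the shape of the example output.
records = [
    {"headline": "OpenAI Announces New Partnership", "bodyText": "A cloud deal was revealed..."},
    {"headline": "Local Team Wins Final", "bodyText": "The match ended 2-1."},
    {"headline": "Chip Shortage Eases", "bodyText": "Cloud providers report better supply."},
]

cloud_news = filter_by_keywords(records, ["cloud"])
```

Substring matching is deliberately simple; it will also match keywords inside longer words, so stricter filtering may need word-boundary checks.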

Built with Crawlee and Apify SDK. See more scrapers by consummate_mandala on Apify Store.
