Pricing

$15.00/month + usage

Try for free

Go to Apify Store

Articles Extractor

Try for free

The Article Extractor is an enterprise-grade web scraping solution designed specifically for extracting structured data from news articles, blog posts, and online publications. Our advanced HTML parsing engine delivers unmatched accuracy in content extraction across thousands of websites.

Pricing

$15.00/month + usage

Rating

5.0

(2)

Developer

Web Harvester

Actor stats

Bookmarked

759

Total users

Monthly active users

a year ago

Last modified

Categories

News

SEO tools

You can access the Articles Extractor programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

Python

JavaScript

CLI

OpenAPI

HTTP

MCP

1from apify_client import ApifyClient
2
3# Initialize the ApifyClient with your Apify API token
4# Replace '<YOUR_API_TOKEN>' with your token.
5client = ApifyClient("<YOUR_API_TOKEN>")
6
7# Prepare the Actor input
8run_input = {
9    "startUrls": [{ "url": "https://www.cnbc.com/2022/09/21/what-another-major-rate-hike-by-the-federal-reserve-means-to-you.html" }],
10    "headerGeneratorOptions": {},
11    "proxyConfiguration": {
12        "useApifyProxy": True,
13        "apifyProxyGroups": ["RESIDENTIAL"],
14    },
15}
16
17# Run the Actor and wait for it to finish
18run = client.actor("web.harvester/articles-extractor").call(run_input=run_input)
19
20# Fetch and print Actor results from the run's dataset (if there are any)
21print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
22for item in client.dataset(run["defaultDatasetId"]).iterate_items():
23    print(item)
24
25# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

Articles Extractor API in Python

The Apify API client for Python is the official library that allows you to use Articles Extractor API in Python, providing convenience functions and automatic retries on errors.

Install the apify-client

$pip install apify-client

Other API clients include:

Articles Extractor API in JavaScript

Articles Extractor API through CLI

Articles Extractor OpenAPI definition

Articles Extractor API

Smart Article Extractor

lukaskrivka/article-extractor-smart

📰 Smart Article Extractor extracts articles from any scientific, academic, or news website with just one click. The extractor crawls the whole website and automatically distinguishes articles from other web pages. Download your data as HTML table, JSON, Excel, RSS feed, and more.

Lukáš Křivka

7.6K

4.1

Web Article Content Extractor

vulnv/web-article-content-extractor

Extract clean, readable content from news articles, blog posts, and web pages. Batch process multiple URLs, download images, bypass bot protection with proxy support. Perfect for content curation, research, and data analysis.

VulnV

News Search Articles Scraper

data_direct/news-articles-scraper

Search news articles by keyword

Data Direct

Macys News Articles

pintostudio/macys-news-articles

The Macy's News Articles Actor is a powerful Apify web scraping tool designed to extract press releases and news articles from Macy's official newsroom.

Pinto Studio

Smart Article Extractor

parseforge/article-extractor

Extract clean article content from any news, blog, or publisher site! Pull full body text, author, publish date, word count, language, reading time, images, and metadata at scale. Ideal for content research, media monitoring, SEO audits, and AI training. Start extracting articles in minutes!

ParseForge

News Website Crawler & Article Extractor

xtech/news-source-crawler

Scrape all articles from any news website. Extract full text, metadata, keywords, and summaries. Ideal for content analysis, research, and news aggregation.

Xtech

409

4.8

HTML Scraper pro

scrapingxpert/html-scraper-pro

The HTML Scraper Pro is a powerful tool designed to extract the HTML source code and metadata from websites. It uses advanced web scraping techniques to retrieve the full HTML content of web pages,page title and HTTP status code.This tool is ideal for data extraction, website analysis, and archiving

scrapingxpert

354

5.0

Archive Blog Detail Extractor

getdataforme/archive-blog-detail-extractor

The Archive Blog Detail Extractor is an Apify tool designed for scraping detailed information from Internet Archive blog posts. It captures titles, descriptions, comments, and metadata in structured JSON format, supporting customizable URLs and item limits....

GetDataForMe

Medium Publication Scraper Pro

red.cars/medium-publication-scraper

Enterprise-grade Medium data extraction with comprehensive filtering and multi-mode support. Extract publications, authors, articles and trending content without API keys.

AutomateLab

Solana Token Safety Checker — Rug Pull & Honeypot Detection

sanjeep/solana-token-safety-checker

7-point security audit for any Solana token. Detects honeypots, rug pulls, concentrated holders, serial scammers, and fake liquidity. Essential for meme coin traders and DeFi investors.