AP News Scraper
Pricing
from $1.00 / 1,000 listing results
AP News Scraper
Scrape news articles from AP News hub and topic pages. Extract enriched article data including full body text, authors with bios, tags, sections, publish dates, and video metadata. Ideal for news monitoring, media research, and content analysis.
Pricing
from $1.00 / 1,000 listing results
Rating
0.0
(0)
Developer

FalconScrape
Actor stats
1
Bookmarked
2
Total users
1
Monthly active users
3 days ago
Last modified
Categories
Share
AP News Scraper
Extract news articles from AP News (Associated Press). Scrape by topic hub or provide direct article URLs. Get lean metadata listings or enriched articles with full body text, authors, tags, and more.
Features
- Topic/Hub Discovery: Scrape any AP News hub page (e.g.
/hub/technology,/hub/iran) with automatic pagination - Direct Article URLs: Provide specific article URLs to scrape directly
- Two Output Modes: Choose between lean listings (fast, metadata only) or enriched articles (full content)
- Rich Article Data: Full body text, authors with bios, tags, section, publish/modified dates, video metadata
- Fast & Lightweight: Uses CheerioCrawler (no browser needed) for maximum speed and low cost
How It Works
- Provide Hub or Article URLs - Enter one or more AP News hub URLs (e.g.
https://apnews.com/hub/technology) or direct article URLs. - Choose Output Mode - Enable "Enrich articles" for full article content, or disable it for fast metadata-only listings.
- Run the Scraper - The scraper discovers articles from hub pages (with pagination) and extracts the data.
Input
| Field | Type | Default | Description |
|---|---|---|---|
hubUrls | string[] | - | AP News hub URLs to scrape, e.g. https://apnews.com/hub/iran |
articleUrls | string[] | - | Specific article URLs to scrape directly |
enrichArticles | boolean | true | When enabled, visits each article page for full content. When disabled, outputs lean listings only. |
maxPages | integer | 5 | Max pagination depth per hub URL (~54 articles per page) |
maxArticles | integer | 100 | Maximum total articles to process |
proxyConfiguration | object | none | Optional proxy settings. Datacenter proxies are sufficient. |
At least one of hubUrls or articleUrls must be provided.
Sample Output Data
Enriched Article (enrichArticles: true)
{"type": "article","url": "https://apnews.com/article/iran-war-images-misinformation-russia-israel-9e495017dc5c4bf24a0b6152863dbfb1","title": "Visual misinformation about Iran war fueled by state actors | AP News","description": "A deluge of misrepresented or fabricated videos has spread widely online since the Iran war began...","authors": [{"name": "Melissa Goldin","url": "https://apnews.com/author/melissa-goldin","email": "mgoldin@ap.org","jobTitle": "News verification reporter","imageUrl": "https://assets.apnews.com/.../profile-melissa-goldin.PNG","bio": "Goldin debunks, analyzes and tracks misinformation for The Associated Press."}],"publishedDate": "2026-03-07T05:05:11Z","modifiedDate": "2026-03-07T05:05:11Z","section": "World News","tags": ["Artificial intelligence", "Iran war", "Misinformation"],"imageUrl": "https://dims.apnews.com/dims4/default/39d564b/.../resize/980x653!/quality/90/?url=...","bodyText": "As attacks spread after the bombing of Iran by U.S. and Israeli forces...","bodyParagraphs": 16,"wordCount": 809,"hasVideo": false,"videos": []}
Lean Listing (enrichArticles: false)
{"type": "listing","url": "https://apnews.com/article/iran-war-images-misinformation-russia-israel-9e495017dc5c4bf24a0b6152863dbfb1","title": "State actors are behind much of the visual misinformation about the Iran war","timestamp": "2026-03-07T05:05:11.000Z","timestampMs": 1772859911000,"imageUrl": "https://dims.apnews.com/dims4/default/c061e4b/.../resize/667x445!/format/webp/quality/90/?url=...","readingTime": null,"source": "hub","hubUrl": "https://apnews.com/hub/technology"}
Pricing
This Actor uses a Pay Per Event pricing model. You only pay for the results you get.
| Event | Price | Per 1,000 results | What you get |
|---|---|---|---|
| Enriched article | $0.004 | $4.00 | Full body text, authors, tags, section, description, images, video metadata |
| Lean listing | $0.001 | $1.00 | Title, URL, timestamp, thumbnail |
| Actor start | $0.00005/GB | - | One-time charge per run based on memory |
Cost examples
| Scenario | Estimated cost |
|---|---|
| 1 hub, 5 pages, enriched (~270 articles) | ~$1.08 |
| 1 hub, 5 pages, lean only (~270 articles) | ~$0.27 |
| 10 direct article URLs, enriched | ~$0.04 |