Wikipedia Scraper - Articles, Summaries & Search
Pricing
Pay per usage
Go to Apify Store
Under maintenance
Wikipedia Scraper - Articles, Summaries & Search
Scrape Wikipedia articles, summaries, and search results. Supports summary, search, and random article modes via the Wikipedia REST API. Pure HTTP, no browser needed.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
oscar lira
Maintained by Community
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Apify actor that scrapes Wikipedia articles, summaries, and search results using the Wikipedia REST API. Pure HTTP requests, no browser needed.
Features
- Summary mode - Fetch article summaries by title
- Search mode - Find articles matching a search query
- Random mode - Get random Wikipedia articles
- Multi-language support (any Wikipedia language edition)
- Extracts: title, summary, description, images, coordinates, metadata
Input
| Field | Type | Description | Default |
|---|---|---|---|
mode | enum | summary, search, or random | summary |
titles | string[] | Article titles (summary mode) | [] |
searchQuery | string | Search query (search mode) | "" |
language | string | Wikipedia language code | "en" |
maxResults | integer | Max results (1-200) | 50 |
Output
Each result contains:
title- Article titleextract- Plain text summarydescription- Short descriptionthumbnail- Thumbnail image URLoriginalImage- Full-size image URLpageUrl- Link to Wikipedia articlepageId- Wikipedia page IDlastModified- Last modification date (ISO 8601)coordinates- Geographic coordinates (if available)contentLength- Length of the extract in characters
Examples
Fetch article summaries
{"mode": "summary","titles": ["Python_(programming_language)", "JavaScript"],"language": "en"}
Search Wikipedia
{"mode": "search","searchQuery": "artificial intelligence","language": "en","maxResults": 10}
Get random articles
{"mode": "random","language": "en","maxResults": 5}
Technical Details
- Runtime: Node.js 20 (Alpine)
- No browser required - uses native
fetch - Wikipedia REST API v1 + MediaWiki Action API
- Respects rate limits with built-in delays