Wikipedia Scraper - Articles, Summaries & Search avatar

Wikipedia Scraper - Articles, Summaries & Search

Under maintenance

Pricing

Pay per usage

Go to Apify Store
Wikipedia Scraper - Articles, Summaries & Search

Wikipedia Scraper - Articles, Summaries & Search

Under maintenance

Scrape Wikipedia articles, summaries, and search results. Supports summary, search, and random article modes via the Wikipedia REST API. Pure HTTP, no browser needed.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

oscar lira

oscar lira

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Categories

Share

Apify actor that scrapes Wikipedia articles, summaries, and search results using the Wikipedia REST API. Pure HTTP requests, no browser needed.

Features

  • Summary mode - Fetch article summaries by title
  • Search mode - Find articles matching a search query
  • Random mode - Get random Wikipedia articles
  • Multi-language support (any Wikipedia language edition)
  • Extracts: title, summary, description, images, coordinates, metadata

Input

FieldTypeDescriptionDefault
modeenumsummary, search, or randomsummary
titlesstring[]Article titles (summary mode)[]
searchQuerystringSearch query (search mode)""
languagestringWikipedia language code"en"
maxResultsintegerMax results (1-200)50

Output

Each result contains:

  • title - Article title
  • extract - Plain text summary
  • description - Short description
  • thumbnail - Thumbnail image URL
  • originalImage - Full-size image URL
  • pageUrl - Link to Wikipedia article
  • pageId - Wikipedia page ID
  • lastModified - Last modification date (ISO 8601)
  • coordinates - Geographic coordinates (if available)
  • contentLength - Length of the extract in characters

Examples

Fetch article summaries

{
"mode": "summary",
"titles": ["Python_(programming_language)", "JavaScript"],
"language": "en"
}

Search Wikipedia

{
"mode": "search",
"searchQuery": "artificial intelligence",
"language": "en",
"maxResults": 10
}

Get random articles

{
"mode": "random",
"language": "en",
"maxResults": 5
}

Technical Details

  • Runtime: Node.js 20 (Alpine)
  • No browser required - uses native fetch
  • Wikipedia REST API v1 + MediaWiki Action API
  • Respects rate limits with built-in delays