Pricing

$1.00 / 1,000 article extracteds

Wikipedia Article Scraper - Search & Extract Content

Search and extract Wikipedia article metadata, summaries, and content via the official MediaWiki API. No scraping overhead — pure API integration with high reliability.

Pricing

$1.00 / 1,000 article extracteds

Rating

0.0

(0)

Developer

Pierrick McD0nald

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

Wikipedia Article Scraper — Search & Extract Content

Extract Wikipedia article metadata, summaries, and content via the official MediaWiki API. This Actor searches Wikipedia by keyword and returns structured data for every matching article — no browser overhead, no scraping complexity, just clean API integration.

Use Cases

Content Research — Gather article summaries and metadata for academic research, content marketing, or knowledge base building.
SEO & Topic Analysis — Extract word counts, article sizes, and publication dates to analyze content depth and freshness across topics.
Data Enrichment — Augment datasets with Wikipedia summaries, thumbnail images, and canonical URLs for entity linking and NLP pipelines.
Multilingual Content — Search across 300+ Wikipedia language editions to build localized content collections.

Input

Field	Type	Required	Description
`searchQuery`	String	Yes	Search term to find Wikipedia articles (e.g., "machine learning", "quantum computing")
`maxResults`	Number	No	Maximum articles to extract, 1–500 (default: 25)
`includeExtract`	Boolean	No	Fetch article introduction/summary text (default: true)
`includeImages`	Boolean	No	Fetch thumbnail image URLs (default: false)
`language`	String	No	Wikipedia language code: en, es, fr, de, ja, etc. (default: "en")
`proxyConfiguration`	Object	No	Proxy settings (optional — Wikipedia API does not require proxy)

Output

The Actor outputs a dataset with the following fields:

{
  "pageId": 233488,
  "title": "Machine learning",
  "url": "https://en.wikipedia.org/wiki/Machine_learning",
  "snippet": "Machine learning (ML) is a field of study in artificial intelligence...",
  "extract": "Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of statistical algorithms...",
  "wordCount": 15287,
  "size": 141291,
  "thumbnail": "https://upload.wikimedia.org/wikipedia/commons/thumb/...",
  "timestamp": "2026-05-15T10:30:00Z",
  "language": "en"
}

Pricing

Pay per event: $0.001 per article extracted.

No minimums, no subscriptions. You only pay for the results you receive. The Wikipedia MediaWiki API is free and public, so compute costs are minimal and margins stay high.

Limitations

Maximum 500 results per run (Wikipedia API limit)
Article extracts are limited to the introduction/summary section
Thumbnail images are only available when includeImages is enabled and the article has an image
Rate limits apply per Wikipedia language edition (handled automatically with retries)

FAQ

Q: Do I need a Wikipedia API key? A: No. This Actor uses the public MediaWiki API with no authentication required.

Q: Can I search in languages other than English? A: Yes. Set the language field to any valid Wikipedia language code (e.g., "es" for Spanish, "ja" for Japanese).

Q: What happens if my search returns thousands of results? A: The Actor respects the maxResults limit and paginates through the API automatically. You only pay for the number of articles actually extracted.

Changelog

v1.0.0 — Initial release

Wikipedia Article Scraper

kayhermes/wikipedia-scraper

Khoa Nguyen

Wikipedia Scraper

automation-lab/wikipedia-scraper

Search and extract Wikipedia articles — titles, summaries, full content, categories, and images. Uses the free MediaWiki API.

Stas Persiianenko

Wikipedia Article Scraper

crawlerbros/wikipedia-scraper

Extract structured data from Wikipedia articles. Get summaries, categories, images, metadata, and descriptions using Wikipedia's official API. Supports 300+ languages.

Crawler Bros

Wikipedia Scraper

leftwinglautus/wikipedia-scraper

Scrape Wikipedia articles via the official Wikipedia API. Search articles, get summaries, full content, and categories.

Moeeze Hassan

Wikipedia Article Extractor

rambunctious_fingerprint/wikipedia-extractor

Casey Marsh

Wikipedia Scraper — Search Articles & Extract Content

puskin/wikipedia-scraper

Search Wikipedia articles, get summaries, and extract full page content via the free MediaWiki API. No authentication required — perfect for research, AI training data, and knowledge base building.

Giovanni Bucci

Wikipedia Article Extractor

glassventures/wikipedia-article-extractor

Extract Wikipedia articles via MediaWiki API. Get full text, summaries, sections, categories, images, links. Multi-language. Perfect for AI/ML training data and RAG.

Glass Ventures

Wikipedia Scraper - Article Content Extractor

lulzasaur/wikipedia-scraper

Scrape Wikipedia articles. Search by topic and extract full structured content: summaries, sections, infobox data, categories, references, images, and edit history for any article.