Wikipedia Scraper - Articles, Summaries & Search avatar

Wikipedia Scraper - Articles, Summaries & Search

Pricing

from $3.00 / 1,000 results

Go to Apify Store
Wikipedia Scraper - Articles, Summaries & Search

Wikipedia Scraper - Articles, Summaries & Search

Scrape Wikipedia articles, summaries, and search results. Supports summary, search, and random article modes via the Wikipedia REST API. Pure HTTP, no browser needed.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

deusex machine

deusex machine

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

17 days ago

Last modified

Categories

Share

Wikipedia Scraper

Fetches article summaries from Wikipedia. Search for articles, look up specific titles, or grab random pages. Supports 300+ language editions.

What data does it extract?

FieldDescription
titleArticle title
extractPlain text summary (first few paragraphs)
descriptionShort Wikidata description (e.g. American tech company)
thumbnailURL to the article's thumbnail image
originalImageURL to the full-resolution main image
pageUrlDirect link to the Wikipedia article
pageIdNumeric Wikipedia page ID
lastModifiedISO 8601 timestamp of the last edit
coordinatesGeographic coordinates {lat, lon} if the article has them
contentLengthCharacter count of the extract text

Use cases

  • Knowledge bases — Pull article summaries for product databases, chatbots, or search indexes
  • NLP training data — Collect thousands of article summaries in any language for text models
  • Content enrichment — Add Wikipedia descriptions and images to existing datasets (companies, cities, people)
  • Education tools — Build quiz apps or study guides from random or topic-specific articles
  • Geographic datasets — Extract coordinates from place-related articles for mapping projects

How to use

Search for articles on a topic:

{
"mode": "search",
"searchQuery": "machine learning",
"maxResults": 20
}

Get summaries for specific articles:

{
"mode": "summary",
"titles": ["SpaceX", "Tesla, Inc.", "Elon Musk"]
}

Fetch random articles in Spanish:

{
"mode": "random",
"language": "es",
"maxResults": 50
}

Input parameters

ParameterTypeDefaultDescription
modestring"search"Operation mode: "summary", "search", or "random"
titlesarray[]Article titles to fetch (used in summary mode)
searchQuerystring""Search query text (used in search mode)
languagestring"en"Wikipedia language code: en, es, de, fr, ja, pt, etc.
maxResultsinteger50Max results for search and random modes (1-200)

Output example

{
"title": "SpaceX",
"extract": "Space Exploration Technologies Corp., commonly referred to as SpaceX, is an American spacecraft manufacturer, launch service provider, defense contractor, and satellite communications company headquartered in Hawthorne, California.",
"description": "American space transportation company",
"thumbnail": "https://upload.wikimedia.org/wikipedia/commons/thumb/2/2e/SpaceX_logo_black.svg/320px-SpaceX_logo_black.svg.png",
"originalImage": "https://upload.wikimedia.org/wikipedia/commons/2/2e/SpaceX_logo_black.svg",
"pageUrl": "https://en.wikipedia.org/wiki/SpaceX",
"pageId": 2677395,
"lastModified": "2025-03-25T08:12:45Z",
"coordinates": { "lat": 33.9206, "lon": -118.3283 },
"contentLength": 412
}

Performance & cost

  • Uses the Wikipedia REST and Action APIs directly — no browser, minimal compute
  • Fetches up to 50 search results per API call, then gets each summary individually
  • A search for 50 articles takes about 10-15 seconds (rate-limited to be polite to Wikipedia)

FAQ

Does this get the full article text? No, it returns the summary (first few paragraphs). Full article content would require a different API endpoint and is much heavier on data.

What languages are supported? Any Wikipedia language edition. Use the standard language code: en for English, es for Spanish, de for German, ja for Japanese, etc. Full list at wikipedia.org.

What's the difference between summary and search mode? summary takes exact article titles and returns their summaries. search takes a free-text query and finds matching articles.

Why do some articles have no image? Not every Wikipedia article has an associated image. The thumbnail and originalImage fields will be null for those.