Wikipedia Scraper - Extract Articles, Infoboxes & Content API avatar

Wikipedia Scraper - Extract Articles, Infoboxes & Content API

Pricing

Pay per usage

Go to Apify Store
Wikipedia Scraper - Extract Articles, Infoboxes & Content API

Wikipedia Scraper - Extract Articles, Infoboxes & Content API

Scrape Wikipedia articles, infoboxes, and structured content. Extract knowledge base data at scale. JSON/CSV export via API. Need custom data extraction? Visit https://fatihai.app/tools/data-scraping for managed scraping services.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Fatih Dağüstü

Fatih Dağüstü

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

0

Monthly active users

2 days ago

Last modified

Categories

Share

Wikipedia Scraper

Scrape Wikipedia articles, summaries, and search results in any language. Extract structured content, images, categories, linked articles, and coordinates.

Features

  • Search — Find articles by keyword
  • Full Articles — Sections, categories, linked articles, images
  • Summaries — Quick extracts with images and descriptions
  • Multi-language — 300+ Wikipedia editions (en, de, fr, es, ja, tr, etc.)

Input

FieldTypeDescription
scrapeTypestringsearch, articles, summaries
searchQuerystringSearch term
articleTitlesarrayArticle titles or Wikipedia URLs
languagestringLanguage code (default: en)
maxItemsnumberMaximum results (default: 50)

Output Example

{
"type": "article",
"title": "Artificial intelligence",
"pageId": 233488,
"url": "https://en.wikipedia.org/wiki/Artificial_intelligence",
"description": "Intelligence of machines",
"extract": "Artificial intelligence (AI) is intelligence demonstrated by machines...",
"image": "https://upload.wikimedia.org/...",
"categories": ["Artificial intelligence", "Computer science"],
"linkedArticles": ["Machine learning", "Deep learning"],
"lastModified": "2024-01-15T10:30:00Z"
}

Use Cases

  • Research — Quick data gathering on any topic
  • Knowledge Graphs — Build structured knowledge bases
  • Content Creation — Generate fact-based articles
  • Education — Create learning materials
  • NLP Training — Build text corpora and datasets

Cost

Uses Pay Per Event pricing. Each scraped article counts as one result.