Wikipedia Revision History Scraper
Pricing
$20.00/month + usage
Go to Apify Store
Under maintenance
Wikipedia Revision History Scraper
Scrape the revision history of any Wikipedia page, including metadata and diffs for each revision.
Pricing
$20.00/month + usage
Rating
0.0
(0)
Developer

ViewSource
Maintained by Community
Actor stats
0
Bookmarked
4
Total users
1
Monthly active users
7 months ago
Last modified
Categories
Share
Scrapes the revision history of any Wikipedia page, including diffs and metadata for each revision.
Input
| Field | Type | Required | Description |
|---|---|---|---|
wikipediaPage | string | Yes | Wikipedia page URL (e.g. "https://en.wikipedia.org/wiki/LangChain") |
limit | integer | No | Maximum umber of revisions to fetch (default: 50) |
includeDiff | boolean | No | Whether to include diff between revisions in output (default: true) |
Example input
{"wikipediaPage": "https://en.wikipedia.org/wiki/LangChain","limit": 10,"includeDiff": true}
Output
The actor outputs a JSON array where each object represents a revision. Each object contains:
revid: revision IDparentid: parent revision IDminor: whether the revision was a minor edituser: username of the user who made the revisiontimestamp: timestamp of the revisioncomment: edit summarytags: array of tags associated with the revision (e.g. ["visualeditor"])size: size of the page after the revision, in bytessize_diff: difference between the size of the page after and before the revision, in bytesdiff_raw: raw HTML diff between the revision and its parent (ifincludeDiffis true)diff_parsed: array of line changes between the revision and its parent (ifincludeDiffis true)
Example output
[{"revid": 123456789,"parentid": 123456788,"minor": false,"user": "ExampleUser","timestamp": "2024-01-01T12:34:56Z","comment": "Fixed typo","tags": ["visualeditor"],"size": 12345,"size_diff": 10,"diff_raw": "<tr>...</tr>","diff_parsed": [{ "before": "old line", "after": "new line" }]}]

