
Wikipedia Revision History Scraper
Pricing
$20.00/month + usage
Go to Store

Wikipedia Revision History Scraper
Scrape the revision history of any Wikipedia page, including metadata and diffs for each revision.
0.0 (0)
Pricing
$20.00/month + usage
0
Total users
2
Monthly users
2
Runs succeeded
>99%
Last modified
21 days ago
Scrapes the revision history of any Wikipedia page, including diffs and metadata for each revision.
Input
Field | Type | Required | Description |
---|---|---|---|
wikipediaPage | string | Yes | Wikipedia page URL (e.g. "https://en.wikipedia.org/wiki/LangChain") |
limit | integer | No | Maximum umber of revisions to fetch (default: 50 ) |
includeDiff | boolean | No | Whether to include diff between revisions in output (default: true ) |
Example input
{"wikipediaPage": "https://en.wikipedia.org/wiki/LangChain","limit": 10,"includeDiff": true}
Output
The actor outputs a JSON array where each object represents a revision. Each object contains:
revid
: revision IDparentid
: parent revision IDminor
: whether the revision was a minor edituser
: username of the user who made the revisiontimestamp
: timestamp of the revisioncomment
: edit summarytags
: array of tags associated with the revision (e.g. ["visualeditor"])size
: size of the page after the revision, in bytessize_diff
: difference between the size of the page after and before the revision, in bytesdiff_raw
: raw HTML diff between the revision and its parent (ifincludeDiff
is true)diff_parsed
: array of line changes between the revision and its parent (ifincludeDiff
is true)
Example output
[{"revid": 123456789,"parentid": 123456788,"minor": false,"user": "ExampleUser","timestamp": "2024-01-01T12:34:56Z","comment": "Fixed typo","tags": ["visualeditor"],"size": 12345,"size_diff": 10,"diff_raw": "<tr>...</tr>","diff_parsed": [{ "before": "old line", "after": "new line" }]}]