Wayback Machine Scraper
Pricing
from $1.00 / 1,000 snapshot scrapeds
Wayback Machine Scraper
List Internet Archive Wayback Machine snapshots for one or more URLs. Returns timestamp, snapshot URL, HTTP status, MIME type, digest. Useful for tracking website changes over time, OSINT research, content recovery, and brand monitoring.
Pricing
from $1.00 / 1,000 snapshot scrapeds
Rating
0.0
(0)
Developer
Gio
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
List Internet Archive Wayback Machine snapshots for one or more URLs. Uses the CDX server API.
Returns timestamp, snapshot URL, HTTP status, MIME type, digest, byte length.
Useful for tracking website changes over time, OSINT research, content recovery, brand monitoring, link rot studies.
Free vs. paid
- Free plan: mock records for each URL.
- Paid plan: real, live Wayback Machine data.
Input
| Field | Type | Description |
|---|---|---|
urls | Array (required) | List of URLs to look up. |
from | String | Start date filter (YYYY, YYYYMMDD, or YYYYMMDDhhmmss). |
to | String | End date filter. |
maxSnapshotsPerUrl | Integer | Default 50, max 1000. |
debug | Boolean | Verbose logs. |
Output
{"url": "apify.com","timestamp": "20210105141317","snapshotUrl": "https://web.archive.org/web/20210105141317/apify.com","originalUrl": "https://apify.com/","statusCode": "200","mimeType": "text/html","digest": "QPBSADYPYQEHJ4NTAXNCLN7QHFFROZHU","length": 158034}
Pricing
$0.001/snapshot. 1,000 snapshots = $1.
Limitations
- Wayback Machine's CDX server has soft rate limits (~1 req/sec). The actor adds 400ms between URL queries.
- For very popular URLs, the number of snapshots can be massive (millions). Use
from/toto scope.
If this actor helped you, please leave a review on the Apify Store.