Wayback Machine Scraper

Pricing

from $1.00 / 1,000 snapshots scraped

List Internet Archive Wayback Machine snapshots for one or more URLs. Returns timestamp, snapshot URL, HTTP status, MIME type, digest. Useful for tracking website changes over time, OSINT research, content recovery, and brand monitoring.

Developer

Gio

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 2 days ago

List Internet Archive Wayback Machine snapshots for one or more URLs. Uses the CDX server API.

Returns timestamp, snapshot URL, HTTP status, MIME type, digest, byte length.

Useful for tracking website changes over time, OSINT research, content recovery, brand monitoring, link rot studies.
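For orientation, a lookup against the public CDX server can be sketched as a plain query URL. This is a minimal illustration, not the actor's actual implementation: the helper name `build_cdx_url` is hypothetical, and the assumption is that the actor maps its input fields onto the CDX parameters `url`, `from`, `to`, and `limit`.

```python
from urllib.parse import urlencode

# Public CDX server endpoint of the Wayback Machine.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_url(url, from_ts=None, to_ts=None, limit=50):
    """Build a CDX query URL for one target URL (hypothetical helper).

    from_ts / to_ts are timestamp prefixes such as "2021" or "20210105".
    """
    params = {"url": url, "output": "json", "limit": limit}
    if from_ts:
        params["from"] = from_ts
    if to_ts:
        params["to"] = to_ts
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


print(build_cdx_url("apify.com", from_ts="2021", to_ts="2022", limit=5))
```

Opening the printed URL in a browser is a quick way to see the raw rows that the fields listed above are derived from.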

Free vs. paid

  • Free plan: returns mock records for each URL instead of live data.
  • Paid plan: returns real, live Wayback Machine data.

Input

Field | Type | Description
urls | Array (required) | List of URLs to look up.
from | String | Start date filter (YYYY, YYYYMMDD, or YYYYMMDDhhmmss).
to | String | End date filter.
maxSnapshotsPerUrl | Integer | Default 50, max 1000.
debug | Boolean | Verbose logs.
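The input fields above can be assembled and checked before starting a run. The sketch below is only an illustration: `build_input` is a hypothetical helper, and it assumes that `to` accepts the same three formats as `from` and that `maxSnapshotsPerUrl` must be at least 1 (the table states only the default and the maximum).

```python
import re

# Matches YYYY, YYYYMMDD, or YYYYMMDDhhmmss (digits only, no calendar check).
DATE_RE = re.compile(r"^(\d{4}|\d{8}|\d{14})$")


def build_input(urls, from_date=None, to_date=None,
                max_snapshots_per_url=50, debug=False):
    """Return a run-input dict using the field names from the Input table."""
    if not urls:
        raise ValueError("urls is required and must not be empty")
    # Lower bound of 1 is an assumption; the documented maximum is 1000.
    if not 1 <= max_snapshots_per_url <= 1000:
        raise ValueError("maxSnapshotsPerUrl must be between 1 and 1000")

    run_input = {
        "urls": list(urls),
        "maxSnapshotsPerUrl": max_snapshots_per_url,
        "debug": debug,
    }
    for key, value in (("from", from_date), ("to", to_date)):
        if value is not None:
            if not DATE_RE.match(value):
                raise ValueError(f"{key} must be YYYY, YYYYMMDD, or YYYYMMDDhhmmss")
            run_input[key] = value
    return run_input


print(build_input(["apify.com"], from_date="2021", to_date="20211231"))
```

The resulting dict can be pasted into the actor's JSON input editor or passed as the run input through an API client.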

Output

{
  "url": "apify.com",
  "timestamp": "20210105141317",
  "snapshotUrl": "https://web.archive.org/web/20210105141317/apify.com",
  "originalUrl": "https://apify.com/",
  "statusCode": "200",
  "mimeType": "text/html",
  "digest": "QPBSADYPYQEHJ4NTAXNCLN7QHFFROZHU",
  "length": 158034
}
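A record like the one above is easier to work with once the timestamp and status code are converted to native types. The following is a small sketch, assuming records always carry the keys shown in the sample; `parse_snapshot` is a hypothetical helper, and the handling of non-numeric status codes is a defensive assumption rather than documented behavior.

```python
from datetime import datetime, timezone


def parse_snapshot(record):
    """Convert one output record into typed values (hypothetical helper)."""
    # Wayback timestamps are 14-digit UTC values: YYYYMMDDhhmmss.
    captured_at = datetime.strptime(
        record["timestamp"], "%Y%m%d%H%M%S"
    ).replace(tzinfo=timezone.utc)

    # statusCode is a string in the sample; tolerate non-numeric values.
    status_text = record["statusCode"]
    status = int(status_text) if status_text.isdigit() else None

    return {
        "captured_at": captured_at,
        "snapshot_url": record["snapshotUrl"],
        "status": status,
        "mime_type": record["mimeType"],
        "size_bytes": record["length"],
    }


sample = {
    "url": "apify.com",
    "timestamp": "20210105141317",
    "snapshotUrl": "https://web.archive.org/web/20210105141317/apify.com",
    "originalUrl": "https://apify.com/",
    "statusCode": "200",
    "mimeType": "text/html",
    "digest": "QPBSADYPYQEHJ4NTAXNCLN7QHFFROZHU",
    "length": 158034,
}
print(parse_snapshot(sample)["captured_at"].isoformat())
```

Sorting parsed records by `captured_at`, or grouping them by `digest`, is one way to spot when a page's content actually changed between captures.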

Pricing

$0.001/snapshot. 1,000 snapshots = $1.
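The arithmetic above gives a simple upper bound on the cost of a run. This sketch assumes that only returned snapshots are billed and that `maxSnapshotsPerUrl` caps the count per URL; `estimate_max_cost` is a hypothetical helper, and platform fees outside the per-snapshot price are not modeled.

```python
from decimal import Decimal

# Price stated in the Pricing section: $0.001 per snapshot.
PRICE_PER_SNAPSHOT = Decimal("0.001")


def estimate_max_cost(num_urls, max_snapshots_per_url=50):
    """Upper bound in USD, assuming every URL returns the maximum number of snapshots."""
    return PRICE_PER_SNAPSHOT * num_urls * max_snapshots_per_url


# 20 URLs at the default cap of 50 snapshots is at most 1,000 snapshots.
print(estimate_max_cost(20))
```

`Decimal` is used instead of floats so that fractions of a cent do not accumulate rounding error.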

Limitations

  • The Wayback Machine's CDX server has soft rate limits (~1 req/sec). The actor waits 400 ms between URL queries.
  • Very popular URLs can have millions of snapshots. Use the from and to fields to narrow the date range.
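The pause between URL queries described above can be sketched as a small generator. This is an illustration of the idea only, not the actor's code: `throttled` is a hypothetical helper, and the `sleep` parameter exists so the delay can be swapped out in tests.

```python
import time


def throttled(items, delay_seconds=0.4, sleep=time.sleep):
    """Yield items one by one, pausing delay_seconds between consecutive items."""
    for index, item in enumerate(items):
        if index > 0:
            # No pause before the first item, only between queries.
            sleep(delay_seconds)
        yield item
```

A caller would loop with `for url in throttled(urls):` and issue one query per iteration; combining this with a narrow `from`/`to` window keeps both the request rate and the result size manageable.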

If this actor helped you, please leave a review on the Apify Store.