Websites Archiver (Wayback Machine) avatar
Websites Archiver (Wayback Machine)

Pricing

$9.00/month + usage

Go to Apify Store
Websites Archiver (Wayback Machine)

Websites Archiver (Wayback Machine)

Effortlessly archive any website with our Automated Website Archiving Tool. It leverages the power of the Wayback Machine at web.archive.org to ensure your sites are preserved for future reference.

Pricing

$9.00/month + usage

Rating

5.0

(2)

Developer

Web Harvester

Web Harvester

Maintained by Community

Actor stats

3

Bookmarked

66

Total users

5

Monthly active users

19 days ago

Last modified

Share

Website Archiver (Wayback Machine)

Effortlessly archive any website with our Automated Website Archiving Tool. It leverages the power of the Wayback Machine at web.archive.org to ensure your sites are preserved for future reference.

Usage

The actor accepts an input in the following format:

{
"startUrls": [
{
"url": "https://crawlee.dev"
}
],
"fastArchiveMode": true,
"archiveErrorPages": true,
"storeArchivedResources": false
}

Input Options

OptionTypeDefaultDescription
startUrlsarrayrequiredList of URLs to archive
fastArchiveModebooleantrueWhen enabled, sends archive request without waiting for full completion. Faster but provides less detailed output.
archiveErrorPagesbooleantrueWhether to archive pages that return HTTP 4xx and 5xx status codes
storeArchivedResourcesbooleanfalseWhether to include the list of archived resources in the output (only available in full mode)

Output

Full Archive Mode (fastArchiveMode: false)

{
"url": "https://crawlee.dev",
"archivedUrl": "https://web.archive.org/web/20240610223756/https://crawlee.dev/",
"archived": true,
"archivedAt": "2024-06-10T22:38:15.643Z",
"archivedResourcesCount": 69,
"archivedResources": [
"https://crawlee.dev/",
"https://crawlee.dev/js/custom.js",
"https://crawlee.dev/assets/css/styles.5a93fba9.css"
]
}

Fast Archive Mode (fastArchiveMode: true)

{
"url": "https://crawlee.dev",
"archivedUrl": "https://web.archive.org/web/20240610223756/https://crawlee.dev/",
"archived": true,
"archivedAt": "2024-06-10T22:38:15.643Z"
}

Failed Archive

{
"url": "https://example.com/blocked",
"archivedUrl": null,
"note": "This URL has been excluded from the Wayback Machine",
"archived": false
}

Running the Actor

To run the actor, you'll need to have an Apify account. Once you're logged in, you can run the actor from the Apify Console. You can also use the Apify API to run the actor programmatically.

For more information on how to use Apify Actors, please refer to the Apify documentation.