# Wayback Machine Domain Snapshot Extractor

**Use case:** Export archived snapshots of a domain from the Wayback Machine CDX API, including timestamps, HTTP status codes, MIME types, content digests, and replay URLs.

## Input

```json
{
  "url": "example.com",
  "matchType": "domain",
  "maxSnapshots": 1000,
  "fromDate": "20200101",
  "toDate": "20251231",
  "filterStatusCodes": [],
  "excludeStatusCodes": [],
  "filterMimeTypes": [],
  "pageSize": 10000,
  "collapse": "urlkey",
  "outputWaybackUrl": true
}
```
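
To make the input fields more concrete, the sketch below shows how they could map onto query parameters of the public Wayback Machine CDX API (`url`, `matchType`, `from`, `to`, `collapse`, `limit`, `filter`). The helper name `build_cdx_url` and the mapping itself are assumptions for illustration; the Actor's actual request logic is not documented in this example, and `pageSize` and `outputWaybackUrl` are left out because their handling is not specified here.

```python
import re
from urllib.parse import urlencode

# Public CDX endpoint. Whether the Actor calls it exactly like this is an assumption.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_url(actor_input: dict) -> str:
    """Hypothetical helper: translate an input object into a CDX query URL."""
    params = [
        ("url", actor_input["url"]),
        ("matchType", actor_input.get("matchType", "exact")),
        ("output", "json"),
    ]
    # Optional single-valued parameters, skipped when empty or missing.
    for cdx_name, input_name in [
        ("from", "fromDate"),
        ("to", "toDate"),
        ("collapse", "collapse"),
        ("limit", "maxSnapshots"),
    ]:
        value = actor_input.get(input_name)
        if value:
            params.append((cdx_name, str(value)))
    # CDX filters are regexes that are ANDed together, so several allowed
    # values go into one alternation and each exclusion gets its own filter.
    include = actor_input.get("filterStatusCodes") or []
    if include:
        params.append(("filter", "statuscode:(" + "|".join(map(str, include)) + ")"))
    for code in actor_input.get("excludeStatusCodes") or []:
        params.append(("filter", f"!statuscode:{code}"))
    mimes = actor_input.get("filterMimeTypes") or []
    if mimes:
        pattern = "|".join(re.escape(m) for m in mimes)
        params.append(("filter", f"mimetype:({pattern})"))
    return CDX_ENDPOINT + "?" + urlencode(params)


# The input from this example, minus the fields the sketch does not map.
example_input = {
    "url": "example.com",
    "matchType": "domain",
    "maxSnapshots": 1000,
    "fromDate": "20200101",
    "toDate": "20251231",
    "filterStatusCodes": [],
    "excludeStatusCodes": [],
    "filterMimeTypes": [],
    "collapse": "urlkey",
}

print(build_cdx_url(example_input))
```

With the empty filter arrays above, no `filter` parameter is emitted, so the query returns snapshots of every status code and MIME type, collapsed to one row per `urlkey`.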

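## Output

Each result item contains the fields listed below. The listing gives each field's display label and format, not sample data.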

```json
{
  "originalUrl": {
    "label": "Original URL",
    "format": "link"
  },
  "timestamp": {
    "label": "Timestamp"
  },
  "statusCode": {
    "label": "Status Code"
  },
  "mimeType": {
    "label": "MIME Type"
  },
  "digest": {
    "label": "Content Digest"
  },
  "length": {
    "label": "Size (bytes)",
    "format": "number"
  },
  "urlKey": {
    "label": "URL Key"
  },
  "waybackUrl": {
    "label": "Wayback URL",
    "format": "link"
  }
}
```
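
The relationship between `timestamp`, `originalUrl`, and `waybackUrl` can be sketched as follows. This assumes `timestamp` holds the raw 14-digit CDX value (`YYYYMMDDhhmmss`, treated as UTC) and that `waybackUrl` follows the standard `https://web.archive.org/web/<timestamp>/<url>` replay pattern; the helper names and the sample record are hypothetical, not actual Actor output.

```python
from datetime import datetime, timezone


def parse_wayback_timestamp(ts: str) -> datetime:
    """Parse a 14-digit CDX timestamp into a timezone-aware datetime (assumed UTC)."""
    return datetime.strptime(ts, "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)


def build_wayback_url(timestamp: str, original_url: str) -> str:
    """Compose a replay URL from a snapshot timestamp and the captured URL."""
    return f"https://web.archive.org/web/{timestamp}/{original_url}"


# Hypothetical record shaped like the output fields above (values are made up).
record = {
    "originalUrl": "http://example.com/",
    "timestamp": "20200315123045",
    "statusCode": "200",
    "mimeType": "text/html",
}

captured_at = parse_wayback_timestamp(record["timestamp"])
replay_url = build_wayback_url(record["timestamp"], record["originalUrl"])
print(captured_at.isoformat(), replay_url)
```

Parsing the timestamp into a `datetime` makes it easier to sort or bucket snapshots by date after export.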

## About this Actor

This example demonstrates how to use [Wayback Machine CDX Bulk Extractor](https://apify.com/automation-lab/wayback-machine-cdx-extractor) with a specific input configuration. Visit the [Actor detail page](https://apify.com/automation-lab/wayback-machine-cdx-extractor) to learn more, explore other use cases, and run it yourself.