Pricing

Pay per event

🗂️ Google Cache Viewer — Wayback + Archive Alternative

Replaces Google's cached-page view (killed Feb 2024). Queries Wayback Machine + archive.today, returns latest snapshot URL, timestamp, and extracted text content.

Pricing

Pay per event

Rating

0.0

(0)

Developer

NexGenData

Actor stats

Bookmarked

Total users

Monthly active users

10 days ago

Last modified

What it does

For every URL you provide, the actor:

Queries Wayback Machine's public "closest snapshot" API
Queries archive.today's /newest/ endpoint (follows redirect chain)
Returns the freshest available snapshot with URL, ISO timestamp, and source
Optionally fetches the snapshot HTML and extracts title + 8K char text content
Emits a stable content hash for change detection

Example

import requests

r = requests.post(
    "https://api.apify.com/v2/acts/nexgendata~google-cache-viewer/run-sync-get-dataset-items?token=" + APIFY_TOKEN,
    json={
        "urls": [
            "https://example.com/blog/post-now-deleted",
            "https://techcrunch.com/2023/01/01/some-article"
        ],
        "fetchContent": True
    }
)

for item in r.json():
    if item["found"]:
        print(f"{item['url']}")
        print(f"  Archived: {item['latest_timestamp']} via {item['source']}")
        print(f"  Title: {item['content_title']}")
        print(f"  Preview: {item['content_text'][:200]}...")
    else:
        print(f"{item['url']} — NOT ARCHIVED")

Sample output:

https://example.com/blog/post-now-deleted
  Archived: 2023-11-04T08:22:17Z via wayback
  Title: How We Scaled to 10M Users
  Preview: When we hit 10 million monthly users last fall, we learned...

cURL

curl -X POST "https://api.apify.com/v2/acts/nexgendata~google-cache-viewer/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls":["https://example.com/"],"fetchContent":true}'

Why this replaces Google Cache

	Google Cache (dead)	This actor
Status	Shut down Feb 2024	Active
Access	`cache:URL` operator / `webcache.googleusercontent.com`	HTTPS API
Freshness	Last Google crawl	Last Wayback/archive.today snapshot (minutes to months)
Bulk mode	Manual, one URL at a time	200 URLs/run
Text extraction	❌ (raw HTML)	✅ (8K char cleaned text)
Machine-readable	❌	✅ (JSON)
Cost	Free	$0.003 per URL

Common use cases

Dead-link recovery — find the last archived version of a page that 404'd
SEO audits — see what a competitor's page used to say before they rewrote it
Journalism / OSINT — pull the text of pages that were deleted after publication
Legal / compliance — document what a contract/terms page said on a given date
Content monitoring — track if an important page changed (via content_hash)
Affiliate link repair — bulk lookup of product pages that were removed

Pricing

$0.005 per run (startup)
$0.003 per URL looked up (includes content extraction when requested)

100 URLs with content extraction = $0.305. Cheaper than Screaming Frog's archive plugin and no subscription.

FAQ

Q: Does archive.today always have the page? A: Not always. Wayback is broader; archive.today often has freshness Wayback doesn't. The actor queries both and returns the fresher of the two.

Q: What if neither has it? A: Returns found: false. Can't conjure pages that were never archived.

Q: Does this trigger a new archive capture? A: No — read-only. To create a fresh capture, use Wayback's Save Page Now endpoint separately (your request, not ours).

Q: Rate limits? A: Wayback rate-limits shared usage at about 1 request/second per IP. This actor paces accordingly — expect ~1 URL/second.

Q: How old can snapshots be? A: Wayback has archives dating to 1996. For any URL with a public history, you'll likely find something.

Try it

🗂️ Google Cache Viewer on Apify

New to Apify? Get free platform credits.

💻 Code Example — Python

from apify_client import ApifyClient

client = ApifyClient("YOUR_APIFY_TOKEN")
run = client.actor("nexgendata/google-cache-viewer").call(run_input={
    # Fill in the input shape from the actor's input_schema
})

for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

🌐 Code Example — cURL

curl -X POST "https://api.apify.com/v2/acts/nexgendata~google-cache-viewer/run-sync-get-dataset-items?token=YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{ /* input schema */ }'

❓ FAQ

Q: How do I get started? Sign up at apify.com, grab your API token from Settings → Integrations, and run the actor via the Apify console, API, Python SDK, or any integration (Zapier, Make.com, n8n).

Q: What's the typical cost per run? See the pricing section below. Most runs finish under $0.10 for typical batches.

Q: Is this actor maintained? Yes. NexGenData maintains 165+ Apify actors and ships updates regularly. Bug reports via the Apify console issues tab get responses within 24 hours.

Q: Can I use the output commercially? Yes — you own the output data. Check the target site's Terms of Service for any usage restrictions on the scraped content itself.

Q: How do I handle rate limits? Apify manages concurrency and retries automatically. For very large batches (10K+ items), run multiple smaller jobs in parallel instead of one mega-job for better reliability.

💰 Pricing

Pay-per-event pricing — you only pay for what you actually extract.

Actor Start: $0.0001
result: $0.0050

🚀 Apify Affiliate Program

New to Apify? Sign up with our referral link — you get free platform credits on signup, and you help fund the maintenance of this actor fleet.

📚 More From NexGenData

Explore the full catalog, tutorials, Gumroad data packs, and newsletter at thenextgennexus.com — the brand home for everything we ship.

📖 Tutorials & how-to guides
🗂️ Full actor catalog with usage examples
📦 Gumroad data packs (one-time purchases)
📬 Newsletter — monthly drops of new actors and revenue experiments

Built and maintained by NexGenData — 165+ actors covering scraping, enrichment, MCP servers, and automation. 🏠 Home: thenextgennexus.com

Why Google Cache Viewer Beats the Wayback Machine, Archive.today, Bing Cache & Cachedview.com

Feature	NexGenData Google Cache Viewer	Internet Archive Wayback	Archive.today	Bing Cache	Cachedview.com
Cost	$0.002 per URL, pay-per-event	Free (rate-limited, slow)	Free (rate-limited)	Removed by Microsoft	Web-only (no API)
Bulk input	Thousands per run	One per request	One per request	RIP	One per page
Google cache fallback	Yes — `webcache.googleusercontent.com` while it lasted	No	No	RIP	Was the whole product
Wayback fallback	Yes — closest-to-target-date snapshot	Yes (it IS Wayback)	Partial	RIP	No
Archive.today fallback	Yes	No	Yes (it IS them)	RIP	No
Structured output	Yes — JSON with source + retrieved text + timestamp	HTML only	HTML only	RIP	HTML only
Schedule + webhook	Native	None	None	RIP	None
Monthly minimum	None	None	Donations	RIP	None
Auth	Apify token	None	None	RIP	None

Google deprecated webcache.googleusercontent.com in 2024. Bing Cache was removed years earlier. This actor stitches together every remaining cache source (Wayback Machine, Archive.today, archive.org search) into one bulk pipeline that returns the closest snapshot to a target date, plus extracted text, plus the source URL of the archived copy — so SEO teams, journalists, and OSINT researchers stop manually pasting URLs into half a dozen archive sites.

Use case	Actor
Google CSE search API replacement	google-cse-replacement
goo.gl short URL resolver via Wayback	goo-gl-resolver
Alexa Rank replacement (site traffic)	alexa-rank-replacement
Wappalyzer / BuiltWith tech-stack detector	wappalyzer-replacement
Lighthouse + Core Web Vitals auditor	page-speed-analyzer
WCAG 2.2 accessibility auditor	wcag-accessibility-auditor
Bulk DNS A / MX / NS / TXT / CAA records	dns-records-lookup
WHOIS / RDAP replacement (any TLD)	whois-replacement
Bulk IP-to-country / city / ISP / ASN	ip-geolocation-replacement
Web scraping MCP for AI agents	web-scraping-mcp-server

Browse the full NexGenData catalog of 260+ actors at https://apify.com/nexgendata?fpr=2ayu9b

Wayback Machine Scraper

gio21/wayback-machine-scraper

List Internet Archive Wayback Machine snapshots for one or more URLs. Returns timestamp, snapshot URL, HTTP status, MIME type, digest. Useful for tracking website changes over time, OSINT research, content recovery, and brand monitoring.

Gio

Wayback Machine Snapshots Scraper — Internet Archive History

seemuapps/wayback-machine-snapshots-scraper

List every Internet Archive snapshot of a URL, page, or whole domain. Timestamp, snapshot URL, status code, mime type, content length. No login.

Andrew

Wayback Machine Scraper

glassventures/wayback-machine-scraper

Scrape Wayback Machine archive snapshots for any URL or domain. Get archived URLs, timestamps, status codes, MIME types. Export to JSON, CSV, Excel.

Glass Ventures

Internet Archive & Wayback Machine Scraper

cloud9_ai/internet-archive-scraper

Search Internet Archive and check Wayback Machine snapshots. Access 800B+ archived pages, books, movies, audio. Search items, get metadata, or check URL archive history. No API key needed. For SEO, OSINT, legal, and research.

cloud9

Wayback Machine Checker

automation-lab/wayback-machine-checker

This actor checks if URLs are archived in the Internet Archive Wayback Machine. It retrieves snapshot counts, oldest and newest archive dates, and direct links to archived versions. Uses both the Availability API and CDX API for comprehensive results.

Stas Persiianenko

Websites Archiver (Wayback Machine)

web.harvester/websites-archiver

Effortlessly archive any website with our Automated Website Archiving Tool. It leverages the power of the Wayback Machine at web.archive.org to ensure your sites are preserved for future reference.

Web Harvester

5.0

Wayback Machine Search

crawlerbros/wayback-machine-search

Query Internet Archive's Wayback Machine for historical snapshots of any URL or domain. Filter by date, HTTP status, MIME type, and deduplicate. Optionally fetch the archived page text. Free public CDX API, no authentication.

Crawler Bros

5.0

Wayback Machine Historical Content Scraper

happyfhantum/wayback-machine-historical-content-scraper

Compare archived website snapshots through the Wayback Machine and extract page-history change signals.

Kelsey Todd

4.0

Wayback Machine Archive Scraper

andok/wayback-machine-scraper

Fetch historical snapshots of any webpage from the Internet Archive. Perfect for digital forensics and tracking deleted content.

Andok

Internet Archive Search — Wayback Machine Advanced Query Tool

maged120/archive-org-advanced-search

Search the Internet Archive (archive.org) with full advanced filter support — date range, media type, language, subject, and more. Returns metadata from archived web pages, books, audio, and video.