Webpage Diff Checker avatar

Webpage Diff Checker

Pricing

Pay per event

Go to Apify Store
Webpage Diff Checker

Webpage Diff Checker

This actor compares text content between two web pages. It extracts visible text from both URLs, computes a line-by-line diff, and reports similarity percentage, added/removed lines, and a sample of differences. Useful for detecting content changes, comparing staging vs production, or A/B...

Pricing

Pay per event

Rating

0.0

(0)

Developer

Stas Persiianenko

Stas Persiianenko

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Categories

Share

Compare text content between two web page URLs and find differences.

What does Webpage Diff Checker do?

This actor compares text content between two web pages. It extracts visible text from both URLs, computes a line-by-line diff, and reports similarity percentage, added/removed lines, and a sample of differences. Useful for detecting content changes, comparing staging vs production, or A/B test validation.

You can compare multiple URL pairs in a single run. Each pair produces its own result with a similarity score, making it easy to identify which pages have changed and by how much.

Use cases

  • DevOps engineers -- compare staging vs production pages to verify deployments match before going live
  • Content managers -- detect unauthorized or unexpected page changes across a website
  • QA testers -- validate A/B test variants have the intended content differences
  • Migration teams -- verify content is preserved correctly after a site migration or CMS switch
  • Compliance officers -- monitor regulated pages (terms, privacy policy) for unauthorized edits
  • SEO teams -- detect unexpected content changes that may affect search rankings or featured snippets

Why use Webpage Diff Checker?

  • Batch comparison -- compare multiple URL pairs in a single run
  • Similarity scoring -- get a percentage-based similarity score for quick assessment
  • Line-level diffs -- see exactly which lines were added, removed, or changed
  • Title matching -- separately checks whether page titles match
  • Structured output -- machine-readable JSON with diff samples, ready for automated processing
  • Pay-per-event pricing -- cost-effective for both one-off checks and regular monitoring
  • Fast and lightweight -- HTTP-only requests with no browser overhead, so comparisons complete in seconds

Input parameters

ParameterTypeRequiredDefaultDescription
pairsobject[]Yes--Array of URL pair objects, each with url1 and url2 properties

Each object in the pairs array must have:

PropertyTypeDescription
url1stringFirst URL to compare
url2stringSecond URL to compare

Example input

{
"pairs": [
{
"url1": "https://example.com",
"url2": "https://www.example.com"
}
]
}

Output example

{
"url1": "https://example.com",
"url2": "https://www.example.com",
"isIdentical": true,
"similarityPercent": 100,
"titleMatch": true,
"addedLines": 0,
"removedLines": 0,
"diffSample": [],
"error": null,
"checkedAt": "2026-03-01T12:00:00.000Z"
}

How much does it cost?

EventPriceDescription
Start$0.035One-time per run
Pair compared$0.002Per URL pair compared

Example costs:

  • 5 pairs: $0.035 + 5 x $0.002 = $0.045
  • 50 pairs: $0.035 + 50 x $0.002 = $0.135
  • 500 pairs: $0.035 + 500 x $0.002 = $1.035

Using the Apify API

Node.js

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_TOKEN' });
const run = await client.actor('automation-lab/webpage-diff-checker').call({
pairs: [{ url1: 'https://example.com', url2: 'https://www.example.com' }],
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python

from apify_client import ApifyClient
client = ApifyClient('YOUR_TOKEN')
run = client.actor('automation-lab/webpage-diff-checker').call(run_input={
'pairs': [{'url1': 'https://example.com', 'url2': 'https://www.example.com'}],
})
items = client.dataset(run['defaultDatasetId']).list_items().items
for item in items:
print(f'Similarity: {item["similarityPercent"]}%')

Integrations

Webpage Diff Checker integrates with your existing workflow through the Apify platform. Connect it to Make (formerly Integromat), Zapier, or n8n to schedule regular comparisons and trigger alerts when differences are detected. Send notifications to Slack when pages diverge, log comparison results in Google Sheets for audit trails, or use webhooks to integrate diff checks into your CI/CD pipeline.

Common integration patterns include:

  • Deployment verification -- trigger a comparison after each deployment via webhook, comparing staging and production URLs automatically
  • Content monitoring -- schedule daily runs to detect changes to pricing pages, terms of service, or other regulated content
  • Audit logging -- push every comparison result to Google Sheets or a database to maintain a historical record of content changes

Tips and best practices

  • Schedule daily or weekly runs to monitor important pages for unexpected changes, such as pricing pages or legal documents.
  • Compare staging and production URLs before every deployment to catch content discrepancies early.
  • Check the diffSample field for a quick preview of what changed without downloading the full dataset.
  • Use similarity percentage thresholds in your automation to only trigger alerts when changes exceed a certain amount (e.g., less than 95% similarity).
  • Pair with webhooks to build a content change monitoring system that alerts your team in real time.

FAQ

Does it compare HTML or text? It compares visible text content extracted from the pages, not raw HTML. This gives a more meaningful comparison focused on what users actually see, ignoring markup differences that do not affect the displayed content.

Can I compare more than two pages at once? Yes. The pairs input accepts an array, so you can compare multiple URL pairs in a single run. Each pair produces its own result in the output dataset.

Does it handle dynamic content? The actor fetches pages via HTTP without a browser. Content that requires JavaScript rendering may not be captured. For most server-rendered pages, the comparison works well.

What does the diffSample field contain? The diffSample field contains a preview of the actual differences found between the two pages. Each entry shows the type of change (added or removed) and the affected text, giving you a quick summary without needing to compare the pages manually.

Can I use this for regression testing? Yes. Many teams use Webpage Diff Checker as part of their deployment pipeline to compare staging and production content. Pair it with scheduled runs or webhooks to automate regression detection after every release.

What does 100% similarity mean? A similarity of 100% means the visible text content of both pages is identical. The pages may still differ in HTML structure, styling, or scripts, but the text a user would see is the same.

Can I compare pages from different websites? Yes. The two URLs in each pair can be from different domains. This is useful for comparing mirrored content, syndicated articles, or localized versions of the same page.