Backlink Opportunity Finder

Pricing

from $150.00 / 1,000 opportunities found

Find broken backlink opportunities at scale. Searches Google for your keywords, crawls ranking pages with an anti-detection browser, discovers dead external links (404/410), and extracts contact info for outreach. Pay only per keyword searched and opportunity found.

Rating

0.0

(0)

Developer

Muhammad Arif

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 4 days ago

🔗 Backlink Opportunity Finder

Automatically discover broken backlink opportunities at scale. Enter your target keywords, and the actor scrapes Google, crawls every ranking page, finds dead external links (404/410), and extracts contact info, all in one run.

Stop manually hunting for broken links. Get a ready-to-use outreach list with opportunity scores, broken URLs, anchor texts, and contact details.


What it does

Keywords → Google SERP → Crawl ranking pages → Find broken outbound links → Extract contacts → Scored opportunities
  1. Searches Google for your keywords using dedicated SERP proxies
  2. Crawls each ranking page with an anti-detection browser (Camoufox)
  3. Extracts all external dofollow links and checks their HTTP status
  4. Flags dead links (404, 410) as replacement opportunities
  5. Pulls contact info (emails, socials, contact pages) from each site
  6. Scores every opportunity (0–100) based on broken link count and contact availability
  7. Outputs only pages where broken links were found, so the results contain no noise
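
Steps 3 and 4 can be illustrated with a minimal sketch that uses only the Python standard library. It is not the actor's actual code: the class and function names are hypothetical, and treating `nofollow`, `sponsored`, and `ugc` as "not dofollow" is an assumption. The HTTP status check itself is left out; `is_broken` only classifies a status code that has already been fetched.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# rel values treated as "not dofollow" in this sketch (an assumption).
NOFOLLOW_RELS = {"nofollow", "sponsored", "ugc"}
# Matches the documented default of targetStatusCodes.
BROKEN_STATUS_CODES = {404, 410}


class LinkCollector(HTMLParser):
    """Collects href, rel and anchor text for every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._current = None

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            attr = dict(attrs)
            if attr.get("href"):
                self._current = {
                    "href": attr["href"],
                    "rel": attr.get("rel") or "",
                    "text": "",
                }

    def handle_data(self, data):
        if self._current is not None:
            self._current["text"] += data

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            self.links.append(self._current)
            self._current = None


def external_dofollow_links(html, page_url):
    """Return links that point to another host and carry no nofollow-style rel."""
    collector = LinkCollector()
    collector.feed(html)
    page_host = urlparse(page_url).hostname
    result = []
    for link in collector.links:
        url = urljoin(page_url, link["href"])
        parsed = urlparse(url)
        # Skip mailto:, javascript: and same-host links.
        if parsed.scheme not in ("http", "https") or parsed.hostname == page_host:
            continue
        if NOFOLLOW_RELS & set(link["rel"].lower().split()):
            continue
        result.append({"url": url, "anchorText": link["text"].strip()})
    return result


def is_broken(status_code, target_codes=BROKEN_STATUS_CODES):
    """A link counts as broken when its status is one of the target codes."""
    return status_code in target_codes
```

This sketch compares hostnames exactly, so `www.site.example` and `site.example` count as different hosts. A real crawler would likely compare registrable domains instead.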

Why this actor

  • Anti-detection: Uses Camoufox (hardened Firefox), not regular Playwright browsers. Passes bot detection on sites that block typical scrapers.
  • Pipeline architecture: SERP scraping and URL crawling run in parallel. Results start flowing as soon as the first keyword is scraped, with no waiting.
  • Smart proxy escalation: Starts without a proxy and auto-escalates to datacenter, then residential, only when a site blocks the request. This saves proxy costs.
  • Social/UGC filtering: Automatically skips Facebook, YouTube, Reddit, LinkedIn, X/Twitter, Instagram, TikTok, Pinterest, Wikipedia, and other social/UGC sites with no outreach value.
  • Outreach-ready output: Every result includes emails, social profiles, and contact page URLs extracted from the site footer.
  • Three output formats: Detailed (full data), Summary (condensed), CSV-friendly (flat, ready for spreadsheets).

Input example

{
  "keywords": ["best seo tools 2025", "link building strategies guide"],
  "googleDomain": "google.com",
  "resultsPerKeyword": 10,
  "searchLanguage": "en",
  "searchCountry": "us",
  "maxConcurrency": 5,
  "outputFormat": "detailed"
}

Only keywords is required. Everything else has sensible defaults.

Input reference

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| keywords | string[] | required | Search keywords to find opportunities for |
| googleDomain | string | google.com | Google domain (supports .co.uk, .de, .fr, .es, .it, .com.br, .ca, .com.au, .co.in, .co.jp) |
| resultsPerKeyword | integer | 10 | Google results to collect per keyword (1–100) |
| searchLanguage | string | en | Google search language code |
| searchCountry | string | us | Google search country code |
| maxConcurrency | integer | 5 | Parallel browser instances for crawling (1–20) |
| requestTimeout | integer | 30 | Page load timeout in seconds |
| maxPagesPerDomain | integer | 50 | Max pages to crawl per domain |
| skipDomains | string[] | [] | Additional domains to skip (social sites are auto-blocked) |
| checkExternalLinks | boolean | true | Check external links for broken status |
| linkCheckTimeout | integer | 10 | Timeout per link check in seconds |
| maxExternalLinksPerPage | integer | 100 | Max external links to check per page |
| targetStatusCodes | integer[] | [404, 410] | HTTP codes that count as "broken" |
| extractContactInfo | boolean | true | Extract emails and social links |
| extractEmails | boolean | true | Extract email addresses |
| extractSocialLinks | boolean | true | Extract social media profiles |
| forceProxy | boolean | false | Skip the no-proxy attempt and start with a datacenter proxy |
| outputFormat | string | detailed | Output format: detailed, summary, or csv-friendly |
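
Before starting a run, it can help to assemble the input locally and catch out-of-range values early. The sketch below is a hypothetical helper, not part of the actor: `build_input` is an invented name, the defaults and ranges are copied from the reference above, and the actor may validate its input differently.

```python
import copy

# Defaults as listed in the input reference.
DEFAULTS = {
    "googleDomain": "google.com",
    "resultsPerKeyword": 10,
    "searchLanguage": "en",
    "searchCountry": "us",
    "maxConcurrency": 5,
    "requestTimeout": 30,
    "maxPagesPerDomain": 50,
    "skipDomains": [],
    "checkExternalLinks": True,
    "linkCheckTimeout": 10,
    "maxExternalLinksPerPage": 100,
    "targetStatusCodes": [404, 410],
    "extractContactInfo": True,
    "extractEmails": True,
    "extractSocialLinks": True,
    "forceProxy": False,
    "outputFormat": "detailed",
}
# Only the two ranges the reference states explicitly.
RANGES = {"resultsPerKeyword": (1, 100), "maxConcurrency": (1, 20)}
OUTPUT_FORMATS = {"detailed", "summary", "csv-friendly"}


def build_input(keywords, **overrides):
    """Merge overrides into the documented defaults and validate them."""
    if not keywords:
        raise ValueError("keywords is required and must not be empty")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown input fields: {sorted(unknown)}")
    run_input = {"keywords": list(keywords), **copy.deepcopy(DEFAULTS), **overrides}
    for field, (low, high) in RANGES.items():
        if not low <= run_input[field] <= high:
            raise ValueError(f"{field} must be between {low} and {high}")
    if run_input["outputFormat"] not in OUTPUT_FORMATS:
        raise ValueError(f"outputFormat must be one of {sorted(OUTPUT_FORMATS)}")
    return run_input
```

For example, `build_input(["best seo tools 2025"], resultsPerKeyword=20)` returns a full input object with every other field at its default, while `maxConcurrency=50` raises a `ValueError`.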

Output example (detailed format)

{
  "keyword": "best seo tools 2025",
  "serpRank": 4,
  "opportunityScore": 70,
  "scrapedAt": "2026-02-24T14:02:07.153Z",
  "sourcePage": {
    "url": "https://backlinko.com/link-building-strategies",
    "title": "Link Building Strategies: The Complete List",
    "domain": "backlinko.com",
    "snippet": "Here are the best link building strategies for 2025..."
  },
  "brokenLinks": [
    {
      "#": 1,
      "brokenUrl": "https://example.com/dead-page",
      "anchorText": "keyword research tool",
      "statusCode": 404,
      "rel": "dofollow"
    }
  ],
  "brokenLinkCount": 5,
  "totalDofollowLinksOnPage": 56,
  "contact": {
    "emails": ["outreach@backlinko.com"],
    "socialLinks": [
      { "platform": "twitter", "url": "https://twitter.com/backlinko" }
    ],
    "contactPageUrl": "https://backlinko.com/contact"
  }
}

Output formats

Detailed

Full data with nested objects. Best for programmatic use and integrations.

Summary

Condensed view with top-level fields. First email and contact page only.

CSV-friendly

Flat structure with pipe-separated values. Export directly to CSV/Google Sheets for outreach campaigns.
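
If you receive detailed output and want a flat, pipe-separated layout yourself, a conversion along the following lines is one option. The flat column names (`sourceUrl`, `brokenUrls`, `anchorTexts`) are assumptions made for illustration. The actor's own csv-friendly format may use different columns.

```python
import csv
import io


def to_flat_row(item):
    """Flatten one detailed result into a single-level dict for CSV export."""
    contact = item.get("contact", {})
    broken = item.get("brokenLinks", [])
    return {
        "keyword": item["keyword"],
        "serpRank": item["serpRank"],
        "opportunityScore": item["opportunityScore"],
        "sourceUrl": item["sourcePage"]["url"],
        # Multi-valued fields are joined with a pipe so each row stays flat.
        "brokenUrls": "|".join(link["brokenUrl"] for link in broken),
        "anchorTexts": "|".join(link["anchorText"] for link in broken),
        "emails": "|".join(contact.get("emails", [])),
        "contactPageUrl": contact.get("contactPageUrl", ""),
    }


def to_csv(items):
    """Serialize flattened rows to CSV text with a header line."""
    rows = [to_flat_row(item) for item in items]
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

An anchor text that itself contains a pipe character would be ambiguous in this layout, so a real export might escape or replace it first.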


Opportunity scoring

Each result gets a score from 0 to 100:

| Condition | Points |
| --- | --- |
| Has broken links | +30 |
| More than 3 broken links | +20 |
| More than 10 broken links | +10 |
| Has any contact info | +20 |
| Has email address | +10 |
| Has contact page URL | +10 |

Higher score = easier outreach + more replacement opportunities.
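
The scoring table can be read as a simple additive rule. The function below is a sketch of that reading, not the actor's implementation. It assumes the conditions stack, and that "any contact info" means at least one email, social profile, or contact page URL.

```python
def opportunity_score(broken_link_count, emails=(), social_links=(), contact_page_url=None):
    """Add up the points from the scoring table (assumed to be cumulative)."""
    score = 0
    if broken_link_count > 0:
        score += 30  # has broken links
    if broken_link_count > 3:
        score += 20  # more than 3 broken links
    if broken_link_count > 10:
        score += 10  # more than 10 broken links
    if emails or social_links or contact_page_url:
        score += 20  # has any contact info (assumed definition)
    if emails:
        score += 10  # has email address
    if contact_page_url:
        score += 10  # has contact page URL
    return score
```

Under these assumptions, a page with 4 broken links and only a social profile scores 30 + 20 + 20 = 70, and a page with more than 10 broken links, an email address, and a contact page reaches the maximum of 100.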


Auto-blocked domains

These sites are automatically skipped (no outreach value):

Facebook, YouTube, LinkedIn, X/Twitter, Reddit, Instagram, TikTok, Pinterest, Quora, Medium, Tumblr, Threads, Snapchat, Discord, WhatsApp, Telegram, Wikipedia

Add more via the skipDomains input field.
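
As an illustration of how such a filter might combine the built-in list with `skipDomains`, here is a small sketch. The hostnames are assumptions, because the list above gives site names rather than domains. `is_skipped` is a hypothetical helper, and the actor's matching rules may differ.

```python
from urllib.parse import urlparse

# Hostnames assumed for the sites named in the auto-blocked list.
AUTO_BLOCKED = {
    "facebook.com", "youtube.com", "linkedin.com", "x.com", "twitter.com",
    "reddit.com", "instagram.com", "tiktok.com", "pinterest.com", "quora.com",
    "medium.com", "tumblr.com", "threads.net", "snapchat.com", "discord.com",
    "whatsapp.com", "telegram.org", "wikipedia.org",
}


def is_skipped(url, skip_domains=()):
    """True if the URL's host is a blocked domain or one of its subdomains."""
    host = (urlparse(url).hostname or "").lower()
    blocked = AUTO_BLOCKED | {domain.lower() for domain in skip_domains}
    return any(host == domain or host.endswith("." + domain) for domain in blocked)
```

Matching on a leading dot keeps `en.wikipedia.org` blocked without also blocking an unrelated host such as `notreddit.com`.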


How proxy works

| Phase | Proxy | Why |
| --- | --- | --- |
| Google SERP scraping | Dedicated SERP proxy | Required for Google; handles CAPTCHAs and rate limits |
| Website crawling (start) | No proxy | Most sites don't block, which saves cost |
| Website crawling (blocked) | Datacenter proxy | Auto-escalates on 403/429/503 |
| Website crawling (still blocked) | Residential proxy | Final escalation for heavily protected sites |

The actor tracks proxy level per domain. If example.com blocks you, it escalates only for that domain โ€” other sites keep running without proxy.
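
Per-domain escalation of this kind can be modelled as a small state machine. The sketch below is an illustration under stated assumptions, not the actor's code: `ProxyEscalator` is a hypothetical class, and it assumes that only 403, 429, and 503 responses trigger escalation and that `forceProxy` simply moves the starting level to datacenter.

```python
PROXY_LEVELS = ["none", "datacenter", "residential"]
BLOCK_STATUS_CODES = {403, 429, 503}


class ProxyEscalator:
    """Tracks the proxy level separately for each domain."""

    def __init__(self, force_proxy=False):
        # forceProxy skips the no-proxy attempt.
        self._start = 1 if force_proxy else 0
        self._level = {}

    def current(self, domain):
        """Proxy type to use for the next request to this domain."""
        return PROXY_LEVELS[self._level.get(domain, self._start)]

    def report(self, domain, status_code):
        """Record a response; return True if a retry at a higher level is possible."""
        if status_code not in BLOCK_STATUS_CODES:
            return False
        level = self._level.get(domain, self._start)
        if level + 1 >= len(PROXY_LEVELS):
            return False  # already on residential, nothing left to try
        self._level[domain] = level + 1
        return True
```

After `report("example.com", 403)`, `current("example.com")` returns `"datacenter"`, while `current("other.org")` still returns `"none"`.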


Use cases

  • Link building outreach: Find pages linking to dead resources in your niche, then pitch your content as a replacement
  • Competitor backlink analysis: Search for competitor brand keywords, find which sites link to dead competitor pages
  • Content gap discovery: Broken links reveal topics where content has disappeared; create fresh content to fill the gap
  • SEO agency workflows: Run weekly for client keywords, export CSV, feed into your outreach CRM
  • Niche research: Discover which sites in a niche have the most broken outbound links (low maintenance = high outreach success)

Pricing

Pay-per-event: you only pay for what you use. No monthly subscription.

| Event | Price | When it's charged |
| --- | --- | --- |
| keyword-search | $0.07 | Each keyword scraped from Google SERP |
| opportunity-found | $0.15 | Each page with broken links added to your results |

The actor start event is handled automatically by Apify at ~$0.00005 per start.

Cost examples

| Scenario | Keywords | Opportunities found | Total cost |
| --- | --- | --- | --- |
| Quick check | 5 | ~8 | ~$1.55 |
| Standard run | 20 | ~30 | ~$5.90 |
| Deep research | 50 | ~75 | ~$14.75 |
| Agency weekly | 100 | ~150 | ~$29.50 |

You set a spending limit per run, and the actor stops gracefully when it's reached. No surprise charges.
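
The cost examples follow directly from the two event prices, so a run can be estimated before it starts. This sketch uses hypothetical helper names and ignores the small actor start fee. `Decimal` is used to avoid floating-point rounding in currency arithmetic.

```python
from decimal import Decimal

KEYWORD_SEARCH = Decimal("0.07")     # per keyword scraped
OPPORTUNITY_FOUND = Decimal("0.15")  # per page with broken links


def estimate_cost(keywords, opportunities):
    """Estimated run cost in USD, excluding the actor start event."""
    return keywords * KEYWORD_SEARCH + opportunities * OPPORTUNITY_FOUND


def max_opportunities(budget, keywords):
    """How many opportunities fit in a budget once keyword searches are paid."""
    remaining = Decimal(str(budget)) - keywords * KEYWORD_SEARCH
    return max(0, int(remaining // OPPORTUNITY_FOUND))
```

For instance, `estimate_cost(5, 8)` gives 5 × $0.07 + 8 × $0.15 = $1.55, and with a $5 spending limit and 20 keywords, `max_opportunities(5, 20)` leaves room for 24 opportunities.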

Comparable SaaS tools (Ahrefs, SEMrush broken link reports) cost $99–$249/month. Standalone broken link checkers don't scrape SERP or extract contacts.


Tips for best results

  • Start with 5–10 keywords per run to gauge results, then scale up
  • Use long-tail keywords like "best [topic] resources list", because resource pages have more outbound links
  • Set resultsPerKeyword to 20–50 for deeper coverage
  • Use csv-friendly output format for direct import into Google Sheets or outreach tools
  • Run weekly on the same keywords, since new broken links appear constantly as sites go offline