Website Link Graph & Outbound Links Crawler avatar

Website Link Graph & Outbound Links Crawler

Pricing

from $3.50 / 1,000 results

Go to Apify Store
Website Link Graph & Outbound Links Crawler

Website Link Graph & Outbound Links Crawler

Crawl an entire website and export every internal and outbound link as a graph edge β€” source page, target URL, anchor text, rel and nofollow/sponsored/UGC flags. Built for SEO internal-linking audits, outbound-link analysis and site-structure mapping. No login, no browser.

Pricing

from $3.50 / 1,000 results

Rating

0.0

(0)

Developer

Logiover

Logiover

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

17 hours ago

Last modified

Share

Website Link Graph & Outbound Links Crawler β€” Link Extractor πŸ•ΈοΈ

Extract every link from a website and map its link graph. This link extractor crawls an entire site and exports every internal and outbound link as a graph edge β€” the source page, the target URL, the anchor text, the link type (internal / subdomain / external) and the rel flags (nofollow, sponsored, ugc). Map a whole site's internal linking and outbound links from a single start URL. No login, no headless browser.

Need to extract all links from a website, run an internal linking SEO audit, analyze outbound / external links, or check nofollow attributes at scale? This actor builds the complete link graph.


✨ Key features

  • πŸ•·οΈ Full-site crawl β€” start from one URL and follow internal links across the whole domain.
  • πŸ”— Every link as an edge β€” source β†’ target with anchor text, for internal and external links.
  • 🏷️ Rel & attribute parsing β€” nofollow, sponsored, ugc flags and target="_blank".
  • 🎯 Scope control β€” export all links, only internal, or only outbound/external links.
  • ⚑ Faster & higher-volume than a link checker β€” it maps the graph without HTTP status checks, so it's ideal for big sites.

πŸ’‘ Use cases

  • Internal-linking SEO audits β€” see which pages link where and with what anchor text to optimize link equity.
  • Outbound / external link analysis β€” list every external site you link to and which links are nofollow.
  • Site-structure mapping β€” build a link graph for visualization, analysis or crawl-budget optimization.
  • Link cleanup & compliance β€” find sponsored/UGC links that should carry the right rel attributes.
  • Anchor-text analysis β€” audit the anchor text distribution across an entire site.

πŸ“¦ What you get

One row per link (edge):

FieldDescription
sourceUrlThe page the link is on
targetUrlThe link target (absolute URL)
targetDomainHostname of the target
anchorTextThe clickable text
linkTypeinternal, subdomain or external
relRaw rel attribute
isNofollow / isUgc / isSponsoredParsed rel flags
opensNewTabtrue if target="_blank"
crawledAtISO 8601 timestamp

Example output

{
"sourceUrl": "https://example.com/blog/seo-guide",
"targetUrl": "https://partner.com/tool",
"targetDomain": "partner.com",
"anchorText": "our favorite SEO tool",
"linkType": "external",
"rel": "nofollow sponsored",
"isNofollow": true,
"isSponsored": true,
"isUgc": false,
"opensNewTab": true,
"crawledAt": "2026-05-25T14:17:17.531Z"
}

πŸš€ How to use it

  1. Click Try for free / Start.
  2. Paste one or more website URLs into Start URLs.
  3. (Optional) Set Max pages to crawl β€” 0 for the whole site.
  4. (Optional) Choose Which links to export: all / internal / external.
  5. Click Save & Start.
  6. Export the link graph as JSON, CSV, Excel or via API.

βš™οΈ Input

OptionDescriptionDefault
startUrlsWebsites to crawl– (required)
maxPagesToCrawlMax pages per run (0 = whole site)1000
linkScopeall, internal or externalall
maxConcurrencyParallel requests10

Example input

{
"startUrls": [{ "url": "https://example.com" }],
"maxPagesToCrawl": 5000,
"linkScope": "all"
}

πŸ” How it works

The crawler follows internal links within the same domain as your Start URLs, and on each page records every <a href> as an edge with its anchor text and rel attributes. URLs are resolved to absolute and de-duplicated per page. Use linkScope to keep only internal or only outbound links. Pure HTTP β€” fast and cheap.

🧰 Tips & best practices

  • Use linkScope: "external" for a pure outbound-link/backlink-target report.
  • Use linkScope: "internal" to analyze internal link distribution and orphan-page risk.
  • Pivot on targetDomain to see which external domains you link to most.
  • Set maxPagesToCrawl to 0 for complete coverage of large sites.

❓ FAQ

What's the difference vs the Broken Link Checker? This actor maps the link graph (anchors, rel, internal/external) without checking HTTP status, so it's faster and produces higher volume. Use the Broken Link Checker when you specifically need to find dead links and status codes.

Does it detect nofollow / sponsored / ugc? Yes β€” the rel attribute is parsed into boolean flags.

Does it render JavaScript? No β€” it parses server-rendered HTML for speed and low cost.

How do I crawl the whole site? Set maxPagesToCrawl to 0.

What formats can I export? JSON, CSV, Excel, HTML and a full REST API.

  • Broken Link Checker β€” crawl a site and find dead links with HTTP status codes.
  • Website SEO Audit Crawler β€” full on-page SEO audit for every page.
  • Sitemap to URL Crawler β€” extract all URLs from any sitemap.xml.
  • Website to Markdown & Text Crawler β€” clean text + Markdown for AI / RAG.

Changelog

  • 2026-05-25 β€” Maintenance & reliability pass: pulled the latest source and rebuilt the Actor on the current base image; build verified.

Last reviewed: 2026-05-25.