Website Link Graph & Outbound Links Crawler
Pricing
from $3.50 / 1,000 results
Website Link Graph & Outbound Links Crawler
Crawl an entire website and export every internal and outbound link as a graph edge β source page, target URL, anchor text, rel and nofollow/sponsored/UGC flags. Built for SEO internal-linking audits, outbound-link analysis and site-structure mapping. No login, no browser.
Pricing
from $3.50 / 1,000 results
Rating
0.0
(0)
Developer
Logiover
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
17 hours ago
Last modified
Categories
Share
Website Link Graph & Outbound Links Crawler β Link Extractor πΈοΈ
Extract every link from a website and map its link graph. This link extractor crawls an entire site and exports every internal and outbound link as a graph edge β the source page, the target URL, the anchor text, the link type (internal / subdomain / external) and the rel flags (nofollow, sponsored, ugc). Map a whole site's internal linking and outbound links from a single start URL. No login, no headless browser.
Need to extract all links from a website, run an internal linking SEO audit, analyze outbound / external links, or check nofollow attributes at scale? This actor builds the complete link graph.
β¨ Key features
- π·οΈ Full-site crawl β start from one URL and follow internal links across the whole domain.
- π Every link as an edge β
source β targetwith anchor text, for internal and external links. - π·οΈ Rel & attribute parsing β
nofollow,sponsored,ugcflags andtarget="_blank". - π― Scope control β export all links, only internal, or only outbound/external links.
- β‘ Faster & higher-volume than a link checker β it maps the graph without HTTP status checks, so it's ideal for big sites.
π‘ Use cases
- Internal-linking SEO audits β see which pages link where and with what anchor text to optimize link equity.
- Outbound / external link analysis β list every external site you link to and which links are nofollow.
- Site-structure mapping β build a link graph for visualization, analysis or crawl-budget optimization.
- Link cleanup & compliance β find sponsored/UGC links that should carry the right
relattributes. - Anchor-text analysis β audit the anchor text distribution across an entire site.
π¦ What you get
One row per link (edge):
| Field | Description |
|---|---|
sourceUrl | The page the link is on |
targetUrl | The link target (absolute URL) |
targetDomain | Hostname of the target |
anchorText | The clickable text |
linkType | internal, subdomain or external |
rel | Raw rel attribute |
isNofollow / isUgc / isSponsored | Parsed rel flags |
opensNewTab | true if target="_blank" |
crawledAt | ISO 8601 timestamp |
Example output
{"sourceUrl": "https://example.com/blog/seo-guide","targetUrl": "https://partner.com/tool","targetDomain": "partner.com","anchorText": "our favorite SEO tool","linkType": "external","rel": "nofollow sponsored","isNofollow": true,"isSponsored": true,"isUgc": false,"opensNewTab": true,"crawledAt": "2026-05-25T14:17:17.531Z"}
π How to use it
- Click Try for free / Start.
- Paste one or more website URLs into Start URLs.
- (Optional) Set Max pages to crawl β
0for the whole site. - (Optional) Choose Which links to export: all / internal / external.
- Click Save & Start.
- Export the link graph as JSON, CSV, Excel or via API.
βοΈ Input
| Option | Description | Default |
|---|---|---|
startUrls | Websites to crawl | β (required) |
maxPagesToCrawl | Max pages per run (0 = whole site) | 1000 |
linkScope | all, internal or external | all |
maxConcurrency | Parallel requests | 10 |
Example input
{"startUrls": [{ "url": "https://example.com" }],"maxPagesToCrawl": 5000,"linkScope": "all"}
π How it works
The crawler follows internal links within the same domain as your Start URLs, and on each page records every <a href> as an edge with its anchor text and rel attributes. URLs are resolved to absolute and de-duplicated per page. Use linkScope to keep only internal or only outbound links. Pure HTTP β fast and cheap.
π§° Tips & best practices
- Use
linkScope: "external"for a pure outbound-link/backlink-target report. - Use
linkScope: "internal"to analyze internal link distribution and orphan-page risk. - Pivot on
targetDomainto see which external domains you link to most. - Set
maxPagesToCrawlto0for complete coverage of large sites.
β FAQ
What's the difference vs the Broken Link Checker? This actor maps the link graph (anchors, rel, internal/external) without checking HTTP status, so it's faster and produces higher volume. Use the Broken Link Checker when you specifically need to find dead links and status codes.
Does it detect nofollow / sponsored / ugc? Yes β the rel attribute is parsed into boolean flags.
Does it render JavaScript? No β it parses server-rendered HTML for speed and low cost.
How do I crawl the whole site? Set maxPagesToCrawl to 0.
What formats can I export? JSON, CSV, Excel, HTML and a full REST API.
π Related actors by the same author
- Broken Link Checker β crawl a site and find dead links with HTTP status codes.
- Website SEO Audit Crawler β full on-page SEO audit for every page.
- Sitemap to URL Crawler β extract all URLs from any sitemap.xml.
- Website to Markdown & Text Crawler β clean text + Markdown for AI / RAG.
Changelog
- 2026-05-25 β Maintenance & reliability pass: pulled the latest source and rebuilt the Actor on the current base image; build verified.
Last reviewed: 2026-05-25.