Webpage Links Extractor

Pricing

$10.00 / 1,000 pages scanned

Developer

Anthony Snider

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 2 days ago

Pull every link from a page, categorized as internal or external, with anchor text, rel attributes, and nofollow / sponsored / ugc flags.

Live on the Apify Store: run it instantly or call it as an agent tool via Apify MCP.

What you get

  • Every <a href> on the page, resolved to an absolute URL (respects <base href> and redirects)
  • Internal vs external classification by hostname
  • Clean anchor text (nested tags stripped, entities decoded, whitespace collapsed)
  • rel, target, and boolean nofollow / sponsored / ugc flags per link
  • Deduplicated links, plus mailto: and tel: listed separately
  • Per-page counts: total, internal, external, nofollow, sponsored, ugc, mailto, tel
  • Single URL or bulk list in one run
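
The features listed above can be illustrated with a minimal sketch using only the Python standard library. This is not the actor's implementation: the names `LinkCollector` and `extract_links` are hypothetical, the HTML is passed in as a string (so redirects are not handled), and "internal" is assumed to mean an exact hostname match with the page URL.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class LinkCollector(HTMLParser):
    """Collects <a href> links with anchor text and rel flags (illustrative only)."""

    def __init__(self, page_url):
        super().__init__(convert_charrefs=True)  # entities are decoded for us
        self.page_url = page_url
        self.base_url = page_url  # replaced if a <base href> is seen
        self.links = []
        self._current = None      # the link currently being read, or None
        self._text_parts = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "base" and attrs.get("href"):
            self.base_url = urljoin(self.page_url, attrs["href"])
        elif tag == "a" and attrs.get("href"):
            rel = attrs.get("rel")
            tokens = set((rel or "").lower().split())
            self._current = {
                "href": urljoin(self.base_url, attrs["href"].strip()),
                "rel": rel,
                "target": attrs.get("target"),
                "nofollow": "nofollow" in tokens,
                "sponsored": "sponsored" in tokens,
                "ugc": "ugc" in tokens,
            }
            self._text_parts = []

    def handle_data(self, data):
        # Only text inside an open <a> is kept; nested tags contribute text only.
        if self._current is not None:
            self._text_parts.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            # Collapse runs of whitespace into single spaces.
            self._current["text"] = " ".join("".join(self._text_parts).split())
            self.links.append(self._current)
            self._current = None


def extract_links(html, page_url):
    """Return deduplicated http(s) links, each tagged internal or external."""
    parser = LinkCollector(page_url)
    parser.feed(html)
    parser.close()
    page_host = urlsplit(page_url).hostname
    seen, result = set(), []
    for link in parser.links:
        parts = urlsplit(link["href"])
        # mailto:, tel: and other schemes are skipped in this sketch.
        if parts.scheme not in ("http", "https") or link["href"] in seen:
            continue
        seen.add(link["href"])
        # Assumption: "internal" means the hostname matches the page exactly.
        link["type"] = "internal" if parts.hostname == page_host else "external"
        result.append(link)
    return result
```

Calling `extract_links(html, "https://example.com/index.html")` returns a list of dictionaries shaped like the entries of `links` in the Output section; how the actor itself treats subdomains or redirects is not stated on this page.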

Input

{
  "url": "https://github.com",
  "maxUrls": 50
}

Or process many at once:

{
  "urls": ["https://github.com", "https://apify.com"]
}
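
As a rough sketch of how the two input shapes above might be built and submitted from Python: `build_run_input` and `run_actor` are hypothetical helper names, and the actor ID is left as a parameter because this page does not state it. The `apify-client` package is a third-party dependency, imported only when a run is actually started.

```python
def build_run_input(urls, max_urls=None):
    """Build the actor input: "url" for one page, "urls" for several."""
    # Strip whitespace, drop empties, and remove duplicates while keeping order.
    cleaned = list(dict.fromkeys(u.strip() for u in urls if u.strip()))
    if not cleaned:
        raise ValueError("at least one URL is required")
    run_input = {"url": cleaned[0]} if len(cleaned) == 1 else {"urls": cleaned}
    if max_urls is not None:
        run_input["maxUrls"] = max_urls
    return run_input


def run_actor(token, actor_id, run_input):
    """Start a run and return its dataset items (needs network access)."""
    from apify_client import ApifyClient  # third-party: pip install apify-client

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

For example, `run_actor(token, "<username>/<actor-name>", build_run_input(["https://github.com"], max_urls=50))` would submit the first input shown above; the actor ID here is a placeholder to replace with the real one from the Apify Store.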

Output

One dataset item per URL:

{
  "url": "https://github.com",
  "finalUrl": "https://github.com/",
  "statusCode": 200,
  "redirected": false,
  "counts": { "total": 142, "internal": 96, "external": 46, "nofollow": 8, "sponsored": 0, "ugc": 0, "mailto": 1, "tel": 0 },
  "links": [
    {
      "href": "https://github.com/features/copilot",
      "text": "GitHub Copilot",
      "type": "internal",
      "rel": null,
      "target": null,
      "nofollow": false,
      "sponsored": false,
      "ugc": false
    }
  ],
  "mailto": [{ "href": "mailto:press@github.com", "email": "press@github.com", "text": "Press" }],
  "tel": []
}
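
One way to consume a dataset item like the one above, sketched under the assumption that every entry in `links` carries the fields shown (`href`, `text`, `type`, and the three boolean flags). The function name `summarize_item` is hypothetical; it picks out external links that pass no `nofollow`, `sponsored`, or `ugc` hint, plus links with empty anchor text.

```python
def summarize_item(item):
    """Summarize one dataset item; field names follow the sample output."""
    links = item.get("links", [])
    external = [link for link in links if link["type"] == "external"]
    # External links with none of the three rel flags set.
    followed_external = [
        link["href"]
        for link in external
        if not (link["nofollow"] or link["sponsored"] or link["ugc"])
    ]
    # Links whose cleaned anchor text is empty (for example, image-only links).
    empty_anchor = [link["href"] for link in links if not link["text"]]
    return {
        "internal": len(links) - len(external),
        "external": len(external),
        "followed_external": followed_external,
        "empty_anchor": empty_anchor,
    }
```

The returned dictionary is a convenience for audits such as outbound-link reviews; it does not replace the `counts` object the actor already provides.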

Pricing

Pay-per-event: $0.01 per page scanned (page-scanned). No subscription, no idle charges.
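
The per-page price can be turned into a quick budget estimate. This is a minimal sketch that assumes every scanned page is billed as exactly one event at the rate quoted above; `estimate_cost` is a hypothetical helper, and `Decimal` is used to avoid floating-point rounding in currency amounts.

```python
from decimal import Decimal

PRICE_PER_PAGE = Decimal("0.01")  # USD per page scanned, from the pricing above


def estimate_cost(pages):
    """Estimated charge in USD for scanning the given number of pages."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    return PRICE_PER_PAGE * pages
```

At this rate, 1,000 pages come to $10.00, which matches the per-thousand price shown at the top of the page.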