Pricing

from $10.00 / 1,000 results

Go to Apify Store

Webpage Link Extractor

Try for free

Extract all links from webpages with optional depth crawling

Pricing

from $10.00 / 1,000 results

Rating

0.0

(0)

Developer

Donny

Actor stats

Bookmarked

Total users

Monthly active users

16 hours ago

Last modified

What does this actor do?

Webpage Link Extractor is an Apify actor that extract all links from webpages with optional depth crawling. It runs on the Apify platform and delivers structured data in JSON, CSV, or Excel formats that you can easily integrate into your workflows. For each item found, the actor extracts key data fields including source url, target url, anchor text, external, and more. All results are stored in an Apify dataset that you can download or connect to via the Apify API.

Why use this actor?

Manually collecting this data would be extremely time-consuming and error-prone. Webpage Link Extractor automates the entire process, saving you hours of manual work. This actor is ideal for data analysts, researchers, marketers, and developers who need reliable, structured data. You can schedule regular runs to keep your data fresh, integrate results directly into spreadsheets or databases, and scale your data collection without any coding required. The actor handles pagination, rate limiting, and data normalization automatically.

How does it work?

This actor uses the Cheerio HTTP scraping library to efficiently parse HTML pages from the target website. It sends lightweight HTTP requests without rendering JavaScript, making it fast and resource-efficient. The actor processes search results, follows pagination, and extracts structured data from each page using CSS selectors.

Input parameters

Parameter	Type	Description	Default
url	string	Starting URL to extract links from	None
maxDepth	integer	Maximum crawl depth (1 = only the starting page)	`1`
maxLinks	integer	Maximum number of links to extract	`1000`

Output fields

Each item in the output dataset contains the following fields:

Field	Description	Format
sourceUrl	Source URL	text
targetUrl	Target URL	text
anchorText	Anchor Text	text
isExternal	External	text
isNofollow	Nofollow	text

Example output:

{
  "sourceUrl": "Sample Source URL",
  "targetUrl": "Sample Target URL",
  "anchorText": "Sample Anchor Text",
  "isExternal": "Sample External",
  "isNofollow": "Sample Nofollow"
}

Cost and performance

This actor runs with a default memory allocation of 1024 MB. Using lightweight HTTP requests, each run typically costs around $0.10-0.25 in Apify platform credits per 1,000 results. A typical run processing 100 results completes in 1-3 minutes. You can reduce costs by limiting the number of results with the maxResults parameter and by scheduling runs during off-peak hours.

Tips and best practices

Start with a small number of results to test your configuration before scaling up.
Use the Apify scheduling feature to automate regular data collection runs.
Export results in the format that best fits your workflow: JSON for APIs, CSV for spreadsheets, or Excel for reports.
Connect this actor with other actors on the Apify platform for more comprehensive data pipelines.

Related actors you might find useful:

Crypto News Pro Scraper

buseta/crypto-news

Scrape crypto news from hundreds of resources all over the world! Get all the news about the market or your favorite cryptocurrency! Last Update: Feb 8, 2025

buseta

164

🔗✨ Link Extractor Pro: URL to HTML List Downloader

dainty_screw/link-extractor-pro-url-to-html-list-downloader

Maximize productivity with HTML URL List Downloader. Quickly extract, manage, and organize URLs from HTML pages. Ideal for SEO professionals and digital marketers. Streamline your workflow today!

codemaster devops

187

5.0

(1)

Video Download Link Crawler(fast and cheap)

thenetaji/video-download-link-crawler

Extract direct video download links from any webpage! Customize crawling with regex, export in multiple formats, and automate video collection. Fast, efficient, and cheap—try now!

The Netaji

127

5.0

(1)

Sitemap Generator - Crawl Website & Create XML Sitemap

scrappy_garden/sitemap-generator

Generate an XML sitemap for any website. Crawls internal pages from start URLs (with depth + page limits), deduplicates URLs, and stores a ready-to-submit sitemap.xml plus a structured dataset and summary for SEO audits.

Bikram Adhikari

Ai Ready Web Page To Markdown Converter

mustafa.irshaid.113/ai-ready-web-page-to-markdown-converter

Convert any webpage into structured Markdown and HTML using just a URL. Get the page title, link, and content—perfect for SEO, devs, and AI crawlers. Fast, clean, and ideal for repurposing or analysis. Start turning websites into Markdown instantly.

Mustafa Irshaid

Pinterest Audio Downloader

alpha-scraper/pinterest-audio-downloader

Extract audio URLs, thumbnails, titles, and metadata from public Pinterest posts in seconds. Pinterest Audio Downloader Pro delivers clean, structured datasets ready for automation, analysis, or media workflows on the Apify platform.

Alpha Scraper

5.0

(1)

Website Links Graph Generator

crawlerbros/web-link-graph-visualizer

Creates an oriented graph visualizing links between webpages. Outputs: graph.png (visual network diagram) and graph.json (structured data) saved to Key-Value Store, plus detailed dataset of all crawled pages. Configure depth, boundaries, and layout.

Crawler Bros

5.0

(7)

Cheerio Scraper

apify/cheerio-scraper

Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using a Node.js code. Supports both recursive crawling and lists of URLs. This actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.

Apify

13K

5.0

(20)

Puppeteer Scraper

apify/puppeteer-scraper

Crawls websites with the headless Chrome and Puppeteer library using a provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and list of URLs. Supports login to website.

Apify

12K

4.9

(17)

Playwright Scraper

apify/playwright-scraper

Crawls websites with the headless Chromium, Chrome, or Firefox browser and Playwright library using a provided server-side Node.js code. Supports both recursive crawling and a list of URLs. Supports login to a website.

Apify

5.1K

2.4

(12)