Deprecated

Pricing

$10.00 / 1,000 results

extract url website

API to scrape and extract all URLs from a specified website efficiently. It acts as a web crawler, retrieving internal and external links. Ideal for developers, marketers, and analysts collecting website data for SEO, research, or competitor analysis. Handles various web structures.
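The split between internal and external links can be illustrated with the standard library alone. This is a minimal sketch, not the Actor's own code: the helper name `classify_links` and the rule "same host means internal" are assumptions made for illustration.

```python
from urllib.parse import urljoin, urlparse


def classify_links(page_url, hrefs):
    """Split hrefs found on page_url into internal and external absolute URLs.

    Hypothetical helper: a link counts as internal when its host matches
    the host of the page it was found on.
    """
    base_host = urlparse(page_url).netloc.lower()
    internal, external = [], []
    for href in hrefs:
        absolute = urljoin(page_url, href)  # resolve relative links
        parsed = urlparse(absolute)
        if parsed.scheme not in ("http", "https"):
            continue  # skip mailto:, tel:, javascript: and similar
        target = internal if parsed.netloc.lower() == base_host else external
        if absolute not in target:  # drop duplicates, keep first-seen order
            target.append(absolute)
    return internal, external


internal, external = classify_links(
    "https://example.com/blog/",
    ["/about", "post-1", "https://other.example.org/x", "mailto:hi@example.com", "/about"],
)
print(internal)
print(external)
```

Treating subdomains as internal would need a looser host comparison; the strict match here is only the simplest choice.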

Rating

0.0 (0)

Developer

le bench

Last modified

3 months ago

Selenium & Chrome template

A template example built with Selenium and a headless Chrome browser to scrape a website and save the results to storage. The URL of the web page is passed in via input, which is defined by the input schema. The template uses the Selenium WebDriver to load and process the page. Enqueued URLs are stored in the default request queue. The data are then stored in the default dataset where you can easily access them.

Included features

Apify SDK for Python - a toolkit for building Apify Actors and scrapers in Python
Input schema - define and easily validate a schema for your Actor's input
Request queue - queues into which you can put the URLs you want to scrape
Dataset - store structured data where each object stored has the same attributes
Selenium - a browser automation library

How it works

This code is a Python script that uses Selenium to scrape web pages and extract data from them. Here's a brief overview of how it works:

The script reads the input data from the Actor instance, which is expected to contain a start_urls key with a list of URLs to scrape and a max_depth key with the maximum depth of nested links to follow.
The script enqueues the starting URLs in the default request queue and sets their depth to 1.
The script processes the requests in the queue one by one, loading each URL in headless Chrome through the Selenium WebDriver and reading the rendered page from it.
If the depth of the current request is less than the maximum depth, the script looks for nested links in the page and enqueues their targets in the request queue with an incremented depth.
The script extracts the desired data from the page (in this case, titles of each page) and pushes them to the default dataset using the push_data method of the Actor instance.
The script catches any exceptions that occur during the web scraping process and logs an error message using the Actor.log.exception method.
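The steps above can be sketched as a depth-limited crawl loop. This is a simplified stand-in that uses only the Python standard library, not the template's actual code: a `deque` plays the role of the request queue, a plain list plays the role of the dataset, and the hypothetical `load_page` callable stands in for loading a page with Selenium and returning its HTML. The function and class names are assumptions; only the `start_urls` and `max_depth` input keys come from the description.

```python
import logging
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageParser(HTMLParser):
    """Collects the <title> text and every <a href> value on a page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.hrefs = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def crawl(actor_input, load_page):
    """Crawl from start_urls down to max_depth and return the collected rows.

    load_page is a hypothetical callable (url -> HTML string) standing in
    for a Selenium page load.
    """
    start_urls = actor_input.get("start_urls", [])
    max_depth = actor_input.get("max_depth", 1)
    queue = deque((url, 1) for url in start_urls)  # starting depth is 1
    seen = set(start_urls)  # stand-in for the queue's URL deduplication
    dataset = []
    while queue:
        url, depth = queue.popleft()
        try:
            parser = PageParser()
            parser.feed(load_page(url))
            if depth < max_depth:
                for href in parser.hrefs:
                    target = urljoin(url, href)
                    if target not in seen:
                        seen.add(target)
                        queue.append((target, depth + 1))
            dataset.append({"url": url, "title": parser.title.strip()})
        except Exception:
            # Log the failure and continue with the next request.
            logging.exception("Cannot extract data from %s", url)
    return dataset


# Fake pages so the sketch runs without a browser or network access.
PAGES = {
    "https://example.com/": "<title>Home</title><a href='/a'>A</a>",
    "https://example.com/a": "<title>Page A</title><a href='/b'>B</a>",
    "https://example.com/b": "<title>Page B</title>",
}
results = crawl({"start_urls": ["https://example.com/"], "max_depth": 2}, PAGES.__getitem__)
print(results)
```

With `max_depth` set to 2, links found on the start page are followed, but links found on those second-level pages are not, so `https://example.com/b` is never visited.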

Resources

Selenium controlled Chrome example
Selenium Grid: what it is and how to set it up
Web scraping with Selenium and Python
Cypress vs. Selenium for web testing
Python tutorials in Academy
Video guide on getting scraped data using Apify API
A short guide on how to build web scrapers using code templates

Getting started

For complete information see this article. In short, you will:

Build the Actor
Run the Actor

Pull the Actor for local development

If you would like to develop locally, you can pull the existing Actor from Apify console using Apify CLI:

Install apify-cli

Using Homebrew

$ brew install apify-cli

Using NPM

$ npm -g install apify-cli

Pull the Actor by its unique <ActorId>, which is one of the following:
- unique name of the Actor to pull (e.g. "apify/hello-world")
- or ID of the Actor to pull (e.g. "E2jjCZBezvAZnX8Rb")
You can find both by clicking on the Actor title at the top of the page, which will open a modal containing both Actor unique name and Actor ID.

This command will copy the Actor into the current directory on your local machine.
```
$ apify pull <ActorId>
```

Documentation reference

To learn more about Apify and Actors, take a look at the following resources:
