Pricing

Pay per usage

Website Crawler

Crawls a website starting from one or more URLs and extracts the title, meta description, headings and text from each page.

Rating

0.0

(0)

Developer

elcon software

Actor stats

Last modified

a month ago

Website Crawler (Apify Actor)

A simple Apify actor built with Crawlee that crawls a website starting from one or more URLs and extracts, for each page:

url, title, description (meta), h1
all h1/h2/h3 headings
wordCount of the body text
depth and crawledAt

It uses CheerioCrawler (fast, HTTP-only, no browser) so it's cheap to run.
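
The per-page fields listed above could be modelled roughly as follows. This is a minimal sketch, not the Actor's actual source: the `PageRecord` and `ExtractedPage` types, the `headings` field name, and the `countWords` and `buildRecord` helpers are all hypothetical, and word counting is assumed to be a plain whitespace split.

```typescript
// Hypothetical shape of one dataset item, based on the fields listed in the README.
interface Heading {
  level: 1 | 2 | 3;
  text: string;
}

interface PageRecord {
  url: string;
  title: string;
  description: string; // content of the meta description tag
  h1: string;          // text of the first h1, or "" if the page has none
  headings: Heading[]; // all h1/h2/h3 headings (field name is an assumption)
  wordCount: number;   // number of words in the body text
  depth: number;       // 0 for start URLs
  crawledAt: string;   // ISO 8601 timestamp
}

// Values already pulled out of the HTML, for example with Cheerio selectors.
interface ExtractedPage {
  url: string;
  title: string;
  description: string;
  headings: Heading[];
  bodyText: string;
}

// Count whitespace-separated words; an empty or blank body counts as 0.
function countWords(bodyText: string): number {
  const trimmed = bodyText.trim();
  return trimmed === "" ? 0 : trimmed.split(/\s+/).length;
}

// Assemble the output record for one page.
function buildRecord(page: ExtractedPage, depth: number, now: Date = new Date()): PageRecord {
  const firstH1 = page.headings.find((h) => h.level === 1);
  return {
    url: page.url,
    title: page.title.trim(),
    description: page.description.trim(),
    h1: firstH1 ? firstH1.text : "",
    headings: page.headings,
    wordCount: countWords(page.bodyText),
    depth,
    crawledAt: now.toISOString(),
  };
}
```

Keeping the record assembly separate from the HTML parsing makes it possible to check the output shape without fetching any pages.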

Input

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| `startUrls` | array | required | URLs to start crawling from. |
| `maxRequestsPerCrawl` | integer | `50` | Hard cap on pages visited. |
| `maxCrawlDepth` | integer | `2` | How many links deep to follow (`0` = start URLs only). |
| `sameDomainOnly` | boolean | `true` | Only follow links on the start URL's hostname. |
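
To illustrate how the four input fields interact, the sketch below applies the documented defaults and then walks an in-memory link graph instead of making HTTP requests. It is an illustration under stated assumptions, not the Actor's code: `CrawlerInput`, `resolveInput`, `shouldFollow`, and `simulateCrawl` are hypothetical names, and `startUrls` entries are assumed to be objects with a `url` property (a common Apify convention; the table above only says "array").

```typescript
interface CrawlerInput {
  startUrls: { url: string }[]; // assumption: request-list style entries
  maxRequestsPerCrawl?: number;
  maxCrawlDepth?: number;
  sameDomainOnly?: boolean;
}

type ResolvedInput = Required<CrawlerInput>;

// Fill in the defaults documented in the input table.
function resolveInput(input: CrawlerInput): ResolvedInput {
  if (!Array.isArray(input.startUrls) || input.startUrls.length === 0) {
    throw new Error("startUrls is required and must contain at least one URL");
  }
  return {
    startUrls: input.startUrls,
    maxRequestsPerCrawl: input.maxRequestsPerCrawl ?? 50,
    maxCrawlDepth: input.maxCrawlDepth ?? 2,
    sameDomainOnly: input.sameDomainOnly ?? true,
  };
}

// Decide whether a link found on a page at `parentDepth` should be enqueued.
// Both URLs are assumed to be absolute.
function shouldFollow(link: string, startUrl: string, parentDepth: number, input: ResolvedInput): boolean {
  if (parentDepth + 1 > input.maxCrawlDepth) return false;
  if (!input.sameDomainOnly) return true;
  return new URL(link).hostname === new URL(startUrl).hostname;
}

// Breadth-first walk over a link graph (page URL -> outgoing links),
// standing in for real requests so the limits can be observed in isolation.
function simulateCrawl(
  links: Record<string, string[]>,
  rawInput: CrawlerInput,
): { url: string; depth: number }[] {
  const input = resolveInput(rawInput);
  const visited: { url: string; depth: number }[] = [];
  const seen = new Set<string>(input.startUrls.map((s) => s.url));
  const queue = input.startUrls.map((s) => ({ url: s.url, depth: 0, startUrl: s.url }));

  while (queue.length > 0 && visited.length < input.maxRequestsPerCrawl) {
    const current = queue.shift()!;
    visited.push({ url: current.url, depth: current.depth });
    for (const link of links[current.url] ?? []) {
      if (seen.has(link)) continue; // never enqueue the same URL twice
      if (!shouldFollow(link, current.startUrl, current.depth, input)) continue;
      seen.add(link);
      queue.push({ url: link, depth: current.depth + 1, startUrl: current.startUrl });
    }
  }
  return visited;
}
```

With the defaults, a start page at depth 0 can lead to pages at depth 1 and 2 on the same hostname, while links to other hostnames and anything deeper are skipped. Setting `maxCrawlDepth` to `0` visits only the start URLs.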

Run locally

npm install
npm run start:dev

Local input is read from storage/key_value_stores/default/INPUT.json. Results are written to storage/datasets/default/.
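
For a local run, the input file could be generated along these lines. This is only a sketch with illustrative values: the `sampleInput` and `inputJson` names are hypothetical, and the shape of each `startUrls` entry (an object with a `url` property) is an assumption, since the input table only specifies an array.

```typescript
// Hypothetical sample input; the values are illustrative, not recommendations.
const sampleInput = {
  // Assumption: each start URL is an object with a `url` property.
  startUrls: [{ url: "https://example.com" }],
  maxRequestsPerCrawl: 20, // documented default: 50
  maxCrawlDepth: 1,        // documented default: 2
  sameDomainOnly: true,    // documented default: true
};

// Pretty-printed JSON text to save as storage/key_value_stores/default/INPUT.json.
const inputJson: string = JSON.stringify(sampleInput, null, 2);
console.log(inputJson);
```

The optional fields can be left out of the file entirely, in which case the defaults from the Input section apply.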

Build

npm run build      # compiles src -> dist
npm run start:prod # runs the compiled actor

Deploy to Apify

In short, with the Apify CLI:

Install the CLI: npm install -g apify-cli
Log in: apify login
From this folder, push the Actor: apify push

Alternatively, connect this Git repo in the Apify Console (Actors → Create new → link Git repository) and Apify will build from the .actor/ config automatically.
