Pricing: $7.00/month + usage

Universal Website Content Scraper

Extract page titles, meta descriptions, H1 to H3 headings, and clean main text from websites, with smart noise removal. Ideal for SEO audits, AI preprocessing, research, and automation.

Rating: 5.0 (1)

Developer: Techionik

Last modified: 2 months ago

PURPOSE

Extract structured content from general websites in a consistent and reusable format.

DATA EXTRACTED

Page title
Meta description
Headings (H1 to H3)
Main text content
Page URL

HOW IT WORKS

Starts from one or more provided URLs
Automatically detects the main content area
Removes navigation, footers, popups, and cookie banners
Extracts readable text using a smart fallback strategy
Optionally follows internal links with depth control
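
The extraction steps above can be illustrated with a minimal sketch. It is not the Actor's code: the Actor is built on Cheerio and Mozilla Readability, while this example uses only Python's standard `html.parser`. The noise-tag list, the choice of `<main>`/`<article>` as the main content area, the fallback to all visible body text, and the shape of each `headings` entry are assumptions made for illustration. Class-based noise such as popups and cookie banners is not handled here.

```python
from html.parser import HTMLParser

# Assumed noise and main-content tags; the Actor's real rules are not documented.
NOISE_TAGS = {"nav", "footer", "header", "aside", "script", "style", "noscript", "form"}
MAIN_TAGS = {"main", "article"}
HEADING_TAGS = {"h1", "h2", "h3"}


class PageExtractor(HTMLParser):
    """Collects title, meta description, H1-H3 headings, and visible text."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title = ""
        self.meta_description = ""
        self.headings = []      # assumed shape: {"level": "h1", "text": "..."}
        self.main_chunks = []   # text found inside <main>/<article>
        self.body_chunks = []   # all non-noise text, used as the fallback
        self._noise_depth = 0
        self._main_depth = 0
        self._in_title = False
        self._heading = None
        self._heading_text = []

    def handle_starttag(self, tag, attrs):
        if tag in NOISE_TAGS:
            self._noise_depth += 1
        elif tag in MAIN_TAGS:
            self._main_depth += 1
        elif tag == "title":
            self._in_title = not self.title  # keep only the first <title>
        elif tag == "meta":
            attr = dict(attrs)
            if (attr.get("name") or "").lower() == "description":
                self.meta_description = (attr.get("content") or "").strip()
        elif tag in HEADING_TAGS and self._noise_depth == 0:
            self._heading = tag
            self._heading_text = []

    def handle_endtag(self, tag):
        if tag in NOISE_TAGS and self._noise_depth > 0:
            self._noise_depth -= 1
        elif tag in MAIN_TAGS and self._main_depth > 0:
            self._main_depth -= 1
        elif tag == "title":
            self._in_title = False
        elif tag == self._heading:
            text = " ".join("".join(self._heading_text).split())
            self.headings.append({"level": tag, "text": text})
            self._heading = None

    def handle_data(self, data):
        if self._in_title:
            self.title += data
            return
        if self._noise_depth > 0:
            return  # skip navigation, footers, scripts, and similar noise
        text = data.strip()
        if not text:
            return
        if self._heading:
            self._heading_text.append(data)
        self.body_chunks.append(text)
        if self._main_depth > 0:
            self.main_chunks.append(text)


def extract_page(html, url):
    """Return one item using the field names listed under OUTPUT."""
    parser = PageExtractor()
    parser.feed(html)
    parser.close()
    # Fallback: prefer the detected main area, else all non-noise text.
    chunks = parser.main_chunks or parser.body_chunks
    return {
        "pageTitle": parser.title.strip(),
        "metaDescription": parser.meta_description,
        "headings": parser.headings,
        "mainText": "\n".join(chunks),
        "pageUrl": url,
    }
```

Calling `extract_page(html, "https://example.com/")` on a fetched page returns a single dictionary. Pages without a `<main>` or `<article>` element fall back to the remaining body text.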

INPUT OPTIONS

Start URLs

One or more URLs to begin scraping from

Crawl Links

Enable or disable link crawling

Max Enqueue Depth

Controls how deep link crawling goes

Same Domain Only

Restricts crawling to the starting domain

Max Requests per Crawl

Limits the number of pages processed per run

All inputs are configurable from the Apify Console.
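
To show how these options might interact, here is a small sketch of a breadth-first crawl plan. It is an assumption-laden illustration, not the Actor's implementation: the keyword arguments `crawl_links`, `max_depth`, `same_domain_only`, and `max_requests` are hypothetical names mirroring the options above, `get_links` is a hypothetical callback that returns the links found on a page, and "same domain" is interpreted here as an exact host match with the start URLs.

```python
from collections import deque
from urllib.parse import urldefrag, urljoin, urlparse


def plan_crawl(start_urls, get_links, *, crawl_links=True, max_depth=1,
               same_domain_only=True, max_requests=100):
    """Return the URLs that would be processed, in breadth-first order."""
    allowed_hosts = {urlparse(u).netloc for u in start_urls}
    queue = deque((u, 0) for u in start_urls)
    seen = set(start_urls)
    processed = []

    while queue and len(processed) < max_requests:
        url, depth = queue.popleft()
        processed.append(url)

        # Stop following links when crawling is off or the depth limit is hit.
        if not crawl_links or depth >= max_depth:
            continue

        for href in get_links(url):
            # Resolve relative links and drop "#fragment" parts.
            link = urldefrag(urljoin(url, href)).url
            if link in seen:
                continue
            if same_domain_only and urlparse(link).netloc not in allowed_hosts:
                continue
            seen.add(link)
            queue.append((link, depth + 1))

    return processed
```

With `crawl_links=False` only the start URLs are processed. Raising `max_depth` widens the crawl one link level at a time, and `max_requests` caps the total regardless of depth.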

OUTPUT

Each scraped page produces one dataset item containing:

pageTitle
metaDescription
headings
mainText
pageUrl

An overview table is included for quick browsing of page titles and URLs.
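
As a rough illustration of working with exported items, the sketch below builds a title and URL overview similar to the one described above. The field names `pageTitle` and `pageUrl` come from the list above. The helper names, the `(untitled)` placeholder, and the sample items are hypothetical.

```python
def overview_rows(items):
    """Return (title, url) pairs, one per dataset item."""
    return [(item.get("pageTitle") or "(untitled)", item["pageUrl"]) for item in items]


def format_overview(items):
    """Render the pairs as a plain-text table with aligned columns."""
    rows = [("Title", "URL")] + overview_rows(items)
    width = max(len(title) for title, _ in rows)
    return "\n".join(f"{title.ljust(width)}  {url}" for title, url in rows)


# Hypothetical sample items, trimmed to the two fields the overview needs.
sample_items = [
    {"pageTitle": "Home", "pageUrl": "https://example.com/"},
    {"pageTitle": "", "pageUrl": "https://example.com/a"},
]
print(format_overview(sample_items))
```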

TYPICAL USE CASES

Website content extraction
SEO and content audits
Research and data collection
AI and search preprocessing
Website archiving

TECHNOLOGY STACK

Apify SDK
Crawlee (CheerioCrawler)
Cheerio
Mozilla Readability

NOTES

Best suited for static and semi-static websites
Not intended for heavily JavaScript-rendered applications

STATUS

Simple, clean, and production-ready
