Website Html Content

Pricing

$7.00 / 1,000 results

Website Html Content - Extract all the content from a blog post/article or website.

Rating: 0.0 (0)

Developer: Edge
Maintained by Community

Actor stats
Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 2 days ago

Website HTML Content API

Extract all content from a blog post, article, or any website

The Website HTML Content API allows you to extract all meaningful content from web pages, such as blog posts, recipes, news articles, and informational pages.

Perfect for n8n, Make, Zapier, Apify, no-code automations, AI content pipelines, dataset creation, SEO tools, and data repurposing workflows.

With a single request, the API returns:

📝 Title & full structured text
🖼️ Featured (hero) image
🖼️ All images inside the article
🔗 Original source URL
🧱 Content blocks (h1, h2, h3, p, strong, span)

🚀 How it Works

Send a request with the target URL and receive a clean JSON response containing structured text and images extracted from the article or page.

Request Example

    {
      "url": "https://example.com/article/sample-post"
    }
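
One way to send this input from a script is through Apify's generic `run-sync-get-dataset-items` endpoint. The sketch below only builds the HTTP request so it can be inspected before anything is sent. The actor ID and the token are placeholders that do not come from this page; replace them with the values shown in your own Apify console.

```python
import json
import urllib.parse
import urllib.request

# Assumptions (not taken from this page): both values below are placeholders.
ACTOR_ID = "username~website-html-content"  # hypothetical actor ID
API_TOKEN = "YOUR_APIFY_TOKEN"              # placeholder token


def build_run_request(
    target_url: str,
    actor_id: str = ACTOR_ID,
    token: str = API_TOKEN,
) -> urllib.request.Request:
    """Build (but do not send) a POST request whose body is the actor input."""
    query = urllib.parse.urlencode({"token": token})
    endpoint = (
        f"https://api.apify.com/v2/acts/{actor_id}"
        f"/run-sync-get-dataset-items?{query}"
    )
    # The request body is the same JSON object as in the request example.
    body = json.dumps({"url": target_url}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_run_request("https://example.com/article/sample-post")

# To actually run the actor (needs a valid token and actor ID):
# with urllib.request.urlopen(request) as response:
#     items = json.loads(response.read())
```

Building the request separately from sending it keeps the input easy to log or test, which can help when wiring the call into a larger automation.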

Response Example

    {
      "url": "https://example.com/article",
      "featuredImage": "https://example.com/uploads/featured.jpg",
      "articleImages": [
        "https://example.com/uploads/photo1.jpg",
        "https://example.com/uploads/photo2.jpg"
      ],
      "content": [
        { "level": "h1", "text": "Simple and Quick Pizza" },
        { "level": "p", "text": "A delicious and easy recipe for the whole family..." }
      ]
    }
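
If the response is consumed in Python, it can help to load it into small typed objects instead of passing raw dictionaries around. The following is a minimal sketch that assumes the response has exactly the fields shown in the example (`url`, `featuredImage`, `articleImages`, `content`). The class and function names are illustrative and are not part of the API.

```python
import json
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ContentBlock:
    level: str  # e.g. "h1", "h2", "h3", "p", "strong", "span"
    text: str


@dataclass
class ExtractedPage:
    url: str
    featured_image: Optional[str] = None
    article_images: List[str] = field(default_factory=list)
    content: List[ContentBlock] = field(default_factory=list)


def parse_result(raw: dict) -> ExtractedPage:
    """Map one response object onto the dataclasses above.

    Missing optional fields fall back to None or an empty list.
    """
    return ExtractedPage(
        url=raw["url"],
        featured_image=raw.get("featuredImage"),
        article_images=list(raw.get("articleImages", [])),
        content=[
            ContentBlock(level=block["level"], text=block["text"])
            for block in raw.get("content", [])
        ],
    )


sample = json.loads("""
{
  "url": "https://example.com/article",
  "featuredImage": "https://example.com/uploads/featured.jpg",
  "articleImages": [
    "https://example.com/uploads/photo1.jpg",
    "https://example.com/uploads/photo2.jpg"
  ],
  "content": [
    { "level": "h1", "text": "Simple and Quick Pizza" },
    { "level": "p", "text": "A delicious and easy recipe for the whole family..." }
  ]
}
""")

page = parse_result(sample)
headings = [block.text for block in page.content if block.level.startswith("h")]
```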

🔧 Key Features & Benefits

| Feature | Description |
| --- | --- |
| Full structured content extraction | Extracts headings, paragraphs, and text blocks |
| Image extraction | Featured image + all in-article images |
| Clean JSON output | Ideal for automations, AI, NLP, and SEO |
| Works on any website | Blogs, e-commerce, recipes, news, static sites |
| Compatible with automation tools | n8n, Make, Apify, Zapier, Airtable, Notion |
| Perfect for AI dataset building | Feed agents, RAG, or content generators |

🧠 Use Cases

Build structured datasets for AI training
Generate articles, eBooks, and summaries automatically
Convert webpages into Markdown or prepare them for PDF processing
Build video automations from text + images
Repurpose SEO content at scale
Scrape the web without messy HTML and ads
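
As an illustration of the Markdown use case, the `content` blocks from a response can be turned into Markdown with a few lines of client-side code. This is a sketch under the assumption that each block carries the `level` and `text` keys shown in the response example. The mapping from levels to Markdown syntax is a choice made here and is not something the API defines.

```python
from typing import Dict, List

# Assumed mapping from block level to a Markdown heading prefix.
HEADING_PREFIXES = {"h1": "# ", "h2": "## ", "h3": "### "}


def blocks_to_markdown(blocks: List[Dict[str, str]]) -> str:
    """Render content blocks as Markdown, one block per paragraph."""
    parts = []
    for block in blocks:
        text = block.get("text", "").strip()
        if not text:
            continue  # skip empty blocks
        level = block.get("level", "p")
        if level == "strong":
            parts.append(f"**{text}**")
        else:
            # Headings get a prefix; "p", "span", and unknown levels stay plain.
            parts.append(HEADING_PREFIXES.get(level, "") + text)
    return "\n\n".join(parts)


content = [
    {"level": "h1", "text": "Simple and Quick Pizza"},
    {"level": "p", "text": "A delicious and easy recipe for the whole family..."},
]
markdown = blocks_to_markdown(content)
```

The same loop is a natural place to add chunking or filtering if the text feeds a RAG pipeline instead of a Markdown file.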

🛠️ Tech Stack

Node.js
Apify SDK
Cheerio (HTML parser)
Axios

Automatic Markdown formatting

Automatic keyword extraction
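
The actor's source code is not shown here, so the following is only a rough illustration of the kind of block extraction a fetch-and-parse stack such as Axios plus Cheerio typically performs. It swaps in Python's standard `html.parser` as a stand-in for Cheerio, works on an HTML string that is already downloaded, and simplifies one detail: inline `strong` and `span` text is folded into the enclosing block rather than emitted as separate blocks.

```python
from html.parser import HTMLParser

# Tags treated as block-level content in this sketch (an assumption).
BLOCK_TAGS = {"h1", "h2", "h3", "p"}


class BlockExtractor(HTMLParser):
    """Collect heading/paragraph text and image URLs from an HTML string."""

    def __init__(self):
        super().__init__()
        self.blocks = []      # [{"level": ..., "text": ...}, ...]
        self.images = []      # image src values, in document order
        self._tag = None      # block tag currently being read, if any
        self._buffer = []     # text fragments of the current block

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS and self._tag is None:
            self._tag = tag
            self._buffer = []
        elif tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.images.append(src)

    def handle_data(self, data):
        # Text inside inline children (strong, span, a) lands here as well.
        if self._tag is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == self._tag:
            # Collapse runs of whitespace into single spaces.
            text = " ".join("".join(self._buffer).split())
            if text:
                self.blocks.append({"level": self._tag, "text": text})
            self._tag = None


html = (
    "<article><h1>Simple and Quick Pizza</h1>"
    '<img src="https://example.com/uploads/photo1.jpg">'
    "<p>A <strong>delicious</strong> and easy recipe.</p></article>"
)
extractor = BlockExtractor()
extractor.feed(html)
```

Because this approach parses the HTML as delivered by the server, content that a page renders only through client-side JavaScript would not appear in the output of a sketch like this one.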

⭐ Support & Customization

Need a custom version of this extractor (e.g., structured recipe model, article sections, NLP-only text)? Feel free to reach out — improvements are always welcome! 😄