Article Scraper — Extract Clean Text & Markdown
Created by
Tugelbay Konabayev
Scrape clean article text, title, author, and date from any URL as Markdown, HTML, or text. Bulk-extract for RAG, AI agents, and content monitoring.
Article Extraction APItugelbay/article-extractor
Title
Author
Published Date
Word Count
+3 fieldsTextNumberBooleanListObject
Input
URLs to extract(required)
url:https://blog.apify.com/what-is-web-scraping/
Output format:markdown
Max articles:10
Extract images:true
Extract links:false
Timeout per page (seconds):30
Max concurrency:5
Output fields
Title
Author
Published Date
Word Count
Language
Site Name
URL
Sign up on Apify01
Create your Apify account to access the Article Extraction API.
Start the run02
The Actor will start running based on the input automatically.
Receive the output03
Monitor the progress in real-time. You will be notified as soon as your dataset is complete and ready for review.
Integrate into your workflow04
The final output is delivered in JSON, CSV, or Excel format, ready to be plugged into your workflow.
