News Article To Markdown
Pricing
from $50.00 / 1,000 result extracteds
Go to Apify Store
News Article To Markdown
Extract news articles as clean, ad-free Markdown with automatic author and publish date detection.
Pricing
from $50.00 / 1,000 result extracteds
Rating
0.0
(0)
Developer
Extreme Scrapes
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
3 days ago
Last modified
Share
Extract news articles as clean, ad-free Markdown with automatic author and publish date detection. Strips navigation, ads, related articles, comments, and social sharing widgets.
Features
- Clean extraction — removes nav, footer, ads, related articles, newsletters, comments, social widgets
- Author detection — automatically extracts author name from article header
- Date detection — automatically extracts publish date in various formats
- Image captions — generates alt text for images lacking captions
- Batch processing — extract multiple articles in a single run
- Works with any news site — BBC, CNN, Reuters, NYT, TechCrunch, etc.
How It Works
- Provide news article URLs as input.
- The Actor fetches each article, stripping all non-content elements.
- Author and publish date are extracted from the first 20 lines.
- Clean Markdown with metadata is stored in the Apify dataset.
Input
{"startUrls": [{ "url": "https://www.bbc.com/news/technology-67988517" },{ "url": "https://techcrunch.com/2024/01/15/some-article" }]}
Output
{"url": "https://www.bbc.com/news/technology-67988517","author": "Jane Smith","publishDate": "2024-01-15","markdown": "# Article Title\n\nArticle content..."}
Use Cases
- Media monitoring and news aggregation
- Build news datasets for AI training
- Track coverage of specific topics
- Feed news into summarization pipelines
Keywords
news scraper, article extractor, news to markdown, media monitoring, news parser, article scraper
Pricing
$50 per 1,000 article extractions.