Tech Blog Aggregator
Pricing
Pay per usage
Go to Apify Store
Tech Blog Aggregator
Pricing
Pay per usage
Rating
0.0
(0)
Developer

Donny Nguyen
Maintained by Community
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
6 days ago
Last modified
Categories
Share
Aggregate articles from top technology blogs and news sources into a unified dataset. Supports multiple sources including Hacker News, dev.to, TechCrunch, The Verge, and Ars Technica.
Features
- Multi-source aggregation from the most popular tech news and blog platforms
- Structured data extraction including title, summary, author, date, tags, and URL
- Hacker News API integration for reliable top story extraction with point scores
- dev.to API support for developer community content with tags and reactions
- Configurable source selection to choose exactly which blogs to aggregate
- Deduplication of articles that appear across multiple sources
Use Cases
- Build a personalized tech news feed for your team or organization
- Monitor technology trends across multiple publications simultaneously
- Create datasets for NLP analysis of tech industry coverage
- Feed content into recommendation engines or content curation tools
- Track emerging technologies and developer tool announcements
Input Configuration
| Parameter | Type | Default | Description |
|---|---|---|---|
sources | array | ["hackernews", "devto"] | Blog sources to scrape |
maxArticles | integer | 200 | Maximum articles per source |
Output Format
Each article produces a dataset item with:
source- The originating blog/platform nametitle- Article titlesummary- Article summary or descriptionauthor- Author name when availabledate- Publication dateurl- Direct link to the articletags- Array of associated tags or categoriesscore- Engagement score (points, reactions) when availablescrapedAt- ISO timestamp of extraction
Supported Sources
Currently supported sources include Hacker News, dev.to, TechCrunch, The Verge, and Ars Technica. Additional sources can be added by providing the source name.
Limitations
- Some sources use heavy JavaScript rendering and may return limited data
- Rate limiting is automatically handled but may slow down large requests
- Article summaries are truncated to maintain reasonable dataset sizes