AI & ML News & Papers Scraper — Compliant Feed
Pricing
from $0.50 / 1,000 results
Scrape fresh AI/ML news & papers (HuggingFace, Anthropic, Google, the AI press) into structured JSON — a robots-compliant, self-healing, no-PII research feed for builders and funds.
Developer
Connor Teskey
Maintained by Community
Last modified: 2 days ago
Compliant AI & ML Research Radar
Never miss what's new in AI. A structured, always-fresh feed of new papers, lab announcements, and AI news — HuggingFace daily papers + blog, Anthropic and Google releases, and the AI press — ready to drop into a model, an alert, or a research dashboard.
For
Researchers, funds, and agent builders who want an early, legally clean signal on model releases, papers, and lab moves — without hand-maintaining a dozen scrapers.
The guarantees
- Compliant first. Every source is checked against `robots.txt` at runtime and skipped if disallowed. Public pages only and no personal data, so it is safe to run anywhere.
- Resilient. Titles are pulled by link text and URL shape rather than page-specific selectors, so site redesigns don't break it, and a source that suddenly yields nothing is flagged, not dropped.
- Fresh by design. Schedule it and the radar stays current.
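
The first two guarantees can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the actor's actual code: the function names (`is_allowed`, `extract_items`, `source_status`), the regex-based notion of "URL shape", the minimum title length, and the `flagged_empty` status string are all hypothetical. Only the standard library is used, and `robots.txt` is passed in as text so the sketch runs without network access.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if the given robots.txt text permits fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


class _AnchorCollector(HTMLParser):
    """Collect (href, visible text) pairs for every <a href=...> element."""

    def __init__(self) -> None:
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace inside the link text.
            text = " ".join("".join(self._text).split())
            self.links.append((self._href, text))
            self._href = None


def extract_items(html: str, base_url: str, path_pattern: str,
                  min_title_len: int = 15) -> list:
    """Keep links whose URL path matches path_pattern and whose text looks like a title.

    No CSS selectors are involved, so a layout change alone does not affect the result.
    """
    collector = _AnchorCollector()
    collector.feed(html)
    seen, items = set(), []
    for href, text in collector.links:
        url = urljoin(base_url, href)
        if not re.fullmatch(path_pattern, urlparse(url).path):
            continue  # wrong URL shape (navigation, login, and so on)
        if len(text) < min_title_len or url in seen:
            continue  # too short to be a title, or a duplicate link
        seen.add(url)
        items.append({"title": text, "url": url})
    return items


def source_status(items: list) -> str:
    """Flag a source that yields nothing instead of silently dropping it."""
    return "ok" if items else "flagged_empty"
```

A caller would check `is_allowed` before fetching a page, run `extract_items` on the response body, and record `source_status` alongside the results so an empty source stays visible in the run output.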
Output
| field | meaning |
|---|---|
| category | papers, blog, labs, news, … |
| title | paper / post title |
| url | canonical link |
| source | source domain |
| fetched_at | run timestamp (UTC) |
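
For consumers who want a typed view of one output row, the five fields above could be modelled as follows. This is a sketch, not the actor's schema: the `FeedItem` and `make_item` names are hypothetical, deriving `source` from the URL's host is an assumption, and the ISO 8601 rendering of `fetched_at` is an assumption (the listing only says the timestamp is UTC).

```python
from datetime import datetime, timezone
from typing import Optional, TypedDict
from urllib.parse import urlparse


class FeedItem(TypedDict):
    category: str    # e.g. "papers", "blog", "labs", "news"
    title: str       # paper / post title
    url: str         # canonical link
    source: str      # source domain
    fetched_at: str  # run timestamp in UTC (ISO 8601 here, by assumption)


def make_item(category: str, title: str, url: str,
              fetched_at: Optional[datetime] = None) -> FeedItem:
    """Build one output row; the source domain is taken from the URL's host."""
    when = fetched_at or datetime.now(timezone.utc)
    return FeedItem(
        category=category,
        title=title.strip(),
        url=url,
        source=urlparse(url).netloc,
        fetched_at=when.astimezone(timezone.utc).isoformat(),
    )
```

Passing an explicit `fetched_at` lets every row from one run share the same timestamp, which matches the "run timestamp" wording in the table.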
Input
- `sources`: `{ url, category }` list; defaults to a robots-clean set (HuggingFace papers & blog, Anthropic, Google AI, The Decoder).
- `maxItemsPerSource`: cap per source (default 25).
- `respectRobots`: keep `true`.
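
The input fields and their documented defaults can be illustrated with a small validation helper. This is a hedged sketch: `normalize_input` is a hypothetical name, the validation rules are assumptions, and the built-in default source list is deliberately left as `None` because the listing names the default sites but not their URLs.

```python
from typing import Any

DEFAULT_MAX_ITEMS_PER_SOURCE = 25  # documented default


def normalize_input(raw: dict) -> dict:
    """Apply the documented defaults and reject malformed source entries."""
    sources: Any = raw.get("sources")  # None means "use the built-in default set"
    if sources is not None:
        for entry in sources:
            if (not isinstance(entry, dict)
                    or not entry.get("url")
                    or not entry.get("category")):
                raise ValueError(f"each source needs a url and a category: {entry!r}")

    max_items = raw.get("maxItemsPerSource", DEFAULT_MAX_ITEMS_PER_SOURCE)
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(max_items, bool) or not isinstance(max_items, int) or max_items < 1:
        raise ValueError("maxItemsPerSource must be a positive integer")

    return {
        "sources": sources,
        "maxItemsPerSource": max_items,
        "respectRobots": bool(raw.get("respectRobots", True)),  # keep True
    }


# Example input with a placeholder URL (not one of the real default sources).
example = normalize_input(
    {"sources": [{"url": "https://example.com/blog", "category": "blog"}]}
)
```

Here `example` carries the single custom source, with `maxItemsPerSource` and `respectRobots` filled in from the defaults.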
Public sources, structured output, robots respected. No PII, no violations.