AI & ML News & Papers Scraper — Compliant Feed

Pricing

from $0.50 / 1,000 results

Scrape fresh AI/ML news & papers (HuggingFace, Anthropic, Google, the AI press) into structured JSON — a robots-compliant, self-healing, no-PII research feed for builders and funds.

Developer

Connor Teskey

Maintained by Community

Compliant AI & ML Research Radar

Never miss what's new in AI. A structured, always-fresh feed of new papers, lab announcements, and AI news — HuggingFace daily papers + blog, Anthropic and Google releases, and the AI press — ready to drop into a model, an alert, or a research dashboard.

For

Researchers, funds, and agent builders who want an early, legally clean signal on model releases, papers, and lab moves — without hand-maintaining a dozen scrapers.

The guarantees

  • Compliant first. Every source is checked against robots.txt at runtime and skipped if disallowed. Public pages only, no personal data — safe to run anywhere.
  • Resilient. Titles are identified by link text and URL shape rather than page-specific selectors, so most site redesigns should not break extraction. A source that suddenly yields nothing is flagged, not silently dropped.
  • Fresh by design. Schedule it and the radar stays current.
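
The "link text + URL shape" idea can be illustrated with a minimal sketch. This is not the Actor's actual code: the class `LinkCollector`, the function `extract_titles`, the `url_pattern` and `min_title_len` parameters, and the `flagged_empty` key are all hypothetical names chosen for illustration. It assumes a title is any link whose path matches a per-source pattern and whose visible text is long enough to be a headline.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect (href, visible text) pairs for every <a> element."""

    def __init__(self):
        super().__init__()
        self.links = []     # finished (href, text) pairs
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace so nested markup does not matter.
            text = " ".join("".join(self._text).split())
            self.links.append((self._href, text))
            self._href = None


def extract_titles(html, base_url, url_pattern, min_title_len=15):
    """Keep links whose path matches url_pattern and whose text looks like a title."""
    parser = LinkCollector()
    parser.feed(html)
    items, seen = [], set()
    for href, text in parser.links:
        url = urljoin(base_url, href)
        if url in seen or len(text) < min_title_len:
            continue
        if re.search(url_pattern, urlparse(url).path):
            seen.add(url)
            items.append({"title": text, "url": url})
    # An empty result is reported to the caller instead of being ignored.
    return {"items": items, "flagged_empty": not items}
```

Because nothing here depends on CSS classes or element nesting, a layout change that keeps the same link targets would leave the result unchanged, while a change that removes them surfaces as `flagged_empty`.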

Output

| field | meaning |
| --- | --- |
| category | papers, blog, labs, news, … |
| title | paper / post title |
| url | canonical link |
| source | source domain |
| fetched_at | run timestamp (UTC) |
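
As a rough illustration of working with these fields downstream, the sketch below groups records by `category`. The sample records, the `example.org` URLs, the ISO-style `fetched_at` strings, and the helper `group_by_category` are assumptions made for the example; the listing does not specify the exact timestamp format.

```python
from collections import defaultdict

# The five fields listed in the Output table.
REQUIRED_FIELDS = ("category", "title", "url", "source", "fetched_at")

# Hypothetical records shaped like the documented output.
sample_records = [
    {
        "category": "papers",
        "title": "An example paper title",
        "url": "https://example.org/papers/1",
        "source": "example.org",
        "fetched_at": "2024-01-01T00:00:00Z",  # assumed format
    },
    {
        "category": "news",
        "title": "An example news headline",
        "url": "https://example.org/news/1",
        "source": "example.org",
        "fetched_at": "2024-01-01T00:00:00Z",  # assumed format
    },
]


def group_by_category(records):
    """Group records by category, rejecting any record that lacks a documented field."""
    grouped = defaultdict(list)
    for record in records:
        missing = [name for name in REQUIRED_FIELDS if name not in record]
        if missing:
            raise ValueError(f"record is missing fields: {missing}")
        grouped[record["category"]].append(record)
    return dict(grouped)


grouped = group_by_category(sample_records)
```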

Input

  • sources: list of { url, category } objects; defaults to a robots-clean set (HuggingFace papers & blog, Anthropic, Google AI, The Decoder).
  • maxItemsPerSource: cap on items per source (default 25).
  • respectRobots: keep true so each source is checked against robots.txt before it is scraped.
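
To make the input shape and the robots check concrete, here is a minimal sketch using Python's standard `urllib.robotparser`. It is not the Actor's implementation: the helper `filter_sources`, the `robots_txt_by_host` mapping, the `example.org` URLs, and the choice to treat a host with no robots.txt text as allowed are all assumptions for illustration. Only the three input keys come from the listing.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# Hypothetical run input using the three documented keys.
run_input = {
    "sources": [
        {"url": "https://example.org/papers", "category": "papers"},
        {"url": "https://example.org/private/feed", "category": "news"},
    ],
    "maxItemsPerSource": 25,
    "respectRobots": True,
}

# robots.txt bodies keyed by host, supplied directly so the sketch runs offline.
robots_txt_by_host = {
    "example.org": "User-agent: *\nDisallow: /private/\n",
}


def filter_sources(run_input, robots_txt_by_host, user_agent="*"):
    """Split sources into (kept, skipped) according to each host's robots.txt."""
    if not run_input.get("respectRobots", True):
        return list(run_input["sources"]), []
    kept, skipped = [], []
    for source in run_input["sources"]:
        host = urlparse(source["url"]).netloc
        parser = RobotFileParser()
        # Assumption: a host without robots.txt text imposes no restrictions.
        parser.parse(robots_txt_by_host.get(host, "").splitlines())
        if parser.can_fetch(user_agent, source["url"]):
            kept.append(source)
        else:
            skipped.append(source)
    return kept, skipped


kept, skipped = filter_sources(run_input, robots_txt_by_host)
```

In this sketch the second source is skipped because its path falls under the `Disallow: /private/` rule, which mirrors the "skipped if disallowed" behavior described in the guarantees.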

Public sources, structured output, robots respected. No PII, no violations.