1import { Actor } from 'apify';
2import { CheerioCrawler } from 'crawlee';
3
// Initialize the Apify SDK — must run before any other Actor.* call.
await Actor.init();

// Actor input; fall back to an empty object when none was supplied.
const actorInput = await Actor.getInput() ?? {};

// Unpack the actor configuration with sensible defaults.
const {
  searchQueries = ['artificial intelligence'],
  language = 'en',
  country = 'US',
  maxArticlesPerQuery = 50,
  timeRange = 'week',
} = actorInput;

// Google time-filter codes keyed by the human-readable range name.
// NOTE(review): the qdr:* values are Google *web* search `tbs` codes;
// the RSS URL construction below never embeds them directly.
const timeRangeMap = {
  hour: 'qdr:h',
  day: 'qdr:d',
  week: 'qdr:w',
  month: 'qdr:m',
  year: 'qdr:y',
};
23
// Remove any embedded HTML tags from a text fragment.
const stripHtml = (text) => text.replace(/<[^>]*>/g, '');

// Convert an RSS pubDate (or any date string) to ISO-8601, or null when
// the value is missing or unparseable. The previous code called
// `new Date(pubDate).toISOString()` unguarded, which throws a RangeError
// on an Invalid Date and aborted the whole request handler — losing
// every article in the feed because of a single malformed date.
const toIsoDate = (raw) => {
  if (!raw) return null;
  const parsed = new Date(raw);
  return Number.isNaN(parsed.getTime()) ? null : parsed.toISOString();
};

const crawler = new CheerioCrawler({
  maxRequestsPerCrawl: searchQueries.length * 10,
  maxConcurrency: 3,
  requestHandlerTimeoutSecs: 30,

  /**
   * Handles one Google News search feed. Extracts articles from RSS
   * <item> elements, falls back to HTML article selectors when the
   * response was not an RSS document, then pushes up to
   * `maxArticlesPerQuery` results to the default dataset.
   */
  async requestHandler({ $, request, log }) {
    const { label, query, collected = 0 } = request.userData;
    if (label !== 'SEARCH') return;

    log.info(`Processing Google News search: ${query}`);

    const articles = [];

    // Primary path: RSS <item> elements (response parsed as XML).
    $('item').each((i, el) => {
      const $item = $(el);

      const title = $item.find('title').text().trim();
      const link = $item.find('link').text().trim();
      const pubDate = $item.find('pubDate').text().trim();
      const description = $item.find('description').text().trim();
      const source = $item.find('source').text().trim();
      const sourceUrl = $item.find('source').attr('url') || '';

      if (title && link) {
        articles.push({
          title: stripHtml(title),
          url: link,
          publishedAt: toIsoDate(pubDate),
          description: stripHtml(description).substring(0, 500),
          source,
          sourceUrl,
          query,
          scrapedAt: new Date().toISOString(),
        });
      }
    });

    // Fallback path: HTML article markup, used when no RSS items matched
    // (e.g. the response was a regular Google News HTML page).
    if (articles.length === 0) {
      $('article, [data-n-tid], .NiLAwe').each((i, el) => {
        const $article = $(el);
        const titleEl = $article.find('h3, h4, [role="heading"]').first();
        const linkEl = $article.find('a').first();
        const sourceEl = $article.find('.wEwyrc, .vr1PYe, time').first();
        const timeEl = $article.find('time').first();

        const title = titleEl.text().trim();
        const link = linkEl.attr('href');
        const source = sourceEl.text().trim();
        const datetime = timeEl.attr('datetime');

        if (title && link) {
          // Relative Google News links come as "./articles/..." — resolve
          // them against the news.google.com origin.
          const fullUrl = link.startsWith('http') ? link :
            link.startsWith('./') ? `https://news.google.com${link.substring(1)}` :
            `https://news.google.com${link}`;

          articles.push({
            title,
            url: fullUrl,
            publishedAt: datetime || null,
            source,
            query,
            scrapedAt: new Date().toISOString(),
          });
        }
      });
    }

    log.info(`Found ${articles.length} articles for "${query}"`);

    // Respect the per-query cap. `collected` is reserved for pagination
    // (currently always 0 — only one request is enqueued per query).
    // pushData accepts an array, so push all rows in a single call.
    await Actor.pushData(articles.slice(0, maxArticlesPerQuery - collected));
  },
});
101
102
// Map the human-readable `timeRange` input onto the `when:` operator
// that Google News RSS search actually understands (e.g. `when:7d`).
// The previous code computed a Google-web-search `tbs` code (`qdr:w`)
// but never used it, and instead appended `when:` plus the FIRST LETTER
// of the range ("when:w"), which Google News does not recognize.
const WHEN_BY_RANGE = {
  hour: '1h',
  day: '1d',
  week: '7d',
  month: '1m',
  year: '1y',
};

for (const query of searchQueries) {
  const when = WHEN_BY_RANGE[timeRange];

  // Build the RSS search URL with URL/searchParams so the query string
  // (including the optional `when:` operator) is encoded correctly,
  // instead of hand-concatenating query parameters.
  const rssUrl = new URL('https://news.google.com/rss/search');
  rssUrl.searchParams.set('q', when ? `${query} when:${when}` : query);
  rssUrl.searchParams.set('hl', language);
  rssUrl.searchParams.set('gl', country);
  rssUrl.searchParams.set('ceid', `${country}:${language}`);

  await crawler.addRequests([{
    url: rssUrl.toString(),
    userData: { label: 'SEARCH', query },
  }]);
}

await crawler.run();
await Actor.exit();