Mojeek News Scraper
Under maintenancePricing
from $4.99 / 1,000 results
Go to Apify Store
Mojeek News Scraper
Under maintenanceScrapes news articles from Mojeek News (mojeek.com/news). Extracts title, URL, source, category, publication time, and snippet.
Pricing
from $4.99 / 1,000 results
Rating
0.0
(0)
Developer
codingfrontend
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
5 days ago
Last modified
Categories
Share
Extract news articles from Mojeek News — an independent, privacy-focused news aggregator powered by Mojeek's own search index. No Google. No Bing. No tracking.
Features
- News articles — title, URL, source/publisher, category, published time, snippet, thumbnail
- Pagination — automatically follows pages up to
maxItems - Category filtering — filter by World, Business, Science, Technology, Entertainment & Arts, Sports, Health
- Time parsing — converts relative times (e.g. "3 hours ago") to ISO 8601 timestamps
- Source parsing — extracts publisher name and category from Mojeek's source line format
- Deduplication — skips duplicate articles across pages
- Geo-targeting — set
regionandlanguagefor localized news - Proxy support — works with Apify residential/datacenter proxies
Input Parameters
| Field | Type | Default | Description |
|---|---|---|---|
query | string | (required) | News search query |
maxItems | integer | 50 | Maximum number of articles to extract |
category | string | "" | Category filter: world, business, science, technology, entertainment, sports, health |
language | string | "en" | Language code (e.g. en, de, fr) |
region | string | "" | Region code (e.g. US, GB, IN) |
proxyConfiguration | object | — | Apify proxy config (recommended: residential) |
Example INPUT.json
{"query": "artificial intelligence","maxItems": 50,"category": "technology","language": "en","region": "US","proxyConfiguration": { "useApifyProxy": true }}
Output Fields
| Field | Type | Description |
|---|---|---|
position | integer | Rank in results (1-based) |
title | string | Article headline |
url | string | Full article URL |
source | string | Publisher name (e.g. "BBC News") |
sourceDomain | string | Publisher domain |
category | string | News category (e.g. "World", "Technology") |
publishedAt | string | ISO 8601 publication time (parsed from relative) |
publishedAtRaw | string | Raw publication time as shown on page (e.g. "3 hours ago") |
snippet | string | Article excerpt (when available) |
thumbnail | string | Thumbnail image URL (when available) |
searchQuery | string | Query that produced this result |
region | string | Region code used |
language | string | Language code used |
pageStart | integer | Pagination start offset for this result's page |
scrapedAt | string | ISO 8601 scrape timestamp |
Example Output
{"position": 1,"title": "New AI Model Beats Human Performance on Key Benchmarks","url": "https://www.bbc.com/news/technology-12345678","source": "BBC News","sourceDomain": "bbc.com","category": "Technology","publishedAt": "2025-05-01T09:00:00.000Z","publishedAtRaw": "3 hours ago","snippet": "Researchers have developed a new AI system that outperforms humans...","thumbnail": null,"searchQuery": "artificial intelligence","region": null,"language": "en","pageStart": 0,"scrapedAt": "2025-05-01T12:00:00.000Z"}
News Categories
Mojeek News organizes articles into the following categories:
| Category | Description |
|---|---|
world | International news |
business | Finance, markets, economy |
science | Scientific research and discoveries |
technology | Tech news and product launches |
entertainment | Arts, entertainment, culture |
sports | Sports news and results |
health | Health and medical news |
Dataset Views
The dataset provides two views:
- News Results Overview — all key fields in a table (default)
- Articles By Source — grouped view showing publisher and domain
Pagination
Mojeek News paginates using the s parameter:
- Page 1:
s=0(default) - Page 2:
s=10 - The scraper queues the next page automatically until
maxItemsis reached
Notes
- Publication times shown as relative (e.g. "3 hours ago") are converted to ISO 8601 timestamps
- Thumbnail availability depends on Mojeek's indexing of the source article
- Use a residential proxy for best reliability at scale
- Mojeek News indexes a variety of sources without filter bubble effects