Mojeek News Scraper avatar

Mojeek News Scraper

Under maintenance

Pricing

from $4.99 / 1,000 results

Go to Apify Store
Mojeek News Scraper

Mojeek News Scraper

Under maintenance

Scrapes news articles from Mojeek News (mojeek.com/news). Extracts title, URL, source, category, publication time, and snippet.

Pricing

from $4.99 / 1,000 results

Rating

0.0

(0)

Developer

codingfrontend

codingfrontend

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

5 days ago

Last modified

Share

Extract news articles from Mojeek News — an independent, privacy-focused news aggregator powered by Mojeek's own search index. No Google. No Bing. No tracking.

Features

  • News articles — title, URL, source/publisher, category, published time, snippet, thumbnail
  • Pagination — automatically follows pages up to maxItems
  • Category filtering — filter by World, Business, Science, Technology, Entertainment & Arts, Sports, Health
  • Time parsing — converts relative times (e.g. "3 hours ago") to ISO 8601 timestamps
  • Source parsing — extracts publisher name and category from Mojeek's source line format
  • Deduplication — skips duplicate articles across pages
  • Geo-targeting — set region and language for localized news
  • Proxy support — works with Apify residential/datacenter proxies

Input Parameters

FieldTypeDefaultDescription
querystring(required)News search query
maxItemsinteger50Maximum number of articles to extract
categorystring""Category filter: world, business, science, technology, entertainment, sports, health
languagestring"en"Language code (e.g. en, de, fr)
regionstring""Region code (e.g. US, GB, IN)
proxyConfigurationobjectApify proxy config (recommended: residential)

Example INPUT.json

{
"query": "artificial intelligence",
"maxItems": 50,
"category": "technology",
"language": "en",
"region": "US",
"proxyConfiguration": { "useApifyProxy": true }
}

Output Fields

FieldTypeDescription
positionintegerRank in results (1-based)
titlestringArticle headline
urlstringFull article URL
sourcestringPublisher name (e.g. "BBC News")
sourceDomainstringPublisher domain
categorystringNews category (e.g. "World", "Technology")
publishedAtstringISO 8601 publication time (parsed from relative)
publishedAtRawstringRaw publication time as shown on page (e.g. "3 hours ago")
snippetstringArticle excerpt (when available)
thumbnailstringThumbnail image URL (when available)
searchQuerystringQuery that produced this result
regionstringRegion code used
languagestringLanguage code used
pageStartintegerPagination start offset for this result's page
scrapedAtstringISO 8601 scrape timestamp

Example Output

{
"position": 1,
"title": "New AI Model Beats Human Performance on Key Benchmarks",
"url": "https://www.bbc.com/news/technology-12345678",
"source": "BBC News",
"sourceDomain": "bbc.com",
"category": "Technology",
"publishedAt": "2025-05-01T09:00:00.000Z",
"publishedAtRaw": "3 hours ago",
"snippet": "Researchers have developed a new AI system that outperforms humans...",
"thumbnail": null,
"searchQuery": "artificial intelligence",
"region": null,
"language": "en",
"pageStart": 0,
"scrapedAt": "2025-05-01T12:00:00.000Z"
}

News Categories

Mojeek News organizes articles into the following categories:

CategoryDescription
worldInternational news
businessFinance, markets, economy
scienceScientific research and discoveries
technologyTech news and product launches
entertainmentArts, entertainment, culture
sportsSports news and results
healthHealth and medical news

Dataset Views

The dataset provides two views:

  1. News Results Overview — all key fields in a table (default)
  2. Articles By Source — grouped view showing publisher and domain

Pagination

Mojeek News paginates using the s parameter:

  • Page 1: s=0 (default)
  • Page 2: s=10
  • The scraper queues the next page automatically until maxItems is reached

Notes

  • Publication times shown as relative (e.g. "3 hours ago") are converted to ISO 8601 timestamps
  • Thumbnail availability depends on Mojeek's indexing of the source article
  • Use a residential proxy for best reliability at scale
  • Mojeek News indexes a variety of sources without filter bubble effects