Arab News Aggregator - منصة رصد الأخبار avatar

Arab News Aggregator - منصة رصد الأخبار

Pricing

from $0.90 / 1,000 articles

Go to Apify Store
Arab News Aggregator - منصة رصد الأخبار

Arab News Aggregator - منصة رصد الأخبار

Aggregate and extract full-text news articles from 25+ major Arab news sources across the Middle East and North Africa. Get structured article data including title, full text, authors, publish date, and more. Select individual sources via input toggles.

Pricing

from $0.90 / 1,000 articles

Rating

0.0

(0)

Developer

Zen Studio

Zen Studio

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Collect and extract full-text news articles from 25+ major Arab news sources across the Middle East and North Africa. Get structured article data including title, full text, authors, publish date, and more - updated with every run.

No login required. No API keys to manage. Just set your filters and start collecting articles.

Why Arab News?

The Arab world generates a massive volume of daily news content across hundreds of outlets in Arabic, English, and French. This actor provides a single, unified feed covering the region's most important news sources, enabling:

  • Media monitoring: Track news coverage across the Arab world in real time
  • Sentiment analysis: Feed structured article text into NLP pipelines
  • Market intelligence: Monitor business, economic, and political developments
  • Academic research: Collect large-scale Arabic news corpora
  • Content aggregation: Build news feeds or dashboards for MENA coverage

Quick Start

Default: Get 100 Recent Articles

{
"limit": 100
}

Search for a Topic

{
"search_query": "الذكاء الاصطناعي",
"limit": 100
}

Articles from a Date Range

{
"start_date": "2026-02-01",
"end_date": "2026-02-15",
"limit": 500
}

Combined: Search Within a Date Range

{
"search_query": "oil prices",
"start_date": "2026-02-01",
"limit": 200
}

Select Specific Sources

{
"limit": 100,
"al_jazeera": true,
"al_arabiya": true,
"sky_news_arabia": true,
"asharq_al_awsat": false,
"cnn_arabic": false
}

Unlimited (All Available Articles)

{
"limit": 0
}

Input Parameters

ParameterTypeRequiredDefaultDescription
search_querystringNo-Filter articles by keyword. Only articles whose title, content, or URL contain this keyword will be returned.
start_datedateNo-Only include articles published on or after this date (YYYY-MM-DD).
end_datedateNo-Only include articles published on or before this date (YYYY-MM-DD).
limitintegerNo100Maximum number of articles to return. Set to 0 for unlimited. Max: 10,000.

Filtering Details

  • Search query matches against article title and full text (case-insensitive). During discovery, it also pre-filters on title metadata when available. Articles that can't be ruled out (missing metadata) are kept for full analysis.
  • Date filters apply to the article's date_publish field. Articles without a publish date are included (can't be ruled out).
  • Filters are applied in two stages: first as a pre-filter on discovery metadata (fast, avoids unnecessary downloads), then as a post-filter on extracted full content (thorough, checks title + text).
  • When using search_query, the actor may process more articles in the background than the final result count, since articles are filtered after extraction.

Source Selection

Each news source can be individually enabled or disabled via a toggle. All sources are enabled by default. Set a source to false to exclude it from the run.

See the Supported News Sources section below for the full list of available source keys.

Supported News Sources

The actor monitors 25+ news outlets across 8 countries, covering politics, business, sports, culture, and more.

Pan-Arab / International (عربي دولي)

SourceDomainSummaryLanguage
Asharq Al-Awsat (الشرق الأوسط)aawsat.comLondon-based pan-Arab daily owned by Saudi Research and Media Group.Arabic
Al-Quds Al-Arabi (القدس العربي)alquds.co.ukIndependent pan-Arab daily published in London since 1989.Arabic
The New Arab (العربي الجديد)alaraby.co.ukPan-Arab media platform covering news, opinion, and culture.Arabic
CNN Arabic (سي إن إن عربي)arabic.cnn.comArabic edition of the CNN international news network.Arabic

Egypt (مصر)

SourceDomainSummaryLanguage
Al-Masry Al-Youm (المصري اليوم)almasryalyoum.comLeading independent Egyptian daily covering politics, economy, and society.Arabic
Masrawy (مصراوي)masrawy.comPopular Egyptian news portal with broad lifestyle and entertainment coverage.Arabic
Shorouk News (الشروق)shorouknews.comEgyptian newspaper covering politics, sports, arts, and local affairs.Arabic

UAE (الإمارات)

SourceDomainSummaryLanguage
Sky News Arabia (سكاي نيوز عربية)skynewsarabia.com24-hour Arabic news channel headquartered in Abu Dhabi.Arabic
Al Arabiya (العربية)alarabiya.netSaudi-owned Arabic news channel based in Dubai Media City.Arabic
Al Khaleej (الخليج)alkhaleej.aeLong-running Sharjah-based daily, one of the UAE's leading Arabic papers.Arabic
24.ae (٢٤)24.aeUAE digital news outlet covering local and regional stories.Arabic
Emarat Al Youm (الإمارات اليوم)emaratalyoum.comUAE national daily focused on local Emirati news and affairs.Arabic

Saudi Arabia (السعودية)

SourceDomainSummaryLanguage
Al Riyadh (الرياض)alriyadh.comRiyadh-based daily and one of Saudi Arabia's most widely read newspapers.Arabic
Sabq (سبق)sabq.orgSaudi Arabia's most popular online-only news outlet.Arabic
Okaz (عكاظ)okaz.com.saJeddah-based daily covering national news, sports, and culture.Arabic

Qatar (قطر)

SourceDomainSummaryLanguage
Al Jazeera (الجزيرة)aljazeera.netDoha-based international news network with global Arabic-language reach.Arabic
Al Watan (الوطن)al-watan.comQatari daily newspaper covering local and regional news.Arabic
Al Raya (الراية)raya.comQatari daily offering broad coverage of politics, sports, and society.Arabic

Jordan (الأردن)

SourceDomainSummaryLanguage
Al Rai (الرأي)alrai.comJordan's largest-circulation daily and semi-official national newspaper.Arabic

Iraq (العراق)

SourceDomainSummaryLanguage
Azzaman (الزمان)azzaman.comIndependent Iraqi daily published in Baghdad, London, and several other cities.Arabic
Sot al-Iraq (صوت العراق)sotaliraq.comIraqi news website covering domestic politics and current affairs.Arabic
Al Mada (المدى)almadapaper.netIndependent Iraqi daily known for cultural and investigative reporting.Arabic

Kuwait (الكويت)

SourceDomainSummaryLanguage
Al Anba (الأنباء)alanba.com.kwMajor Kuwaiti daily covering local, Gulf, and international news.Arabic

Oman (عُمان)

SourceDomainSummaryLanguage
Al Watan (الوطن)alwatan.omOman's leading Arabic daily covering national and regional affairs.Arabic

Output Example

Each article is returned as a structured object with full text and metadata:

{
"url": "https://www.aljazeera.net/news/2026/2/15/example-article",
"article_title": "Example article headline",
"authors": ["Author Name"],
"date_publish": "2026-02-15T10:30:00",
"article_description": "Brief summary of the article content.",
"text": "Full article text extracted from the page...",
"language": "ar",
"image_url": "https://www.aljazeera.net/image.jpg",
"source_domain": "www.aljazeera.net",
"error": null
}

Output Fields

FieldTypeDescription
urlstringFull URL of the article
article_titlestringArticle headline
authorsarrayList of author names
date_publishstringPublication date (ISO 8601)
article_descriptionstringArticle summary / meta description
textstringFull article text
languagestringLanguage code (e.g. ar, en, fr)
image_urlstringURL of the main article image
source_domainstringDomain name of the news source
errorstringError message, or null if successful

Failed Articles

Articles that can't be extracted (paywall, timeout, etc.) are returned with an error:

{
"url": "https://example.com/article",
"error": "Failed to download article"
}

Pricing: Pay As You Go

This actor uses Apify's pay-per-event billing model.

EventCostDescription
Actor start$0.0002Charged once per run
Article$0.99 / 1,000 articlesCharged per successfully extracted article

Key billing details:

  • Only successful articles are charged: Failed extractions are free
  • No minimum commitment: Pay only for what you use
  • Budget controls: Set spending limits in your Apify account
  • Discounts available: Gold, Silver and Bronze members of Apify receive discounts.

API Integration

Call via Apify API

curl -X POST "https://api.apify.com/v2/acts/zen-studio~arab-news-collector/runs?token=YOUR_API_TOKEN" \
-H "Content-Type: application/json" \
-d '{"limit": 100, "search_query": "oil prices", "start_date": "2026-02-01"}'

Python

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("zen-studio/arab-news-collector").call(
run_input={
"limit": 100,
"search_query": "oil prices",
"start_date": "2026-02-01",
}
)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
if item.get("error"):
continue
print(f"[{item['source_domain']}] {item['article_title']}")
print(f" {item['url']}")
print(f" Published: {item['date_publish']}")
print()

JavaScript / Node.js

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });
const run = await client.actor('zen-studio/arab-news-collector').call({
limit: 100,
search_query: 'oil prices',
start_date: '2026-02-01',
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items
.filter(item => !item.error)
.forEach(item => {
console.log(`[${item.source_domain}] ${item.article_title}`);
console.log(` ${item.url}`);
});

Use Cases

  • News monitoring dashboards: Aggregate headlines from across the Arab world into a single feed for journalists, analysts, or PR teams
  • NLP & sentiment analysis: Extract full-text Arabic articles at scale for training language models or running sentiment classifiers
  • Competitive intelligence: Track press coverage of companies, brands, or industries across MENA media
  • Academic research: Build Arabic news corpora for linguistics, political science, or media studies
  • Risk & compliance: Monitor political and economic developments that may impact business operations in the region

Advanced Usage

Scheduled Monitoring

Set up recurring runs to build a continuous news archive:

  1. Go to the actor's Schedules tab
  2. Create a new schedule (e.g., every 6 hours)
  3. Configure your input with the desired limit
  4. Results accumulate in datasets for historical analysis

Integration with Google Sheets

Export results directly to Google Sheets:

  1. Run the actor
  2. Go to the dataset
  3. Click Export > Google Sheets

Technical Details

How It Works

  1. Discovery: The actor queries the news sources to find article URLs
  2. Pre-filtering: Articles are filtered early based on discovery metadata (date, title keyword) to skip irrelevant downloads
  3. Deduplication: Duplicate URLs across sources are removed, and articles are interleaved round-robin across sources for fair distribution
  4. Extraction: Each article is downloaded and parsed in batches to extract structured content (title, text, authors, date, etc.)
  5. Post-filtering: Extracted articles are filtered again on full content (date + keyword search in title and text)
  6. Output: Results are pushed to your Apify dataset

Performance

  • Batched processing: Large runs are automatically chunked for reliability
  • Fault tolerance: If one source or batch fails, others continue processing
  • Memory: 256-512 MB per run

Error Handling

Error TypeBehavior
Source unavailableSkipped, other sources continue
Article download failedReturned with error message
Rate limit (429)Automatic retry with exponential backoff
Server error (5xx)Automatic retry (3 attempts)

Data Export

Results can be exported in multiple formats:

  • JSON - Full structured data
  • CSV - Flattened for spreadsheets
  • Excel - Direct .xlsx download
  • XML - For legacy system integration
# JSON
curl "https://api.apify.com/v2/datasets/DATASET_ID/items?format=json&token=YOUR_TOKEN"
# CSV
curl "https://api.apify.com/v2/datasets/DATASET_ID/items?format=csv&token=YOUR_TOKEN"

FAQ

How many articles does each run return?

It depends on what the sources have published recently. A typical run with limit: 0 returns several hundred to a few thousand articles from across all sources.

How often is the source list updated?

Sources are curated and updated periodically. If you'd like to suggest a source, open an issue on the actor's repository.

Does the actor store historical articles?

No. Each run discovers currently available articles from source feeds. To build a historical archive, set up scheduled runs.

What languages are the articles in?

Most articles are in Arabic. Article language is indicated in the language field.

Can I filter by keyword or date?

Yes. Use search_query to filter articles by keyword (matches title and full text). Use start_date and end_date to restrict articles to a date range. These filters can be combined.

Can I filter by source or country?

Yes. Each source has a toggle in the input (e.g., al_jazeera, al_ahram). Set any source to false to exclude it. All sources are enabled by default. You can also filter the output dataset by source_domain after the run completes.

What happens if a news source goes offline?

The actor continues processing all other sources. Failed sources are logged but don't stop the run.

When search_query is set, the actor needs to download and extract articles to check their full content against your keyword. The limit controls how many matching articles you receive, but more articles may be processed behind the scenes.

Disclaimer

This actor extracts publicly available news articles from public web pages. It does not bypass paywalls, authentication, or access restrictions. Use the data in compliance with applicable laws and the terms of service of the respective news outlets.


Need help? Open an issue on the actor's repository or contact support through Apify Console.