Arab News Aggregator - منصة رصد الأخبار
Pricing
from $0.90 / 1,000 articles
Aggregate and extract full-text news articles from 25+ major Arab news sources across the Middle East and North Africa. Get structured article data including title, full text, authors, publish date, and more. Select individual sources via input toggles.
Developer: Zen Studio
Collect and extract full-text news articles from 25+ major Arab news sources across the Middle East and North Africa. Get structured article data including title, full text, authors, publish date, and more - updated with every run.
No login required. No API keys to manage. Just set your filters and start collecting articles.
Why Arab News?
The Arab world generates a massive volume of daily news content across hundreds of outlets in Arabic, English, and French. This actor provides a single, unified feed covering the region's most important news sources, enabling:
- Media monitoring: Track news coverage across the Arab world in real time
- Sentiment analysis: Feed structured article text into NLP pipelines
- Market intelligence: Monitor business, economic, and political developments
- Academic research: Collect large-scale Arabic news corpora
- Content aggregation: Build news feeds or dashboards for MENA coverage
Quick Start
Default: Get 100 Recent Articles
{"limit": 100}
Search for a Topic
{"search_query": "الذكاء الاصطناعي","limit": 100}
Articles from a Date Range
{"start_date": "2026-02-01","end_date": "2026-02-15","limit": 500}
Combined: Search Within a Date Range
{"search_query": "oil prices","start_date": "2026-02-01","limit": 200}
Select Specific Sources
{"limit": 100,"al_jazeera": true,"al_arabiya": true,"sky_news_arabia": true,"asharq_al_awsat": false,"cnn_arabic": false}
Unlimited (All Available Articles)
{"limit": 0}
Input Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| search_query | string | No | - | Filter articles by keyword. Only articles whose title, content, or URL contain this keyword will be returned. |
| start_date | date | No | - | Only include articles published on or after this date (YYYY-MM-DD). |
| end_date | date | No | - | Only include articles published on or before this date (YYYY-MM-DD). |
| limit | integer | No | 100 | Maximum number of articles to return. Set to 0 for unlimited. Max: 10,000. |
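The constraints in the table can be checked client-side before a run is started. The following is a minimal sketch, not part of the actor: `build_input` is a hypothetical helper, and the rule that `start_date` must not come after `end_date` is an assumption that the table does not state.

```python
from datetime import datetime

MAX_LIMIT = 10_000  # documented upper bound; 0 means "unlimited"


def build_input(search_query=None, start_date=None, end_date=None, limit=100):
    """Return an input dict, raising ValueError on values the table rules out."""
    if isinstance(limit, bool) or not isinstance(limit, int) or not 0 <= limit <= MAX_LIMIT:
        raise ValueError(f"limit must be an integer between 0 and {MAX_LIMIT}")

    run_input = {"limit": limit}
    if search_query:
        run_input["search_query"] = search_query

    parsed = {}
    for key, value in (("start_date", start_date), ("end_date", end_date)):
        if value is not None:
            # strptime raises ValueError when the value is not a year-month-day date.
            parsed[key] = datetime.strptime(value, "%Y-%m-%d").date()
            run_input[key] = value

    # Assumption: an inverted range is treated as a caller mistake.
    if len(parsed) == 2 and parsed["start_date"] > parsed["end_date"]:
        raise ValueError("start_date must not be after end_date")
    return run_input
```

Optional parameters that are not set are left out of the dict, so the actor's own defaults apply.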
Filtering Details
- Search query matches against article title and full text (case-insensitive). During discovery, it also pre-filters on title metadata when available. Articles that can't be ruled out (missing metadata) are kept for full analysis.
- Date filters apply to the article's date_publish field. Articles without a publish date are included (can't be ruled out).
- Filters are applied in two stages: first as a pre-filter on discovery metadata (fast, avoids unnecessary downloads), then as a post-filter on extracted full content (thorough, checks title + text).
- When using search_query, the actor may process more articles in the background than the final result count, since articles are filtered after extraction.
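The two-stage behavior described above can be illustrated with a small sketch. This is not the actor's source code: the function names are hypothetical, and the handling of a title that is present at discovery time but lacks the keyword (dropped here) is an assumption drawn from the first bullet.

```python
from datetime import date, datetime


def in_date_range(date_publish, start=None, end=None):
    """Missing publish dates cannot be ruled out, so they pass."""
    if not date_publish:
        return True
    published = datetime.fromisoformat(date_publish).date()
    if start and published < start:
        return False
    if end and published > end:
        return False
    return True


def contains(query, *fields):
    """Case-insensitive substring match over the fields that are present."""
    needle = query.casefold()
    return any(needle in f.casefold() for f in fields if f)


def pre_filter(meta, query=None, start=None, end=None):
    """Stage 1: cheap check on discovery metadata, before any download."""
    if not in_date_range(meta.get("date_publish"), start, end):
        return False
    title = meta.get("article_title")
    # Assumption: a present title without the keyword is dropped here;
    # a missing title cannot be ruled out, so the article is kept.
    return not query or not title or contains(query, title)


def post_filter(article, query=None, start=None, end=None):
    """Stage 2: thorough check on the extracted title and full text."""
    if not in_date_range(article.get("date_publish"), start, end):
        return False
    return not query or contains(query, article.get("article_title"), article.get("text"))
```

An article with no discovery metadata passes `pre_filter` and is only judged once its text has been extracted, which is why more articles may be processed than are returned.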
Source Selection
Each news source can be individually enabled or disabled via a toggle. All sources are enabled by default. Set a source to false to exclude it from the run.
See the Supported News Sources section below for the full list of sources. Each source has a corresponding toggle key in the input (for example, al_jazeera).
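Because every source is enabled by default, restricting a run to a few outlets means switching all the others off. A minimal sketch, assuming only the five toggle keys that appear in the Quick Start example; the complete key list is not given here and should be taken from the actor's input schema. `only_sources` is a hypothetical helper.

```python
# Keys copied from the Quick Start example. The full list is an assumption
# to be checked against the actor's input form.
KNOWN_SOURCE_KEYS = [
    "al_jazeera",
    "al_arabiya",
    "sky_news_arabia",
    "asharq_al_awsat",
    "cnn_arabic",
]


def only_sources(wanted, known=KNOWN_SOURCE_KEYS, **params):
    """Build an input dict that enables `wanted` and disables every other known source."""
    unknown = set(wanted) - set(known)
    if unknown:
        raise ValueError(f"unknown source keys: {sorted(unknown)}")
    run_input = dict(params)
    run_input.update({key: key in wanted for key in known})
    return run_input
```

For example, `only_sources({"al_jazeera"}, limit=100)` sets `al_jazeera` to true and the other four listed keys to false. Sources missing from `KNOWN_SOURCE_KEYS` would stay enabled, so the list needs to be complete for this to work as intended.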
Supported News Sources
The actor monitors 25+ news outlets across 8 countries, covering politics, business, sports, culture, and more.
Pan-Arab / International (عربي دولي)
| Source | Domain | Summary | Language |
|---|---|---|---|
| Asharq Al-Awsat (الشرق الأوسط) | aawsat.com | London-based pan-Arab daily owned by Saudi Research and Media Group. | Arabic |
| Al-Quds Al-Arabi (القدس العربي) | alquds.co.uk | Independent pan-Arab daily published in London since 1989. | Arabic |
| The New Arab (العربي الجديد) | alaraby.co.uk | Pan-Arab media platform covering news, opinion, and culture. | Arabic |
| CNN Arabic (سي إن إن عربي) | arabic.cnn.com | Arabic edition of the CNN international news network. | Arabic |
Egypt (مصر)
| Source | Domain | Summary | Language |
|---|---|---|---|
| Al-Masry Al-Youm (المصري اليوم) | almasryalyoum.com | Leading independent Egyptian daily covering politics, economy, and society. | Arabic |
| Masrawy (مصراوي) | masrawy.com | Popular Egyptian news portal with broad lifestyle and entertainment coverage. | Arabic |
| Shorouk News (الشروق) | shorouknews.com | Egyptian newspaper covering politics, sports, arts, and local affairs. | Arabic |
UAE (الإمارات)
| Source | Domain | Summary | Language |
|---|---|---|---|
| Sky News Arabia (سكاي نيوز عربية) | skynewsarabia.com | 24-hour Arabic news channel headquartered in Abu Dhabi. | Arabic |
| Al Arabiya (العربية) | alarabiya.net | Saudi-owned Arabic news channel based in Dubai Media City. | Arabic |
| Al Khaleej (الخليج) | alkhaleej.ae | Long-running Sharjah-based daily, one of the UAE's leading Arabic papers. | Arabic |
| 24.ae (٢٤) | 24.ae | UAE digital news outlet covering local and regional stories. | Arabic |
| Emarat Al Youm (الإمارات اليوم) | emaratalyoum.com | UAE national daily focused on local Emirati news and affairs. | Arabic |
Saudi Arabia (السعودية)
| Source | Domain | Summary | Language |
|---|---|---|---|
| Al Riyadh (الرياض) | alriyadh.com | Riyadh-based daily and one of Saudi Arabia's most widely read newspapers. | Arabic |
| Sabq (سبق) | sabq.org | Saudi Arabia's most popular online-only news outlet. | Arabic |
| Okaz (عكاظ) | okaz.com.sa | Jeddah-based daily covering national news, sports, and culture. | Arabic |
Qatar (قطر)
| Source | Domain | Summary | Language |
|---|---|---|---|
| Al Jazeera (الجزيرة) | aljazeera.net | Doha-based international news network with global Arabic-language reach. | Arabic |
| Al Watan (الوطن) | al-watan.com | Qatari daily newspaper covering local and regional news. | Arabic |
| Al Raya (الراية) | raya.com | Qatari daily offering broad coverage of politics, sports, and society. | Arabic |
Jordan (الأردن)
| Source | Domain | Summary | Language |
|---|---|---|---|
| Al Rai (الرأي) | alrai.com | Jordan's largest-circulation daily and semi-official national newspaper. | Arabic |
Iraq (العراق)
| Source | Domain | Summary | Language |
|---|---|---|---|
| Azzaman (الزمان) | azzaman.com | Independent Iraqi daily published in Baghdad, London, and several other cities. | Arabic |
| Sot al-Iraq (صوت العراق) | sotaliraq.com | Iraqi news website covering domestic politics and current affairs. | Arabic |
| Al Mada (المدى) | almadapaper.net | Independent Iraqi daily known for cultural and investigative reporting. | Arabic |
Kuwait (الكويت)
| Source | Domain | Summary | Language |
|---|---|---|---|
| Al Anba (الأنباء) | alanba.com.kw | Major Kuwaiti daily covering local, Gulf, and international news. | Arabic |
Oman (عُمان)
| Source | Domain | Summary | Language |
|---|---|---|---|
| Al Watan (الوطن) | alwatan.om | Oman's leading Arabic daily covering national and regional affairs. | Arabic |
Output Example
Each article is returned as a structured object with full text and metadata:
```json
{
  "url": "https://www.aljazeera.net/news/2026/2/15/example-article",
  "article_title": "Example article headline",
  "authors": ["Author Name"],
  "date_publish": "2026-02-15T10:30:00",
  "article_description": "Brief summary of the article content.",
  "text": "Full article text extracted from the page...",
  "language": "ar",
  "image_url": "https://www.aljazeera.net/image.jpg",
  "source_domain": "www.aljazeera.net",
  "error": null
}
```
Output Fields
| Field | Type | Description |
|---|---|---|
| url | string | Full URL of the article |
| article_title | string | Article headline |
| authors | array | List of author names |
| date_publish | string | Publication date (ISO 8601) |
| article_description | string | Article summary / meta description |
| text | string | Full article text |
| language | string | Language code (e.g. ar, en, fr) |
| image_url | string | URL of the main article image |
| source_domain | string | Domain name of the news source |
| error | string | Error message, or null if successful |
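For downstream processing it can help to load each item into a typed record. The sketch below mirrors the field table; the `Article` class and `from_item` are hypothetical names, and it assumes `date_publish` arrives in the form shown in the output example (no timezone suffix).

```python
from dataclasses import dataclass, field, fields
from datetime import datetime
from typing import Optional


@dataclass
class Article:
    url: str
    article_title: Optional[str] = None
    authors: list = field(default_factory=list)
    date_publish: Optional[datetime] = None
    article_description: Optional[str] = None
    text: Optional[str] = None
    language: Optional[str] = None
    image_url: Optional[str] = None
    source_domain: Optional[str] = None
    error: Optional[str] = None

    @classmethod
    def from_item(cls, item):
        """Build an Article from a dataset item, ignoring unknown keys and nulls."""
        known = {f.name for f in fields(cls)}
        data = {k: v for k, v in item.items() if k in known and v is not None}
        if "date_publish" in data:
            # Assumes the ISO 8601 form from the output example.
            data["date_publish"] = datetime.fromisoformat(data["date_publish"])
        return cls(**data)
```

Failed items, which carry only `url` and `error`, load with every other field left at its default.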
Failed Articles
Articles that can't be extracted (paywall, timeout, etc.) are returned with an error:
{"url": "https://example.com/article","error": "Failed to download article"}
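Since failed items share the dataset with successful ones, a consumer usually separates the two first. A minimal sketch with hypothetical helper names; it relies only on the `error` field described above.

```python
from collections import Counter


def split_results(items):
    """Partition dataset items into (successful, failed) by the error field."""
    ok, failed = [], []
    for item in items:
        (failed if item.get("error") else ok).append(item)
    return ok, failed


def failure_summary(failed):
    """Count failed items per error message."""
    return Counter(item["error"] for item in failed)
```

`failure_summary` gives a quick view of which error messages dominate a run.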
Pricing: Pay As You Go
This actor uses Apify's pay-per-event billing model.
| Event | Cost | Description |
|---|---|---|
| Actor start | $0.0002 | Charged once per run |
| Article | $0.99 / 1,000 articles | Charged per successfully extracted article |
Key billing details:
- Only successful articles are charged: Failed extractions are free
- No minimum commitment: Pay only for what you use
- Budget controls: Set spending limits in your Apify account
- Discounts available: Apify Bronze, Silver, and Gold members receive discounted rates
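For budgeting, the event table can be turned into a quick estimate. This is a rough sketch that assumes the list prices in the table with no membership discount applied; `estimate_cost` is a hypothetical helper, not part of the actor or the Apify client.

```python
from decimal import Decimal

ACTOR_START_USD = Decimal("0.0002")       # charged once per run
PER_ARTICLE_USD = Decimal("0.99") / 1000  # charged per successfully extracted article


def estimate_cost(successful_articles, runs=1):
    """Estimated charge in USD; failed extractions are not billed, so exclude them."""
    return ACTOR_START_USD * runs + PER_ARTICLE_USD * successful_articles
```

At these rates, a single run that extracts 1,000 articles comes to $0.9902, and four scheduled runs that extract 500 articles in total come to $0.4958.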
API Integration
Call via Apify API
```bash
curl -X POST "https://api.apify.com/v2/acts/zen-studio~arab-news-collector/runs?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"limit": 100, "search_query": "oil prices", "start_date": "2026-02-01"}'
```
Python
```python
from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")

# Start the actor and wait for the run to finish.
run = client.actor("zen-studio/arab-news-collector").call(
    run_input={
        "limit": 100,
        "search_query": "oil prices",
        "start_date": "2026-02-01",
    }
)

# Print the successfully extracted articles from the run's dataset.
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    if item.get("error"):
        continue
    print(f"[{item['source_domain']}] {item['article_title']}")
    print(f"  {item['url']}")
    print(f"  Published: {item['date_publish']}")
    print()
```
JavaScript / Node.js
```javascript
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

// Start the actor and wait for the run to finish.
const run = await client.actor('zen-studio/arab-news-collector').call({
    limit: 100,
    search_query: 'oil prices',
    start_date: '2026-02-01',
});

// Print the successfully extracted articles from the run's dataset.
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items
    .filter(item => !item.error)
    .forEach(item => {
        console.log(`[${item.source_domain}] ${item.article_title}`);
        console.log(`  ${item.url}`);
    });
```
Use Cases
- News monitoring dashboards: Aggregate headlines from across the Arab world into a single feed for journalists, analysts, or PR teams
- NLP & sentiment analysis: Extract full-text Arabic articles at scale for training language models or running sentiment classifiers
- Competitive intelligence: Track press coverage of companies, brands, or industries across MENA media
- Academic research: Build Arabic news corpora for linguistics, political science, or media studies
- Risk & compliance: Monitor political and economic developments that may impact business operations in the region
Advanced Usage
Scheduled Monitoring
Set up recurring runs to build a continuous news archive:
- Go to the actor's Schedules tab
- Create a new schedule (e.g., every 6 hours)
- Configure your input with the desired limit
- Results accumulate in datasets for historical analysis
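Consecutive scheduled runs may return some of the same articles, since each run collects whatever the sources currently expose. A minimal sketch of merging several runs into one archive keyed by URL; `merge_runs` is a hypothetical helper, and keeping the first copy seen is an arbitrary choice.

```python
def merge_runs(*runs):
    """Merge item lists from several runs, keeping the first successful copy of each URL."""
    archive = {}
    for items in runs:
        for item in items:
            if item.get("error"):
                continue  # skip failed extractions; a later run may succeed
            archive.setdefault(item["url"], item)
    return list(archive.values())
```

Pass the item lists in chronological order so that the earliest copy of each article is the one retained.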
Integration with Google Sheets
Export results directly to Google Sheets:
- Run the actor
- Go to the dataset
- Click Export > Google Sheets
Technical Details
How It Works
- Discovery: The actor queries the news sources to find article URLs
- Pre-filtering: Articles are filtered early based on discovery metadata (date, title keyword) to skip irrelevant downloads
- Deduplication: Duplicate URLs across sources are removed, and articles are interleaved round-robin across sources for fair distribution
- Extraction: Each article is downloaded and parsed in batches to extract structured content (title, text, authors, date, etc.)
- Post-filtering: Extracted articles are filtered again on full content (date + keyword search in title and text)
- Output: Results are pushed to your Apify dataset
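The deduplication step above combines two ideas: dropping repeated URLs and interleaving sources round-robin. The sketch below shows one way to do both. It illustrates the technique under the assumption that discovery yields a list of URLs per source, and is not the actor's actual implementation; `dedupe_and_interleave` is a hypothetical name.

```python
from itertools import zip_longest


def dedupe_and_interleave(urls_by_source):
    """Drop URLs already seen in an earlier source, then take one URL per source in turn."""
    seen = set()
    per_source = []
    for urls in urls_by_source.values():
        kept = []
        for url in urls:
            if url not in seen:
                seen.add(url)
                kept.append(url)
        per_source.append(kept)

    missing = object()  # sentinel for sources that have run out of URLs
    return [
        url
        for batch in zip_longest(*per_source, fillvalue=missing)
        for url in batch
        if url is not missing
    ]
```

With a limit applied to the interleaved list, no single prolific source can crowd out the others.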
Performance
- Batched processing: Large runs are automatically chunked for reliability
- Fault tolerance: If one source or batch fails, others continue processing
- Memory: 256-512 MB per run
Error Handling
| Error Type | Behavior |
|---|---|
| Source unavailable | Skipped, other sources continue |
| Article download failed | Returned with error message |
| Rate limit (429) | Automatic retry with exponential backoff |
| Server error (5xx) | Automatic retry (3 attempts) |
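The retry rows in the table follow a common pattern, sketched below. The attempt count of 3 comes from the 5xx row; applying the same count to 429 responses and the 1-second base delay are assumptions, and `fetch_with_retries` is a hypothetical function rather than the actor's code.

```python
import time


def is_retryable(status):
    """429 (rate limit) and 5xx (server error) responses are worth retrying."""
    return status == 429 or 500 <= status < 600


def fetch_with_retries(fetch, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call fetch() until it returns a non-retryable status or attempts run out.

    `fetch` is any zero-argument callable returning a (status, body) tuple.
    """
    for attempt in range(attempts):
        status, body = fetch()
        if not is_retryable(status) or attempt == attempts - 1:
            return status, body
        sleep(base_delay * 2 ** attempt)  # delay doubles after each failed attempt
```

Passing `sleep` as a parameter lets the backoff be tested without actually waiting.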
Data Export
Results can be exported in multiple formats:
- JSON - Full structured data
- CSV - Flattened for spreadsheets
- Excel - Direct .xlsx download
- XML - For legacy system integration
```bash
# JSON
curl "https://api.apify.com/v2/datasets/DATASET_ID/items?format=json&token=YOUR_TOKEN"

# CSV
curl "https://api.apify.com/v2/datasets/DATASET_ID/items?format=csv&token=YOUR_TOKEN"
```
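The same export URLs can be built programmatically. A small sketch: `dataset_items_url` is a hypothetical helper, and the mapping of the four listed export types to `format` values (`xlsx` for Excel in particular) should be confirmed against the Apify API reference.

```python
from urllib.parse import urlencode

# Export types listed above, mapped to the `format` query value.
EXPORT_FORMATS = {"json", "csv", "xlsx", "xml"}


def dataset_items_url(dataset_id, fmt="json", token=None):
    """Build the dataset items URL for the given export format."""
    if fmt not in EXPORT_FORMATS:
        raise ValueError(f"unsupported format: {fmt!r}")
    query = {"format": fmt}
    if token:
        query["token"] = token
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{urlencode(query)}"
```

`urlencode` escapes the token value, so any special characters in it do not break the query string.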
FAQ
How many articles does each run return?
It depends on what the sources have published recently. A typical run with limit: 0 returns several hundred to a few thousand articles from across all sources.
How often is the source list updated?
Sources are curated and updated periodically. If you'd like to suggest a source, open an issue on the actor's repository.
Does the actor store historical articles?
No. Each run discovers currently available articles from source feeds. To build a historical archive, set up scheduled runs.
What languages are the articles in?
Most articles are in Arabic. Article language is indicated in the language field.
Can I filter by keyword or date?
Yes. Use search_query to filter articles by keyword (matches title and full text). Use start_date and end_date to restrict articles to a date range. These filters can be combined.
Can I filter by source or country?
Yes. Each source has a toggle in the input (e.g., al_jazeera, al_arabiya). Set any source to false to exclude it. All sources are enabled by default. There is no country-level toggle; to narrow a run to one country, disable the sources outside it, or filter the output dataset by source_domain after the run completes.
What happens if a news source goes offline?
The actor continues processing all other sources. Failed sources are logged but don't stop the run.
Why does the actor process more articles than my limit when using search?
When search_query is set, the actor needs to download and extract articles to check their full content against your keyword. The limit controls how many matching articles you receive, but more articles may be processed behind the scenes.
Disclaimer
This actor extracts publicly available news articles from public web pages. It does not bypass paywalls, authentication, or access restrictions. Use the data in compliance with applicable laws and the terms of service of the respective news outlets.
Need help? Open an issue on the actor's repository or contact support through Apify Console.