# AirLive Aviation News Scraper (`parseforge/airlive-aviation-news-rss-scraper`) Actor

Tap the AirLive feed for breaking aviation industry news with article title, link, publication date, author, and summary. Filter by keyword to track airline incidents, route launches, and safety stories for newsroom monitoring, briefings, and aviation research.

- **URL**: https://apify.com/parseforge/airlive-aviation-news-rss-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** Automation, Integrations, News
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $7.50 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## ✈️ AirLive Aviation News Scraper

> 🚀 **Export aviation breaking news articles from AirLive in seconds. From the public AirLive source to spreadsheet, JSON, JSONL, XML, RSS, or HTML.**

> 🕒 **Last updated:** 2026-06-05 · **📊 9 fields** per record · Public AirLive data · Real-time

The AirLive Aviation News Scraper turns the [AirLive](https://www.airlive.net/) public endpoint into a clean, structured dataset. It pulls aviation breaking news articles from AirLive, normalizes the fields, and pushes one row per record.

Coverage is whatever AirLive publishes on the public endpoint at the moment you run the actor. No login, no API key, no scraping tricks. Just clean data, ready for analysis.

| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| 📊 Analysts | Pull a fresh snapshot for dashboards |
| 🛠️ Developers | Mirror the feed into your own database |
| 📰 Journalists | Verify facts quickly without writing client code |
| 🤖 ML engineers | Build training sets from the public feed |
| 🏢 Operations teams | Monitor changes over time |
| 🎓 Researchers | Use the data for academic studies |

### 📋 What the AirLive Aviation News Scraper does

- Calls the public AirLive endpoint and parses the response.
- Flattens the data into one row per record with clean field names.
- Surfaces upstream errors as a single record with the `error` field set, so your pipeline does not crash.
- Lets you cap the output with `maxItems` for cheap previews.
- Streams results to the Apify dataset for instant export.

> 💡 **Why it matters:** Public feeds change shape and field names constantly. This actor gives you a stable, documented schema your downstream tooling can rely on.

### 🎬 Full Demo

_🚧 Coming soon._

### ⚙️ Input

<table>
<tr><th>Field</th><th>Type</th><th>Required</th><th>Description</th></tr>
<tr><td><code>maxItems</code></td><td>integer</td><td>No</td><td>Free users: 10. Paid users: up to 1,000,000. Prefill: 10.</td></tr>
<tr><td><code>feedUrl</code></td><td>string</td><td>No</td><td>RSS feed URL to scrape.</td></tr>
<tr><td><code>query</code></td><td>string</td><td>No</td><td>Optional title substring filter.</td></tr>
</table>

**Example input:**
```json
{
  "maxItems": 10,
  "feedUrl": "https://www.airlive.net/feed/"
}
````

> ⚠️ **Good to Know:** This actor depends on the public AirLive endpoint. If the upstream is rate-limiting or temporarily unavailable, the actor pushes a single record with the `error` field populated. Re-run later.

### 📊 Output

Each record is a flat object. The `error` field is always last.

| Field | Type | Description |
|---|---|---|
| 📰 Title `title` | string | Article title. |
| 🔗 Link `link` | string | Article URL. |
| 📅 Published `pubDate` | string | Publication date. |
| ✍️ Author `author` | string | Author name. |
| 🏷️ Categories `categories` | array | Categories. |
| 📝 Summary `summary` | string | Article summary. |
| 📄 Content `content` | string | Full content. |
| 🕒 `scrapedAt` | string | When this row was fetched. |
| ❌ `error` | string | Set if the upstream response was an error. |

**Sample record:**

```json
{
  "title": "",
  "link": "",
  "pubDate": "",
  "author": "",
  "categories": [],
  "summary": "",
  "content": "",
  "scrapedAt": "2026-06-05T12:00:00.000Z",
  "error": null
}
```

### ✨ Why choose this Actor

| 🆓 | Works on the free Apify tier with no AirLive account required. |
| 🧹 | Clean, snake-friendly field names ready for BI tools. |
| 🛟 | Surfaces upstream rate-limit and error responses as a clean record instead of crashing. |
| 🔌 | Zero setup. Click run, get data. |
| 💾 | Push to dataset for instant spreadsheet, JSON, JSONL, XML, RSS, or HTML export. |
| ⚡ | Lightweight. No browser, no heavy dependencies. |

### 📈 How it compares to alternatives

| Approach | Setup time | Clean fields? | Error handling? |
|---|---|---|---|
| Roll your own fetch | 30 min + | ❌ | ❌ |
| Generic RSS reader | 5 min | partial | ❌ |
| **This Actor** | 5 sec, no install | ✅ | ✅ |

### 🚀 How to use

1. Click **Try for free**.
2. Adjust any filters in the input form (defaults are fine for a first run).
3. Click **Start**. Your dataset is ready in seconds. Download as spreadsheet, JSON, JSONL, XML, RSS, or HTML, or pipe to your warehouse.

### 💼 Business use cases

**📊 Operational dashboards.** Pull a fresh snapshot every few minutes and feed your status board.

**🤖 ML feature engineering.** Build a clean training set from the public feed for downstream modelling.

**📰 Newsroom fact-checking.** Reporters can grab a verified snapshot in seconds, no specialist tool needed.

**🏢 Compliance and audit.** Keep a tamper-evident log of what the upstream source said, when.

### 🔌 Automating AirLive Aviation News Scraper

- **Make / Zapier**: trigger this actor on a schedule, push results to Airtable, Google Sheets, or Slack.
- **Cron schedule**: native Apify scheduler.
- **Webhooks**: get a POST to your endpoint the moment a run finishes.
- **Pipe to BigQuery / Snowflake / Postgres**: native Apify integrations move datasets straight into your warehouse.

### 🌟 Beyond business use cases

**🎓 Education.** Teach a class using a real-time public feed instead of a stale textbook dataset.

**🧪 Personal research.** Track changes over time for whatever you find interesting.

**🤝 Non-profit and open data.** Build public dashboards without writing client code.

**🧰 Tinkering and prototyping.** Spin up a clean feed in 5 seconds to test a new tool or pipeline idea.

### 🤖 Ask an AI assistant about this scraper

Paste this README into ChatGPT, Claude, or any AI assistant and ask it to map your workflow to the actor's inputs. The schema, examples, and field list above contain everything an LLM needs to design a working pipeline.

### ❓ Frequently Asked Questions

**❓ Do I need an account with AirLive?** No. This actor uses the public endpoint only.

**❓ Is this scraping or API?** It is a thin wrapper over the public AirLive endpoint, normalised for downstream tools.

**❓ Will the schema change?** The documented fields are stable. New fields from upstream are passed through as-is.

**❓ How fresh is the data?** As fresh as AirLive publishes it. The actor reads live every run.

**❓ What happens during a rate-limit?** The actor pushes a single record with the `error` field populated instead of crashing.

**❓ Can I schedule runs?** Yes, use the Apify scheduler or hook into Make, Zapier, or cron.

**❓ How do I cap the cost of a test run?** Set `maxItems` to a small number, such as 5 or 10.

**❓ What format can I download?** Spreadsheet, JSON, JSONL, XML, RSS, or HTML straight from the Apify dataset UI.

**❓ Is the data accurate?** It mirrors what AirLive publishes. Any inaccuracy upstream will be mirrored downstream.

**❓ Can I run it from my own code?** Yes, every Apify actor exposes a REST API and SDK clients in popular languages.

### 🔌 Integrate with any app

Apify ships native integrations with Make, Zapier, Slack, Discord, Google Drive, Google Sheets, Gmail, Airbyte, Keboola, Telegram, GitHub, and any REST API or webhook endpoint. Trigger runs from a calendar event, a form submission, a cron job, or pipe results straight into BigQuery, Snowflake, or a Postgres warehouse.

### 🔗 Recommended Actors

| Actor | What it does |
|---|---|
| [ParseForge OurAirports Scraper](https://apify.com/parseforge/ourairports-scraper) | Global airport database. |
| [ParseForge Alpha Vantage Public Scraper](https://apify.com/parseforge/alpha-vantage-public-scraper) | Stocks, FX, crypto, indicators. |
| [ParseForge NBA Stats Scraper](https://apify.com/parseforge/nba-stats-scraper) | NBA player and team stats. |
| [ParseForge CurseForge Mods Scraper](https://apify.com/parseforge/curseforge-mods-scraper) | Public mod metadata. |

> 💡 **Pro Tip:** browse the complete [ParseForge collection](https://apify.com/parseforge) for 900+ production-grade scrapers across business intelligence, real estate, e-commerce, sports, finance, and public records.

***

**Disclaimer:** This actor scrapes only publicly available data. ParseForge is not affiliated with, endorsed by, or sponsored by any of the third-party services referenced. Users are responsible for complying with the target site's terms of service and applicable law. [Create a free account w/ $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp).

# Actor input Schema

## `maxItems` (type: `integer`):

Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000

## `feedUrl` (type: `string`):

RSS feed URL to scrape.

## `query` (type: `string`):

Optional title substring filter.

## Actor input object example

```json
{
  "maxItems": 10,
  "feedUrl": "https://www.airlive.net/feed/"
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "maxItems": 10,
    "feedUrl": "https://www.airlive.net/feed/"
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/airlive-aviation-news-rss-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "maxItems": 10,
    "feedUrl": "https://www.airlive.net/feed/",
}

# Run the Actor and wait for it to finish
run = client.actor("parseforge/airlive-aviation-news-rss-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "maxItems": 10,
  "feedUrl": "https://www.airlive.net/feed/"
}' |
apify call parseforge/airlive-aviation-news-rss-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/airlive-aviation-news-rss-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "AirLive Aviation News Scraper",
        "description": "Tap the AirLive feed for breaking aviation industry news with article title, link, publication date, author, and summary. Filter by keyword to track airline incidents, route launches, and safety stories for newsroom monitoring, briefings, and aviation research.",
        "version": "0.1",
        "x-build-id": "I5Y8GzjLKGswuU1Ok"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~airlive-aviation-news-rss-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-airlive-aviation-news-rss-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~airlive-aviation-news-rss-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-airlive-aviation-news-rss-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~airlive-aviation-news-rss-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-airlive-aviation-news-rss-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000"
                    },
                    "feedUrl": {
                        "title": "Feed URL",
                        "type": "string",
                        "description": "RSS feed URL to scrape."
                    },
                    "query": {
                        "title": "Title filter",
                        "type": "string",
                        "description": "Optional title substring filter."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
