Artificial Intelligence News Intelligence Pipeline (AI-Powered) avatar
Artificial Intelligence News Intelligence Pipeline (AI-Powered)
Under maintenance

Pricing

Pay per event

Go to Apify Store
Artificial Intelligence News Intelligence Pipeline (AI-Powered)

Artificial Intelligence News Intelligence Pipeline (AI-Powered)

Under maintenance

Developed by

bySeitz AI & Automation

bySeitz AI & Automation

Maintained by Community

This Apify Actor processes AI and Tech news from major RSS feeds and transforms headlines into structured, actionable intelligence using the Google Search Results Scraper's AI Overview and combined Language Model (LLM) analysis.

0.0 (0)

Pricing

Pay per event

1

2

2

Last modified

11 hours ago

🤖 AI News Intelligence Pipeline

This actor provides a powerful, automated pipeline to transform raw cybersecurity news from top RSS feeds into structured, actionable threat intelligence. It uses the Google Programmable Search API to gather real-time context and an LLM (OpenAI) to perform advanced analysis, ensuring your data is both current and insightful.


Features

  • Comprehensive Source Aggregation: Gathers news from a curated list of top-tier cybersecurity RSS feeds.
  • Real-Time Grounding: Uses the Google Programmable Search API to enrich each article with up-to-the-minute context before analysis, preventing outdated results.
  • Advanced AI Analysis: Leverages a Large Language Model to analyze each article for sentiment, categorize the threat type (e.g., Malware, Data Breach), and extract key entities (e.g., threat groups, CVEs, companies).
  • Duplicate Prevention: Intelligently tracks processed articles across runs to ensure you only process and pay for new information.
  • Cost-Saving Test Mode: Includes a test mode to run the full workflow with dummy data, allowing for development and testing without incurring API costs.

Setup and Configuration

Before running the actor, you need to provide API keys for the services it relies on.

  1. Google Programmable Search API:
    • You will need a Google API Key and a Search Engine ID.
    • Follow the instructions here to get your credentials.
  2. OpenAI API Key:
    • You will need an API key from your OpenAI account.

Add Keys to Apify Secrets

For security, add these keys as secret environment variables in your Apify Actor settings:

  • GOOGLE_API_KEY: Your Google API Key.
  • GOOGLE_CSE_ID: Your Programmable Search Engine ID.
  • OPENAI_API_KEY: Your OpenAI API Key.

Cost of Usage 💸

This actor incurs costs from three sources:

  1. Apify Platform Usage: Standard platform costs for running the actor, which depends on the duration of the run.
  2. Google Programmable Search API: The actor makes one search query for every article it processes. Google provides a free tier of 100 queries/day, after which it costs approximately $5 per 1,000 queries.
  3. OpenAI API: This is the primary cost. The actor makes two LLM calls for every article: one for analysis and one for summarization. The cost depends on the model you use and the number of tokens processed.

Input

FieldTypeDefaultDescription
sourceStringallThe cybersecurity news source category to use.
customFeedUrlStringnullA custom RSS feed URL to use if source is set to custom.
maxArticlesInteger20The maximum number of new articles to fetch and process in a single run.
useSummarizationBooleantrueIf enabled, the actor will generate an AI summary for each article, incurring an additional LLM cost.
runTestModeBooleanfalseBypasses all external API calls for zero-cost testing. Do not enable in production.

Output

The actor saves its results in the dataset. Each item is a structured JSON object with the following fields:

FieldTypeDescription
sourceStringThe name of the news source (e.g., 'The Hacker News').
titleStringThe original title of the news article.
urlStringThe URL of the original article.
publishedStringThe publication date string from the RSS feed.
summaryStringThe AI-generated summary of the article.
sentimentStringThe AI-analyzed sentiment (e.g., High Risk, Medium Risk).
categoryStringThe AI-assigned category (e.g., 'Malware/Ransomware', 'Data Breach/Hack').
key_entitiesArray of StringsA list of key entities like companies, threat actors, or CVEs mentioned.