Industry Updates Scrapper avatar
Industry Updates Scrapper

Pricing

$20.00/month + usage

Go to Apify Store
Industry Updates Scrapper

Industry Updates Scrapper

Scrape the latest Indian industry news from top sources like Economic Times, Mint & The Hindu Business Line. Get full articles with images, published dates & source URLs in clean CSV format. Perfect for market research, content curation & business intelligence. Fast, reliable & proxy-enabled.

Pricing

$20.00/month + usage

Rating

5.0

(3)

Developer

ai-scraper-labs

ai-scraper-labs

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

2

Monthly active users

8 days ago

Last modified

Share

Indian Industry Updates Scraper — Apify Actor

Scrape the latest Indian Industry news from trusted sources including Economic Times, The Hindu Business Line, Mint, Business Standard, and Financial Express. Extract full article text and get structured data ready for analysis.

Features

  • Fetches articles from 5 major Indian business news RSS feeds
  • Direct RSS feeds (no Google blocking issues)
  • Extracts full article content using newspaper3k and readability
  • Returns structured data with title, source, URL, full text, authors, keywords, and more
  • Configurable number of articles (1-50)
  • Robust content extraction with fallback mechanisms
  • Fast and reliable scraping

News Sources

  • Economic Times - Industry section
  • The Hindu Business Line - Economy coverage
  • Mint - Industry news
  • Business Standard - Economy & Policy
  • Financial Express - Industry updates

Inputs

FieldTypeNotes
num_articlesintegerNumber of articles to scrape (default: 10, max: 50)

Note: Workers (10) and Apify proxy (enabled) are automatically configured for optimal performance.

Output

JSON dataset with article objects containing:

  • title: Article headline
  • source: News source name
  • source_url: Direct link to article
  • published: Publication date/time
  • full_text: Complete article content (minimum 300 characters)
  • authors: List of article authors
  • keywords: Extracted keywords
  • summary: Article summary
  • thumbnail: Featured image URL
  • extraction_source: Method used to extract content (newspaper3k or readability)
  • text_length: Character count of article text

Example Output

```json { "title": "Fee income boosts bank profits as treasury, margins feel the pinch", "source": "Economic Times - Industry", "source_url": "https://economictimes.indiatimes.com/industry/banking/finance/banking/...", "published": "Fri, 07 Nov 2025 00:29:46 +0530", "full_text": "Banks are increasingly relying on fee-based income...", "authors": [], "keywords": ["banking", "finance", "india"], "summary": "Banks pivot to fee income as margins compress...", "thumbnail": "https://...", "extraction_source": "readability", "text_length": 2259 } ```

Usage

Deploy this actor on Apify and configure the input parameters. The actor will scrape Indian industry news from multiple RSS feeds and return structured article data in the dataset.

Local Testing

```bash python src/test.py ```

Minimum Text Length

Articles must have at least 300 characters of text content to be included in the results.