Industry Updates Scrapper
Pricing
$20.00/month + usage
Industry Updates Scrapper
Scrape the latest Indian industry news from top sources like Economic Times, Mint & The Hindu Business Line. Get full articles with images, published dates & source URLs in clean CSV format. Perfect for market research, content curation & business intelligence. Fast, reliable & proxy-enabled.
Pricing
$20.00/month + usage
Rating
5.0
(3)
Developer
ai-scraper-labs
Actor stats
0
Bookmarked
3
Total users
2
Monthly active users
8 days ago
Last modified
Categories
Share
Indian Industry Updates Scraper — Apify Actor
Scrape the latest Indian Industry news from trusted sources including Economic Times, The Hindu Business Line, Mint, Business Standard, and Financial Express. Extract full article text and get structured data ready for analysis.
Features
- Fetches articles from 5 major Indian business news RSS feeds
- Direct RSS feeds (no Google blocking issues)
- Extracts full article content using newspaper3k and readability
- Returns structured data with title, source, URL, full text, authors, keywords, and more
- Configurable number of articles (1-50)
- Robust content extraction with fallback mechanisms
- Fast and reliable scraping
News Sources
- Economic Times - Industry section
- The Hindu Business Line - Economy coverage
- Mint - Industry news
- Business Standard - Economy & Policy
- Financial Express - Industry updates
Inputs
| Field | Type | Notes |
|---|---|---|
num_articles | integer | Number of articles to scrape (default: 10, max: 50) |
Note: Workers (10) and Apify proxy (enabled) are automatically configured for optimal performance.
Output
JSON dataset with article objects containing:
- title: Article headline
- source: News source name
- source_url: Direct link to article
- published: Publication date/time
- full_text: Complete article content (minimum 300 characters)
- authors: List of article authors
- keywords: Extracted keywords
- summary: Article summary
- thumbnail: Featured image URL
- extraction_source: Method used to extract content (newspaper3k or readability)
- text_length: Character count of article text
Example Output
```json { "title": "Fee income boosts bank profits as treasury, margins feel the pinch", "source": "Economic Times - Industry", "source_url": "https://economictimes.indiatimes.com/industry/banking/finance/banking/...", "published": "Fri, 07 Nov 2025 00:29:46 +0530", "full_text": "Banks are increasingly relying on fee-based income...", "authors": [], "keywords": ["banking", "finance", "india"], "summary": "Banks pivot to fee income as margins compress...", "thumbnail": "https://...", "extraction_source": "readability", "text_length": 2259 } ```
Usage
Deploy this actor on Apify and configure the input parameters. The actor will scrape Indian industry news from multiple RSS feeds and return structured article data in the dataset.
Local Testing
```bash python src/test.py ```
Minimum Text Length
Articles must have at least 300 characters of text content to be included in the results.