Deal Scraper PRNewswire

Scrapes the last 10 articles published in the M&A section of PRNewswire and provides relevant deal info.

Pricing: $19.95 / 1,000 results

Developer: Brad (Maintained by Community)

M&A x402 Service

A FastAPI service for Coinbase AgentKit that scrapes PR Newswire articles and extracts M&A (Mergers & Acquisitions) entities using OpenAI's GPT models.

Features

  • πŸ” Scrapes PR Newswire article links from a given URL
  • πŸ“„ Extracts full article text content
  • πŸ€– Uses OpenAI API to extract structured M&A entities:
    • BUYER: Acquiring companies/entities
    • SELLER: Companies being sold/divested
    • FUND: Investment funds, PE, VC firms
    • LAW_FIRM: Legal firms involved
    • INTERMEDIARY: Investment banks, advisors
    • PROFESSIONAL: Individual professionals
    • MONEY: Deal values and financial figures
    • DATE: Transaction dates
    • DEAL_TYPE: Type of transaction

Tech Stack

  • Python 3.11
  • FastAPI: Modern web framework
  • Uvicorn: ASGI server
  • BeautifulSoup4: HTML parsing
  • Newspaper3k: Article extraction
  • OpenAI API: Entity extraction
  • Pydantic: Data validation

Setup

1. Clone and Install Dependencies

cd mna-x402-service
pip install -r requirements.txt
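The contents of requirements.txt are not shown here; based on the tech stack above they would look roughly like the following (unpinned sketch; requests and python-dotenv are assumptions, not confirmed by this README):

```text
fastapi
uvicorn
beautifulsoup4
newspaper3k
openai
pydantic
requests
python-dotenv
```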

2. Environment Variables

Copy .env.example to .env and add your OpenAI API key:

cp .env.example .env

Edit .env:

OPENAI_API_KEY=sk-your-actual-key-here

3. Run Locally

uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

The API will be available at http://localhost:8000

API Endpoints

Health Check

GET /
GET /health

Scrape Articles

POST /x402/scrape
Content-Type: application/json

{
  "site_url": "https://www.prnewswire.com/news-releases/financial-services-latest-news/acquisitions-mergers-and-takeovers-list/",
  "max_articles": 10
}

Parameters:

  • site_url (required): URL of the PR Newswire page to scrape
  • max_articles (optional): Maximum number of newest articles to process. Defaults to 10, maximum 100. Returns the most recently posted articles first. Helps prevent timeouts and control processing time.

Response:

{
  "site_url": "https://...",
  "articles": [
    {
      "url": "https://...",
      "content": "Full article text...",
      "entities": {
        "buyer": ["Company A"],
        "seller": ["Company B"],
        "fund": ["PE Fund XYZ"],
        "law_firm": ["Law Firm ABC"],
        "intermediary": ["Investment Bank DEF"],
        "professional": ["John Doe"],
        "money": ["$100M", "$50 million"],
        "date": ["2024-01-15", "Q1 2024"],
        "deal_type": "Acquisition"
      }
    }
  ],
  "count": 10,
  "total_found": 25,
  "processed": 10,
  "limit": 10
}

Response Fields:

  • site_url: The URL that was scraped
  • articles: Array of article results with extracted entities
  • count: Number of articles in the response
  • total_found: Total number of articles found on the page
  • processed: Number of articles actually processed
  • limit: The limit that was applied (max_articles parameter)
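As a usage sketch, the response can be post-processed to map each article to its buyers, skipping articles that failed (failed articles carry an error field, as noted under Error Handling; the sample data here is illustrative):

```python
def buyers_by_article(response: dict) -> dict[str, list[str]]:
    """Map each article URL to the buyers extracted from it,
    skipping articles that carry an 'error' field."""
    result = {}
    for article in response.get("articles", []):
        if "error" in article:
            continue
        result[article["url"]] = article.get("entities", {}).get("buyer", [])
    return result

sample = {
    "articles": [
        {"url": "https://example.com/a", "entities": {"buyer": ["Company A"]}},
        {"url": "https://example.com/b", "error": "fetch failed"},
    ]
}
buyers = buyers_by_article(sample)
```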
Docker

Build

docker build -t mna-x402-service .

Run

docker run -p 8000:8000 --env-file .env mna-x402-service

API Documentation

Once running, visit:

  • Swagger UI: http://localhost:8000/docs
  • ReDoc: http://localhost:8000/redoc

Project Structure

mna-x402-service/
β”œβ”€β”€ app/
β”‚   β”œβ”€β”€ __init__.py
β”‚   β”œβ”€β”€ main.py        # FastAPI application
β”‚   β”œβ”€β”€ models.py      # Pydantic models
β”‚   β”œβ”€β”€ scraper.py     # PR Newswire scraping
β”‚   └── extractor.py   # OpenAI entity extraction
β”œβ”€β”€ requirements.txt
β”œβ”€β”€ Dockerfile
β”œβ”€β”€ .env.example
└── README.md

Error Handling

The service includes comprehensive error handling:

  • Network errors during scraping
  • Article extraction failures
  • OpenAI API errors
  • JSON parsing errors

Failed articles are included in the response with an error field.
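The per-article error pattern described above might look like this (a sketch; process_articles and fake_extract are illustrative names, not the service's actual internals):

```python
def process_articles(urls, extract):
    """Process each URL, recording failures as {'url': ..., 'error': ...}
    instead of aborting the whole batch."""
    results = []
    for url in urls:
        try:
            results.append({"url": url, "entities": extract(url)})
        except Exception as exc:  # network, extraction, OpenAI, or JSON errors
            results.append({"url": url, "error": str(exc)})
    return results

def fake_extract(url):
    """Stand-in for the real scrape-and-extract step."""
    if "bad" in url:
        raise RuntimeError("fetch failed")
    return {"buyer": ["Company A"]}

results = process_articles(["https://ok.example", "https://bad.example"], fake_extract)
```

This keeps one broken article from failing the whole batch, which matches the documented behavior of including failed articles in the response.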

Production Considerations

  • Add rate limiting
  • Implement caching for repeated requests
  • Add authentication/API keys
  • Set up monitoring and logging
  • Configure proper CORS origins
  • Use environment-specific configurations

License

MIT