Epstein Files Scraper, Downloader & Search API avatar
Epstein Files Scraper, Downloader & Search API

Pricing

from $1.00 / 1,000 results

Go to Apify Store
Epstein Files Scraper, Downloader & Search API

Epstein Files Scraper, Downloader & Search API

Fast search, extract, and structure Epstein files with keyword-based discovery, automatic PDF text parsing, and AI-ready output.

Pricing

from $1.00 / 1,000 results

Rating

5.0

(1)

Developer

Lofomachines

Lofomachines

Maintained by Community

Actor stats

0

Bookmarked

6

Total users

2

Monthly active users

3 days ago

Last modified

Share

Turn millions of public Epstein-related court documents into structured, searchable, AI-ready data.

This Apify Actor is built for high-intent use cases like:

  • Epstein files scraper
  • Epstein files download tool
  • Epstein documents searchable database
  • Epstein files API workflows
  • Bulk download and analysis pipelines

Why This Actor

Search demand for Epstein files surged dramatically and remains high during every new release cycle. Most existing tools are fragmented, technical, or hard to automate.

This Actor gives you one practical workflow to:

  • discover relevant files by keyword
  • extract text from PDFs automatically
  • output clean structured records for investigation, reporting, and analysis

What You Get

  • Multi-keyword search in one run
  • Per-keyword result limits (0 means unlimited per keyword)
  • Direct + proxy fallback strategy for better reliability
  • Automatic PDF parsing for text extraction
  • Structured dataset output ready for BI, AI, and automation tools

Built For

  • Journalists and investigative teams
  • OSINT and digital forensics analysts
  • Legal research professionals
  • Policy and transparency researchers
  • AI builders who need clean document pipelines

High-Value Use Cases

n8n / automation workflows

Use this Actor as the first step in a no-code pipeline:

  1. Trigger Actor run (scheduled or event-based)
  2. Read dataset items
  3. Route outputs to Airtable, Notion, Google Sheets, Slack, or webhooks
  4. Build recurring monitoring for new keyword matches

AI document intelligence

Feed extractedText into LLM workflows to:

  • summarize large document batches
  • detect entities (people, organizations, locations)
  • cluster themes across releases
  • generate evidence timelines

Forensic & OSINT triage

Prioritize relevant files quickly by:

  • keyword targeting
  • text extraction previews
  • structured metadata filtering
  • downstream enrichment and cross-referencing

Input Example

{
"keywords": [
"dentist",
"table"
],
"maxItems": 20,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": []
}
}

maxItems behavior:

  • 20 = up to 20 results for each keyword
  • 0 = unlimited results for each keyword

Output Example

{
"keyword": "epstein flight logs",
"page": 1,
"documentId": "doc_123",
"chunkIndex": 0,
"originFileName": "EFTA01638670.pdf",
"originFileUri": "https://www.justice.gov/epstein/files/DataSet%2010/EFTA01638670.pdf",
"sourceContentType": "application/pdf",
"extractedText": "This is a parsed text preview from the original PDF...",
"highlight": [
"...keyword match snippet..."
],
"processedAt": "2026-01-01T10:00:00Z",
"indexedAt": "2026-01-01T10:05:00Z"
}

Keywords

  • epstein files scraper
  • epstein files downloader tool
  • epstein files API
  • download epstein files pdf
  • bulk download epstein documents
  • epstein files AI analysis
  • epstein documents searchable database

Quick Start

  1. Open the Actor on Apify
  2. Enter one or more keywords
  3. Set max results per keyword
  4. Run and use the dataset output in your workflow

Data Source Note

This Actor is designed to process publicly accessible document sources and produce structured output for analysis workflows.