Extract text from PDF avatar

Extract text from PDF

Pricing

$2.50/month + usage

Go to Apify Store
Extract text from PDF

Extract text from PDF

Efficiently extract text content from PDF files, ideal for data processing, content analysis, and automation workflows. Supports various PDF structures and outputs clean, readable text.

Pricing

$2.50/month + usage

Rating

0.0

(0)

Developer

Akash Kumar Naik

Akash Kumar Naik

Maintained by Community

Actor stats

1

Bookmarked

48

Total users

4

Monthly active users

17 days ago

Last modified

Share

PDF Text Extractor

Extract text from PDF files with support for direct URLs, cloud storage links (Google Drive, Dropbox, OneDrive), and local files. This Apify Actor converts PDF documents to clean, structured text output.

Features

  • Multiple Source Support: Process PDFs from URLs, cloud storage share links, or local files
  • Cloud Storage Compatible: Works with Google Drive, Dropbox, and OneDrive share links
  • Configurable Processing: Set page limits and download timeouts
  • Structured Output: Returns JSON with extracted text and metadata
  • Pay-Per-Event Pricing: Only pay for successfully processed PDFs

Input

FieldTypeRequiredDefaultDescription
pdfUrlstringYes-URL to PDF file, cloud share link, or local file path (file:///path/to/file.pdf)
maxPagesintegerNo0Maximum pages to extract (0 = all pages)
timeoutintegerNo60Download timeout in seconds (10-300)

Output

The Actor outputs a JSON object with the following fields:

{
"originalPdfUrl": "https://example.com/document.pdf",
"processedPdfUrl": "https://example.com/document.pdf",
"extractedText": "Full text content from the PDF...",
"pageCount": 10,
"extractedPages": 10,
"timestamp": "2025-01-01T12:00:00.000Z"
}

Pricing

This Actor uses pay-per-event pricing:

EventPriceDescription
init$0.001Actor initialization
pdf-processed$0.01Per successfully processed PDF

Use Cases

  • Extract text from invoices, reports, and documents
  • Convert PDF archives to searchable text
  • Process academic papers and research documents
  • Extract content from legal documents and contracts

Example Usage

{
"pdfUrl": "https://drive.google.com/file/d/FILE_ID/view",
"maxPages": 5,
"timeout": 120
}

Support

For issues or questions, please contact the Actor developer through the Apify platform.