Pricing

$4.00 / 1,000 PDFs processed

PDF Toolkit — Extract Text, Metadata & Page Count

Extract text from PDFs, read metadata (title, author, dates), and count pages. Processes PDFs in bulk from URLs. $0.003 per PDF.

Developer

Manchitt Sanan

Last modified

2 months ago

Operations

Operation	What it returns
`extract-text`	Full text content + page count
`get-metadata`	Title, author, subject, creator, producer, creation/modification dates + page count
`page-count`	Number of pages only

Quick start

{
    "items": [
        {
            "url": "https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf",
            "operation": "extract-text"
        }
    ]
}

Input

Each item in the `items` array takes the following fields:

Field	Type	Required	Description
`url`	string	Yes	URL to a PDF file
`operation`	enum	Yes	`extract-text`, `get-metadata`, or `page-count`
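
As a rough illustration of the input shape described above, the following Python sketch builds an `items` payload for several URLs and rejects unknown operation names. The helper `build_input` is hypothetical and not part of the actor; only the `items`, `url`, and `operation` fields and the three operation names come from this page.

```python
import json

# The three operation names listed in the Operations table.
VALID_OPERATIONS = {"extract-text", "get-metadata", "page-count"}


def build_input(urls, operation="extract-text"):
    """Return an input dict with one item per URL (hypothetical helper)."""
    if operation not in VALID_OPERATIONS:
        raise ValueError(f"unknown operation: {operation!r}")
    return {"items": [{"url": url, "operation": operation} for url in urls]}


if __name__ == "__main__":
    payload = build_input(
        ["https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf"],
        operation="page-count",
    )
    # Print the payload in the same JSON layout as the Quick start example.
    print(json.dumps(payload, indent=4))
```

Validating the operation name locally is a design choice in this sketch: it catches typos before a run is started.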

Output

{
    "url": "https://example.com/document.pdf",
    "operation": "extract-text",
    "text": "Full extracted text content...",
    "pageCount": 12,
    "fileSize": 245760,
    "status": "success",
    "error": null
}
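
A minimal sketch of how a consumer might separate successful records from failed ones, assuming each output record carries the `status` and `error` fields shown above. The shape of a failed record is not documented here, so the `"failed"` status value and the error message in the sample are assumptions, and `split_results` is a hypothetical helper.

```python
def split_results(records):
    """Split output records into (succeeded, failed) using the `status` field."""
    succeeded, failed = [], []
    for record in records:
        if record.get("status") == "success":
            succeeded.append(record)
        else:
            failed.append(record)
    return succeeded, failed


if __name__ == "__main__":
    sample = [
        {
            "url": "https://example.com/document.pdf",
            "operation": "extract-text",
            "text": "Full extracted text content...",
            "pageCount": 12,
            "fileSize": 245760,
            "status": "success",
            "error": None,
        },
        # Hypothetical failure record: status value and message are assumptions.
        {
            "url": "https://example.com/missing.pdf",
            "operation": "extract-text",
            "status": "failed",
            "error": "HTTP 404",
        },
    ]
    ok, bad = split_results(sample)
    for record in ok:
        print(record["url"], record["pageCount"], "pages")
    for record in bad:
        print(record["url"], "failed:", record.get("error"))
```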

Pricing

$0.003 per PDF processed (pay-per-event pricing).

Errors and dry runs are never charged.
100 PDFs = $0.30
1,000 PDFs = $3.00
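
The arithmetic above can be sketched in Python as follows. The function `estimate_cost` is a hypothetical helper; it takes the number of successfully processed PDFs as its argument, on the assumption (from the line above) that errors are not charged. The rate is a parameter so it can be adjusted if the billed price differs.

```python
from decimal import Decimal

# Per-PDF rate stated in this section; Decimal avoids float rounding noise.
PRICE_PER_PDF = Decimal("0.003")


def estimate_cost(successful_pdfs, price_per_pdf=PRICE_PER_PDF):
    """Estimate the charge in USD for a number of successfully processed PDFs."""
    if successful_pdfs < 0:
        raise ValueError("successful_pdfs must be non-negative")
    return price_per_pdf * successful_pdfs


if __name__ == "__main__":
    for count in (100, 1000):
        print(f"{count} PDFs = ${estimate_cost(count):.2f}")
```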

Limitations

Text extraction only — no OCR. Scanned PDFs (images of text) will return empty or minimal text.
Maximum file size depends on the Apify memory allocation. The default of 256 MB handles most PDFs.
No PDF generation — this actor reads PDFs, doesn't create them. Use Apify's official HTML-to-PDF actor for generation.
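
Because scanned PDFs come back with empty or minimal text, a caller may want to flag them for OCR elsewhere. The sketch below is one possible heuristic, not something the actor provides: it compares the length of `text` with `pageCount` from an `extract-text` record. The helper `looks_scanned` and the threshold of 20 characters per page are assumptions chosen for illustration.

```python
def looks_scanned(record, min_chars_per_page=20):
    """Guess whether a record came from a scanned PDF (heuristic, hypothetical).

    Returns True when the extracted text averages fewer than
    `min_chars_per_page` characters per page.
    """
    text = (record.get("text") or "").strip()
    pages = record.get("pageCount") or 0
    if pages == 0:
        # Without a page count there is nothing to average over.
        return False
    return len(text) / pages < min_chars_per_page


if __name__ == "__main__":
    record = {"text": "", "pageCount": 12, "status": "success"}
    if looks_scanned(record):
        print("Little or no text extracted; this PDF may need OCR.")
```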

Other tools by accurate_pouch for content + asset processing:

QR Code Toolkit — Generate + decode, custom colors, logos, SVG/PNG/base64. $0.004/QR.
TheCrawler — Web scraper + LLM-powered structured extraction, includes PDF + DOCX. AGPL-3.0, also on npm (thecrawler@0.1.1). $0.005/page.
Google Sheets R/W — Read, append, replace, modify, backup. $0.004/op.
Broken Link Checker — Recursive crawl, sitemap + robots.txt parsing, webhook, Sheets export. $0.005/page.

Run on Apify

No setup needed. Click above to run in the cloud. $0.003 per operation.
