Extractor from PDF URL
Pricing
from $0.01 / 1,000 results
Go to Apify Store
Under maintenance
Extractor from PDF URL
Extract text and tables from PDFs in a clear, readable format. Provides well-organized tables and cleans up messy spacing, making PDF content easy to view, copy, or share—directly from a PDF link.
Pricing
from $0.01 / 1,000 results
Rating
0.0
(0)
Developer

Muhammad Zain Abid
Maintained by Community
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
14 days ago
Last modified
Categories
Share
PDF Text & Table Extractor (JavaScript)
Extract readable text and clean tables from any PDF by simply providing a URL. This Actor downloads the PDF, parses the content, and formats messy spacing so the result is easy to view, copy, or process.
Included features
- PDF fetching from a provided URL
- Text extraction using
pdf-parse - Cleaning & formatting to improve table readability
- JSON output to Apify key-value store
Use cases
- Convert PDF content into clean text
- Make tables readable for reporting or copying
- Prepare data for further processing or automation
- Quickly inspect PDF content without opening the file
Input
Provide the following input JSON:
{"pdfUrl": "https://assets.accessible-digital-documents.com/uploads/2017/01/sample-tables.pdf"}
Output
The Actor returns:
- Full extracted text saved to:
KEY_VALUE_STORE/result - Short preview in the API response
Example:
{"status": "success","extracted": "Table 1\nColumn 1 Column 2 Column 3..."}
Getting started
- Build the Actor in Apify Console
- Supply a PDF URL as input
- Run and view extracted output in the key-value store
Local development (optional)
Use Apify CLI to pull and edit locally:
npm -g install apify-cliapify pull <ActorId>
Dependencies
- Apify SDK – actor environment & input/output handling
- node-fetch – PDF downloading
- pdf-parse – text extraction
