
PDF Scraper
- onidivo/pdf-scraper
- Created by Onidivo Technologies
Scrape and extract PDF text from a PDF link.
Input
You can provide input either through the editor on the Apify platform or as a JSON object.
The only mandatory field is the PDF URL (pdfUrl).
An example of minimal input:
{ "pdfUrl": "http://www.pdf995.com/samples/pdf.pdf" }
We recommend enabling proxies to avoid blocking and detection, like this:
{ "pdfUrl": "http://www.pdf995.com/samples/pdf.pdf", "proxyConfiguration": { "useApifyProxy": true } }
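If you assemble the input in code rather than in the editor, a small helper can keep the JSON consistent. The following is only a sketch in Python: the function name build_input and its URL check are illustrative assumptions, not part of the actor. The only field names used are the ones shown above (pdfUrl, proxyConfiguration, useApifyProxy).

```python
import json
from urllib.parse import urlparse


def build_input(pdf_url: str, use_apify_proxy: bool = True) -> dict:
    """Build an input object for the actor (hypothetical helper).

    Only pdfUrl is mandatory; the proxy block is added on request.
    """
    parsed = urlparse(pdf_url)
    # Assumption: the actor expects an absolute http(s) link to the PDF.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"pdfUrl must be an absolute http(s) URL, got {pdf_url!r}")

    run_input = {"pdfUrl": pdf_url}
    if use_apify_proxy:
        run_input["proxyConfiguration"] = {"useApifyProxy": True}
    return run_input


if __name__ == "__main__":
    # Prints the recommended input (with proxy) as JSON.
    print(json.dumps(build_input("http://www.pdf995.com/samples/pdf.pdf"), indent=2))
```

Calling build_input(url, use_apify_proxy=False) yields the minimal input with pdfUrl only.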
Output
The scraped text is saved to the key-value store in the record named "OUTPUT", and it looks like this:
{ "The pdf995 suite of products - Pdf995, PdfEdit995, and Signature995 - is a complete solution for your document publishing needs. It\nprovides ease of use, flexibility in format, and industry-standard security- and all at no cost to you...", ... }
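To read the result programmatically, one option is the apify-client package for Python. The sketch below assumes that package is installed and that you have an API token. The helper names fetch_output and join_wrapped_lines are hypothetical. The exact structure of the OUTPUT value is not specified here, so the code returns it as-is and only shows how the "\n" line breaks visible in the sample text could be collapsed.

```python
def fetch_output(token: str, run_input: dict):
    """Run onidivo/pdf-scraper and return the value stored under "OUTPUT".

    Hypothetical helper; assumes the apify-client package is installed.
    """
    # Imported lazily so join_wrapped_lines works without the package.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(token)
    run = client.actor("onidivo/pdf-scraper").call(run_input=run_input)
    if run is None:
        raise RuntimeError("The actor run did not return run details")

    # The scraped text is saved in the run's default key-value store.
    store = client.key_value_store(run["defaultKeyValueStoreId"])
    record = store.get_record("OUTPUT")
    return None if record is None else record["value"]


def join_wrapped_lines(text: str) -> str:
    """Collapse line breaks and repeated whitespace into single spaces."""
    return " ".join(text.split())
```

For example, join_wrapped_lines can be applied to each extracted string before further processing, since the PDF's original line wrapping is preserved in the scraped text.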