PDF Text Extraction From URLs
Created by
Stas Persiianenko
Actor
PDF Text Extractor
Extract text and metadata from direct PDF URLs with configurable concurrency, timeout, and per-page text output.
PDF Text Extractorautomation-lab/pdf-text-extractor
URL
File Name
Pages
Title
+8 fieldsTextNumberBooleanListObject
Input
📄 PDF URLs(required):https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf+1
⚡ Max concurrency:3
⏱️ Timeout per PDF (seconds):60
📑 Include per-page text:true
Output fields
URL
File Name
Pages
Title
Author
Subject
Creator
Created
Modified
Size (bytes)
PDF Version
Full Text
Sign up on Apify01
Create your Apify account to access the PDF Text Extractor.
Start the run02
The Actor will start running based on the input automatically.
Receive the output03
Monitor the progress in real-time. You will be notified as soon as your dataset is complete and ready for review.
Integrate into your workflow04
The final output is delivered in JSON, CSV, or Excel format, ready to be plugged into your workflow.

