PDF Extractor 2.0 avatar
PDF Extractor 2.0

Pricing

$30.00/month + usage

Go to Store
PDF Extractor 2.0

PDF Extractor 2.0

Developed by

cat

cat

Maintained by Community

💫 Extract PDF Document Contents including Metadata, Images, Pages, Tables, Attachments, etc.

0.0 (0)

Pricing

$30.00/month + usage

4

Total users

75

Monthly users

11

Runs succeeded

>99%

Last modified

6 months ago

🔑 Password

passwordstringOptional

💡 Document Password

🕮 URL

urlarrayOptional

💡 Document URL Location

Content Format

contentEnumOptional

💡 Content Format

Value options:

"text": string"svg": string"jpeg": string"png": string

Extract Images

imagesbooleanOptional

💡 Extract Embedded Images

Extract Attachments

attachmentsbooleanOptional

💡 Extract Embedded Files

Extract Tables

tablesbooleanOptional

💡 Extract Tables

🌐 PROXY NETWORKING

dev_proxy_configobjectOptional

💡 Supported protocol:

HTTP(S), SOCKS5
{http|socks5}://{user:pass}@{hostname|ip-address}:port

Example: socks5://example.com:9000

📜 HTTP HEADERS

dev_custom_headersarrayOptional

💡 Additional HTTP Headers

🍰 HTTP COOKIES

dev_custom_cookiesarrayOptional

💡 Additional HTTP Cookies

♻️ CUSTOM FIELD

dev_transform_fieldsarrayOptional

💡 Transform the resulting output. Select only needed fields.

For nested object use DOT. For example:

address.streetAddress

For nested array use NUMBER (index of array element starting from index=0). For example:
images.0.url

📁 CUSTOM STORAGE

dev_dataset_namestringOptional

💡 Save results into custom named Dataset, use mask to customize dataset name

{ACTOR} = actor name
{DATE} = date (YYYYMMDD)
{TIME} = time (HHMMSS)


This masks can be used to autogenerate Dataset Name.

example: data-{DATE}
Depending on today date the dataset name will be: data-20230603

default: data-{ACTOR}-{DATE}-{TIME}

Clear Storage

dev_dataset_clearbooleanOptional

Clear Dataset before insert/update.

Disable data cleansing

dev_no_stripbooleanOptional

💡 Keep/Save empty values (NULL, FALSE, empty ARRAY, empty OBJECT, empty STRING)