Scribd Scraper avatar
Scribd Scraper

Pricing

$5.00 / 1,000 results

Go to Store
Scribd Scraper

Scribd Scraper

Developed by

cat

cat

Maintained by Community

💫 Search Scribd.com Documents

0.0 (0)

Pricing

$5.00 / 1,000 results

0

Total users

5

Monthly users

5

Runs succeeded

>99%

Last modified

6 days ago

❓ Query

queryarrayOptional

💡 Search Terms

♾️ Limit

limitintegerOptional

💡 Number of results

Category

filters:categoryarrayOptional

Length

filters:num_pagesEnumOptional

Value options:

"1-3": string"4-100": string"100+": string

File Type

filters:file_typearrayOptional

Date Upload

filters:date_uploadEnumOptional

Value options:

"1week": string"1month": string"3month": string"6month": string"1year": string

Language

filters:languagearrayOptional

🌐 PROXY NETWORKING

dev_proxy_configobjectOptional

💡 Supported protocol:

HTTP(S), SOCKS5
{http|socks5}://{user:pass}@{hostname|ip-address}:port

Example: socks5://example.com:9000

📜 HTTP HEADERS

dev_custom_headersarrayOptional

💡 Additional HTTP Headers

🍰 HTTP COOKIES

dev_custom_cookiesarrayOptional

💡 Additional HTTP Cookies

♻️ CUSTOM FIELD

dev_transform_fieldsarrayOptional

💡 Transform the resulting output. Select only needed fields.

For nested object use DOT. For example:

address.streetAddress

For nested array use NUMBER (index of array element starting from index=0). For example:
images.0.url

📁 CUSTOM STORAGE

dev_dataset_namestringOptional

💡 Save results into custom named Dataset, use mask to customize dataset name

{ACTOR} = actor name
{DATE} = date (YYYYMMDD)
{TIME} = time (HHMMSS)


This masks can be used to autogenerate Dataset Name.

example: data-{DATE}
Depending on today date the dataset name will be: data-20230603

default: data-{ACTOR}-{DATE}-{TIME}

Clear Storage

dev_dataset_clearbooleanOptional

Clear Dataset before insert/update.

Disable data cleansing

dev_no_stripbooleanOptional

💡 Keep/Save empty values (NULL, FALSE, empty ARRAY, empty OBJECT, empty STRING)