PDF Scraper avatar
PDF Scraper

Pricing

$20.00/month + usage

Go to Store
PDF Scraper

PDF Scraper

Developed by

Onidivo Technologies

Onidivo Technologies

Maintained by Community

Scrape and extract text from PDF links.

0.0 (0)

Pricing

$20.00/month + usage

7

Total users

350

Monthly users

36

Runs succeeded

99%

Last modified

4 months ago

RK

Failed to download PDF if the PDF name in the URL has special chars

Closed

rajasekar_krishnan opened this issue
a year ago

https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf

In the above PDF url, the URL has special char which is not in our control and it failed with the following error

ERROR FRH:Pdf: Request failed completely: https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf, error: (string key) The "key" argument must be at most 256 characters long and only contain the following characters: a-zA-Z0-9!-.'() 2024-04-02T18:32:09.727Z WARN BasicCrawler: Reclaiming failed request back to the list or queue. (string key) The "key" argument must be at most 256 characters long and only contain the following characters: a-zA-Z0-9!-.'() 2024-04-02T18:32:09.729Z at async Module.handlePdf (file:///home/myuser/dist/routes.js:58:9) {"url":"https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf","retryCount":1}

But the actor was not marked as Failed even though the PDF was not downloaded to Dataset.

onidivo avatar

Hi, thanks for reporting that. We will look into that and get back to you.

onidivo avatar

The problem is fixed. We are looking forward to your feedback.