
PDF Scraper
Pricing
$20.00/month + usage

PDF Scraper
Scrape and extract text from PDF links.
0.0 (0)
Pricing
$20.00/month + usage
7
Total users
350
Monthly users
36
Runs succeeded
99%
Last modified
4 months ago
Failed to download PDF if the PDF name in the URL has special chars
Closed
https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf
In the above PDF url, the URL has special char which is not in our control and it failed with the following error
ERROR FRH:Pdf: Request failed completely: https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf, error: (string key
) The "key" argument must be at most 256 characters long and only contain the following characters: a-zA-Z0-9!-.'()
2024-04-02T18:32:09.727Z WARN BasicCrawler: Reclaiming failed request back to the list or queue. (string key
) The "key" argument must be at most 256 characters long and only contain the following characters: a-zA-Z0-9!-.'()
2024-04-02T18:32:09.729Z at async Module.handlePdf (file:///home/myuser/dist/routes.js:58:9) {"url":"https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf","retryCount":1}
But the actor was not marked as Failed even though the PDF was not downloaded to Dataset.

Onidivo Technologies (onidivo)
Hi, thanks for reporting that. We will look into that and get back to you.

Onidivo Technologies (onidivo)
The problem is fixed. We are looking forward to your feedback.