PDF Scraper
1 day trial then $20.00/month - No credit card required now
Scrape and extract text from PDF links.
https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf
The PDF URL above contains special characters (percent-encoded spaces, %20) that are outside our control, and the run failed with the following error:
ERROR FRH:Pdf: Request failed completely: https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf, error: (string key) The "key" argument must be at most 256 characters long and only contain the following characters: a-zA-Z0-9!-.'()
2024-04-02T18:32:09.727Z WARN BasicCrawler: Reclaiming failed request back to the list or queue. (string key) The "key" argument must be at most 256 characters long and only contain the following characters: a-zA-Z0-9!-.'()
2024-04-02T18:32:09.729Z at async Module.handlePdf (file:///home/myuser/dist/routes.js:58:9) {"url":"https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf","retryCount":1}
However, the Actor run was not marked as Failed, even though the PDF was never saved to the dataset.
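For illustration only, here is a minimal sketch of one way a URL could be turned into a key that satisfies the constraint quoted in the error message. It is not the Actor's actual code or the fix that was later applied. The helper name toSafeKey and the constant MAX_KEY_LENGTH are hypothetical, and the allowed character set is copied from the error text above, so it is an assumption about what the storage layer really accepts.

```typescript
// Hypothetical helper, not taken from the Actor's source.
// Assumption: the allowed characters are exactly those quoted in the error
// message (letters, digits, and ! - . ' ( )), with a 256-character limit.
const MAX_KEY_LENGTH = 256;

function toSafeKey(url: string): string {
    // Replace every character outside the allowed set with a hyphen.
    const cleaned = url.replace(/[^a-zA-Z0-9!\-.'()]/g, "-");
    // Keep the last 256 characters, since the file name sits at the end of the URL.
    return cleaned.slice(-MAX_KEY_LENGTH);
}

const exampleKey = toSafeKey(
    "https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf",
);
console.log(exampleKey);
// prints "https---globalprivacycontrol.org-Implementing-20GPC-20for-20Publishers.pdf"
```

Two different URLs can map to the same key under this scheme (for example, URLs that differ only in a replaced character), so a real implementation would probably also append a short hash of the original URL.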
Hi, thanks for reporting this. We will look into it and get back to you.
The problem is fixed. We are looking forward to your feedback.
Actor Metrics
21 monthly users
4 stars
98% runs succeeded
Created in Apr 2023
Modified 8 months ago