PDF Scraper

1 day trial then $20.00/month - No credit card required now

onidivo/pdf-scraper
Scrape and extract text from PDF links.

Failed to download PDF if the PDF name in the URL has special chars

Closed

rajasekar_krishnan opened this issue
8 months ago

https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf

In the PDF URL above, the file name contains special characters (percent-encoded spaces, %20) that are outside our control, and the request failed with the following error:

ERROR FRH:Pdf: Request failed completely: https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf, error: (string key) The "key" argument must be at most 256 characters long and only contain the following characters: a-zA-Z0-9!-.'()
2024-04-02T18:32:09.727Z WARN BasicCrawler: Reclaiming failed request back to the list or queue. (string key) The "key" argument must be at most 256 characters long and only contain the following characters: a-zA-Z0-9!-.'()
2024-04-02T18:32:09.729Z at async Module.handlePdf (file:///home/myuser/dist/routes.js:58:9) {"url":"https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf","retryCount":1}

However, the Actor run was not marked as Failed, even though the PDF was never saved to the dataset.
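
For anyone hitting the same error: the message suggests the storage key is derived from the file name in the URL, so a name with spaces falls outside the permitted character set. Below is a minimal sketch of one possible workaround, assuming the key is built from the last path segment. The helper name toSafeKey and the "unnamed.pdf" fallback are hypothetical, and this is not necessarily how the Actor was actually fixed. The character set is taken as quoted in the error message above.

```typescript
// Hypothetical helper (not the Actor's actual fix): derive a storage key
// from a PDF URL that satisfies the constraints quoted in the error message.
const MAX_KEY_LENGTH = 256;

// Matches any character outside the set quoted in the error: a-zA-Z0-9!-.'()
const DISALLOWED = /[^a-zA-Z0-9!\-.'()]/g;

function toSafeKey(pdfUrl: string): string {
  // Take the last path segment, e.g. "Implementing%20GPC%20for%20Publishers.pdf".
  const { pathname } = new URL(pdfUrl);
  const lastSegment = pathname.split("/").pop() ?? "";

  // Decode percent-escapes so "%20" becomes a space before sanitizing.
  let name: string;
  try {
    name = decodeURIComponent(lastSegment);
  } catch {
    // Malformed escape sequence: fall back to the raw segment.
    name = lastSegment;
  }

  // Replace every disallowed character with "-" and enforce the length limit.
  const key = name.replace(DISALLOWED, "-").slice(0, MAX_KEY_LENGTH);

  // Assumed fallback for URLs with no file name in the path.
  return key.length > 0 ? key : "unnamed.pdf";
}

console.log(
  toSafeKey("https://globalprivacycontrol.org/Implementing%20GPC%20for%20Publishers.pdf"),
);
```

Note that two different file names can map to the same key after sanitizing, so a real implementation would likely need to append a hash or counter to keep keys unique.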

onidivo

Hi, thanks for reporting that. We will look into that and get back to you.

onidivo

The problem is fixed. We are looking forward to your feedback.
