PDF Text Extractor

jirimoravcik/pdf-text-extractor

PDF Text Extractor lets you extract text from PDF files. It also supports chunking the text to prepare the data for use with large language models.

maxpagecrawl

Closed
authentic_nightfall opened this issue
a month ago

I have maxpagecrawl set to 1, but it still scrapes all pages. Is there a way to have the task scrape only the first page?

jirimoravcik

Hello, where did you set the maxpagecrawl parameter? AFAIK this Actor does not support it. Thank you

authentic_nightfall

a month ago

Is there any way to instruct the Actor to scrape only the first page, or only the first x characters?

jirimoravcik

Not currently, but I can implement it. Given how the PDF parsing logic works, though, the whole file still has to be parsed anyway.
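
Until such an option exists, one possible workaround is to trim the extracted text on the client side after the run finishes. The sketch below is only an illustration and is not part of the Actor: it assumes the extracted text is already available as a list of per-page strings, and the names limit_pages, limit_chars, and the sample pages list are hypothetical.

```python
def limit_pages(pages: list[str], max_pages: int) -> list[str]:
    """Return only the first max_pages page texts."""
    if max_pages < 0:
        raise ValueError("max_pages must be non-negative")
    return pages[:max_pages]


def limit_chars(text: str, max_chars: int) -> str:
    """Return only the first max_chars characters of text."""
    if max_chars < 0:
        raise ValueError("max_chars must be non-negative")
    return text[:max_chars]


# Hypothetical per-page text standing in for the extractor's output;
# the real output format of the Actor may differ.
pages = ["First page text.", "Second page text.", "Third page text."]

first_page_only = limit_pages(pages, 1)
preview = limit_chars(" ".join(pages), 10)
```

This does not reduce the parsing work, since the whole PDF is still processed; it only limits how much text is kept afterwards.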

Developer
Maintained by Community

Actor Metrics

  • 47 monthly users

  • 24 bookmarks

  • >99% runs succeeded

  • 12 hours response time

  • Created in Oct 2023

  • Modified 5 months ago