PDF Text Extractor avatar
PDF Text Extractor
Try for free

No credit card required

View all Actors
PDF Text Extractor

PDF Text Extractor

jirimoravcik/pdf-text-extractor
Try for free

No credit card required

PDF Text Extractor allows you to extract text from PDF files. It also supports chunking of the text to prepare the data for usage with large language models.

The code examples below show how to run the Actor and get its results. To run the code, you need to have an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token, which you can find under Settings > Integrations in Apify Console. Learn more

1# Set API token
2API_TOKEN=<YOUR_API_TOKEN>
3
4# Prepare Actor input
5cat > input.json <<'EOF'
6{
7  "urls": [
8    "https://arxiv.org/pdf/2307.12856"
9  ]
10}
11EOF
12
13# Run the Actor using an HTTP API
14# See the full API reference at https://docs.apify.com/api/v2
15curl "https://api.apify.com/v2/acts/jirimoravcik~pdf-text-extractor/runs?token=$API_TOKEN" \
16  -X POST \
17  -d @input.json \
18  -H 'Content-Type: application/json'
Developer
Maintained by Community
Actor metrics
  • 43 monthly users
  • 9 stars
  • 99.5% runs succeeded
  • Created in Oct 2023
  • Modified 3 months ago