# PDF Report Text Extractor

**Use case:** 

Extract full-document and page-level text from specific PDF reports for downstream analysis, search indexing, or archiving.

## Input

```json
{
  "urls": [
    "https://www.apple.com/environment/pdf/Apple_Environmental_Progress_Report_2024.pdf"
  ],
  "maxConcurrency": 1,
  "timeoutPerPdfSecs": 120,
  "includePages": true
}
```

## Output

```json
{
  "url": {
    "label": "URL",
    "format": "link"
  },
  "fileName": {
    "label": "File Name",
    "format": "text"
  },
  "pageCount": {
    "label": "Pages",
    "format": "number"
  },
  "title": {
    "label": "Title",
    "format": "text"
  },
  "author": {
    "label": "Author",
    "format": "text"
  },
  "subject": {
    "label": "Subject",
    "format": "text"
  },
  "creator": {
    "label": "Creator",
    "format": "text"
  },
  "creationDate": {
    "label": "Created",
    "format": "text"
  },
  "modificationDate": {
    "label": "Modified",
    "format": "text"
  },
  "fileSizeBytes": {
    "label": "Size (bytes)",
    "format": "number"
  },
  "pdfVersion": {
    "label": "PDF Version",
    "format": "text"
  },
  "fullText": {
    "label": "Full Text",
    "format": "text"
  }
}
```

## About this Actor

This example demonstrates how to use [PDF Text Extractor](https://apify.com/automation-lab/pdf-text-extractor) with a specific input configuration. Visit the [Actor detail page](https://apify.com/automation-lab/pdf-text-extractor) to learn more, explore other use cases, and run it yourself.