Document OCR

Developed by

Josef Válek

Maintained by Community

0.0 (0)

Pricing

Pay per event

0

Monthly users

4

Runs succeeded

59%

Last modified

23 days ago

Due to underlying costs, the Actor is only available to users with an Apify subscription.

The Actor takes a list of document URLs (PDFs or images) and a language, runs the documents through OCR, and stores the results in a dataset.

The Actor uses the OCR Space service to perform the OCR.

Pricing

The Actor uses the pay-per-event pricing model. There is a flat fee for starting the Actor, plus a fee for every successfully processed document page.

Documents that fail to be processed are not charged.

Event         Price
Actor start   $0.01 / start
OCR           $0.003 / page
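
To make the pricing concrete, here is a minimal sketch that estimates the charge for one run from the two prices listed above. The helper name `estimateRunCost` and the constants are hypothetical and not part of the Actor. The sketch assumes one start fee per run and counts only successfully processed pages.

```typescript
// Prices from the pricing table, expressed in thousandths of a dollar
// so the arithmetic stays in integers until the final division.
const ACTOR_START_FEE_MILLI_USD = 10; // $0.01 per start
const OCR_PAGE_FEE_MILLI_USD = 3;     // $0.003 per successfully processed page

// Hypothetical helper: estimated charge in USD for a single run.
// Pages of documents that failed to process are not billed, so they
// must not be included in `successfulPages`.
function estimateRunCost(successfulPages: number): number {
    if (!Number.isInteger(successfulPages) || successfulPages < 0) {
        throw new RangeError("successfulPages must be a non-negative integer");
    }
    const totalMilliUsd =
        ACTOR_START_FEE_MILLI_USD + OCR_PAGE_FEE_MILLI_USD * successfulPages;
    return totalMilliUsd / 1000;
}
```

For example, a run that successfully processes 100 pages would cost $0.01 + 100 × $0.003 = $0.31 under these assumptions.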

Output

Each processed URL is stored in the default dataset in the following shape:

{
    text: string, // The full text extracted from the document.
    language: string, // Language of the document.
    url: string, // Original URL.
    raw: object, // Raw data from the underlying OCR service. May change in the future.
}

If processing ended with an error, the shape is different:

{
    url: string, // Original URL.
    error: string, // Error message.
}
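
Because the dataset can mix both shapes, a consumer needs to tell them apart. The following is a sketch of one way to do that in TypeScript. The names `OcrSuccess`, `OcrFailure`, `isOcrFailure`, and `splitResults` are hypothetical, and the sketch assumes that only failed items carry an `error` field.

```typescript
// Item written for a successfully processed document.
interface OcrSuccess {
    text: string;
    language: string;
    url: string;
    raw: object; // Raw OCR service data; may change, so avoid depending on it.
}

// Item written when processing a document ended with an error.
interface OcrFailure {
    url: string;
    error: string;
}

type OcrItem = OcrSuccess | OcrFailure;

// Type guard. Assumption: only failed items contain an `error` field.
function isOcrFailure(item: OcrItem): item is OcrFailure {
    return "error" in item;
}

// Splits dataset items into extracted texts and error messages, keyed by URL.
function splitResults(items: OcrItem[]): {
    texts: Map<string, string>;
    errors: Map<string, string>;
} {
    const texts = new Map<string, string>();
    const errors = new Map<string, string>();
    for (const item of items) {
        if (isOcrFailure(item)) {
            errors.set(item.url, item.error);
        } else {
            texts.set(item.url, item.text);
        }
    }
    return { texts, errors };
}
```

Keying both maps by `url` makes it straightforward to retry only the documents that failed.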

Planned features

The Actor is a work in progress. Stay tuned for new features.

Integrability improvements

  • Process all images and PDF files in a provided key-value store.
  • Process all URLs from a given dataset.

Pricing

Pricing model

Pay per event 

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.

Event         Price    Description
Actor start   $0.010   Actor was started
OCR Page      $0.003   Page successfully processed with OCR