Pricing

from $3.49 / 1,000 image ocrs

Google Lens OCR API - Image to Text Under 500ms REST API

Extract text from any image via Google Lens OCR API. Under 500ms per image, no browser needed. Returns word-level bounding boxes with pixel coordinates, detected language, and structured paragraphs/lines/words. Batch and HTTP API modes.

Pricing

from $3.49 / 1,000 image ocrs

Rating

0.0

(0)

Developer

Zen Studio

Actor stats

Bookmarked

340

Total users

Monthly active users

8 days ago

Last modified

Google Lens OCR - Extract Text from Images with Bounding Boxes

Extract text from any image with word-level bounding boxes and pixel coordinates. Returns detected language, full text, and structured paragraph/line/word data in a single request.

Sub-second processing per image, no browser required
Word-level bounding boxes with both normalized and pixel coordinates
Supports JPEG, PNG, WebP, BMP, TIFF, and HEIC formats
Standby mode for low-latency HTTP API access

Copy to your AI assistant

Copy this block into ChatGPT, Claude, Cursor, or any LLM to start using this actor.

zen-studio/google-lens-ocr on Apify. Call: ApifyClient("TOKEN").actor("zen-studio/google-lens-ocr").call(run_input={...}), then client.dataset(run["defaultDatasetId"]).list_items().items for results. Also supports Standby mode (REST API): GET https://google-lens-ocr.apify.actor/ocr?imageUrl=...&token=TOKEN. Key inputs: imageUrl: string (required), outputDetail: string (full|paragraphs|lines|words|text_only), language: string. Full actor spec (input schema with all params/enums/defaults, output dataset fields, README): GET https://api.apify.com/v2/acts/zen-studio~google-lens-ocr/builds/default (Bearer TOKEN) → inputSchema, actorDefinition.storages.dataset, readme. Pricing: $0.00499/image OCR, $0.002/actor start, $0.00001/dataset item. Get token: https://console.apify.com/account/integrations

Output Example

Pricing -- Pay Per Event

Event	Free	Starter	Scale	Business
Image OCR (/1,000)	$4.99	$4.49	$3.99	$3.49
Result (/1,000)	$0.01	$0.01	$0.01	$0.01
Actor start (one-time)	$0.002	$0.002	$0.002	$0.002

Apify plan subscribers get automatic volume discounts.

Free trial: 5 runs, no credit card required.

How Do You Want to Use It?

Two modes, same deployment. Pick what fits your workflow.

Option 1: REST API (Standby Mode)

Best for: developers, automation scripts, no-code tools (n8n, Make, Zapier)

Never heard of Standby mode? It keeps the Actor running as a persistent HTTP server. No cold starts, no waiting for builds. You send a request, you get a response in under 500ms. Think of it as a regular API endpoint that happens to run on Apify.

Authentication

All API requests require authentication. Get your token from the Apify Console under Settings > Integrations.

Two ways to authenticate:

Bearer header (recommended): -H "Authorization: Bearer YOUR_APIFY_TOKEN"
Query parameter (convenient for testing and no-code tools): ?token=YOUR_APIFY_TOKEN

Endpoints

Method	Endpoint	Description
`GET`	`/ocr`	OCR via query parameters
`POST`	`/ocr`	OCR via JSON body (supports `imageUrl` and `imageBase64`)
`GET`	`/health`	Returns `{"status": "ok"}`

The base URL for your requests:

https://google-lens-ocr.apify.actor

Examples

curl -- OCR an image by URL

curl "https://google-lens-ocr.apify.actor/ocr?imageUrl=https://example.com/document.png&outputDetail=lines" \
  -H "Authorization: Bearer YOUR_APIFY_TOKEN"

curl -- OCR a local file via base64

base64 -i photo.jpg | curl -X POST "https://google-lens-ocr.apify.actor/ocr" \
  -H "Authorization: Bearer YOUR_APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d "$(jq -n --arg img "$(cat -)" '{imageBase64: $img, outputDetail: "full"}')"

curl -- POST with image URL

curl -X POST "https://google-lens-ocr.apify.actor/ocr" \
  -H "Authorization: Bearer YOUR_APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "imageUrl": "https://example.com/document.png",
    "outputDetail": "lines",
    "translateTo": "en"
  }'

Python -- URL and base64

import requests
import base64

TOKEN = "your_apify_token"
BASE = "https://google-lens-ocr.apify.actor"
HEADERS = {"Authorization": f"Bearer {TOKEN}"}

# From URL
resp = requests.get(f"{BASE}/ocr", headers=HEADERS, params={
    "imageUrl": "https://example.com/document.png",
    "outputDetail": "lines"
})
print(resp.json()["fullText"])

# From local file (base64)
with open("photo.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

resp = requests.post(f"{BASE}/ocr", headers=HEADERS, json={
    "imageBase64": image_b64,
    "outputDetail": "full"
})
for line in resp.json()["lines"]:
    print(line["text"])

n8n / Make / Zapier

Use an HTTP Request node pointed at https://google-lens-ocr.apify.actor/ocr with your Bearer token in the Authorization header. Pass imageUrl as a query parameter (GET) or in a JSON body (POST).

Option 2: Batch Processing

Best for: one-off image processing, scheduled jobs, results saved to a dataset

The simplest way: paste an image URL in the Apify Console, click Start, and download results from the dataset.

Quick Start Input

Minimal

{
  "imageUrl": "https://example.com/document.png"
}

With options

{
  "imageUrl": "https://example.com/page1.png",
  "outputDetail": "lines",
  "language": "de",
  "region": "DE"
}

Text only (smallest output)

{
  "imageUrl": "https://example.com/receipt.jpg",
  "outputDetail": "text_only"
}

Using the Apify API Client

Python

from apify_client import ApifyClient

client = ApifyClient("your_token")
run = client.actor("zen-studio/google-lens-ocr").call(run_input={
    "imageUrl": "https://example.com/receipt.jpg",
    "outputDetail": "lines"
})

for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(f"Language: {item['language']}")
    print(f"Text: {item['fullText']}")
    for line in item.get("lines", []):
        box = line["boundingBox"]["pixelCoords"]
        print(f"  Line: {line['text']} at ({box['x']}, {box['y']})")

JavaScript

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'your_token' });
const run = await client.actor('zen-studio/google-lens-ocr').call({
    imageUrl: 'https://example.com/receipt.jpg',
    outputDetail: 'full'
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
for (const item of items) {
    console.log(`Language: ${item.language}`);
    console.log(`Text: ${item.fullText}`);
}

Scheduled jobs: Create a Schedule in the Apify Console to run the Actor on a recurring basis.

Getting Your Apify API Token

Go to the Apify Console
Navigate to Settings > Integrations
Copy your Personal API token

Use it as a Bearer token for Standby mode, or pass it to the ApifyClient constructor for batch mode.

Input Parameters

Parameter	Type	Description	Default
`imageUrl`	string	Image URL to extract text from	required
`outputDetail`	string	Level of detail: `full`, `paragraphs`, `lines`, `words`, `text_only`	`full`
`language`	string	Language hint (ISO 639-1 code)	`en`
`region`	string	Region hint (ISO 3166-1 alpha-2)	`US`
`translateTo`	string	Translate text to this language (ISO 639-1 code)	--

Output Detail Levels

full -- paragraphs with nested lines and words, all with bounding boxes
paragraphs -- paragraph text and bounding boxes
lines -- line text and bounding boxes
words -- lines with individual word-level bounding boxes
text_only -- just the extracted text, no coordinate data

Output

Each run produces one result with these fields:

Field	Type	Description
`imageUrl`	string	Source image URL
`language`	string	Detected language code
`fullText`	string	All extracted text joined
`lines`	array	Lines with text and bounding boxes (when detail >= `lines`)
`paragraphs`	array	Paragraphs with nested lines/words (when detail >= `paragraphs`)
`error`	string	Error message if processing failed

Bounding Box Format

Every text element includes a bounding box with normalized coordinates (0-1) and pixel coordinates:

{
  "centerX": 0.4457,
  "centerY": 0.1070,
  "width": 0.8512,
  "height": 0.1152,
  "rotation": 0.0032,
  "pixelCoords": {
    "x": 30,
    "y": 33,
    "width": 1265,
    "height": 77
  }
}

centerX, centerY -- center point (0-1 normalized)
width, height -- dimensions (0-1 normalized)
rotation -- rotation angle in radians (only present when non-zero)
pixelCoords -- absolute pixel coordinates computed from image dimensions

Full Output Example

Real output from processing this test image with outputDetail: "full":

{
  "imageUrl": "https://tesseract.projectnaptha.com/img/eng_bw.png",
  "language": "en",
  "fullText": "Mild Splendour of the various-vested Night!\n\nMother of wildly-working visions! hail!\n\nI watch thy gliding, while with watery light\nThy weak eye glimmers through a fleecy veil;\nAnd when thou lovest thy pale orb to shroud\nBehind the gather'd blackness lost on high;\nAnd when thou dartest from the wind-rent cloud\nThy placid lightning o'er the awaken'd sky.",
  "lines": [
    {
      "text": "Mild Splendour of the various-vested Night!",
      "boundingBox": {
        "centerX": 0.445742,
        "centerY": 0.107093,
        "width": 0.851279,
        "height": 0.115269,
        "rotation": 0.003223,
        "pixelCoords": { "x": 30, "y": 33, "width": 1265, "height": 77 }
      },
      "words": [
        {
          "text": "Mild",
          "boundingBox": {
            "centerX": 0.066876,
            "centerY": 0.105125,
            "width": 0.09354,
            "height": 0.113772,
            "rotation": 0.003223,
            "pixelCoords": { "x": 30, "y": 32, "width": 139, "height": 76 }
          }
        },
        {
          "text": "Splendour",
          "boundingBox": {
            "centerX": 0.226699,
            "centerY": 0.106229,
            "width": 0.192463,
            "height": 0.115269,
            "rotation": 0.003223,
            "pixelCoords": { "x": 194, "y": 32, "width": 286, "height": 77 }
          }
        }
        // ... "of", "the", "various", "-", "vested", "Night", "!"
      ]
    },
    {
      "text": "Mother of wildly-working visions! hail!",
      "boundingBox": {
        "centerX": 0.436285,
        "centerY": 0.21934,
        "width": 0.739569,
        "height": 0.113772,
        "rotation": 0.004585,
        "pixelCoords": { "x": 99, "y": 109, "width": 1099, "height": 76 }
      },
      "words": [
        { "text": "Mother", "boundingBox": { "centerX": 0.135817, "centerY": 0.216275, "width": 0.138627, "height": 0.113772, "rotation": 0.004585, "pixelCoords": { "x": 99, "y": 106, "width": 206, "height": 76 } } }
        // ... "of", "wildly", "-", "working", "visions", "!", "hail", "!"
      ]
    }
    // ... 6 more lines
  ],
  "paragraphs": [
    {
      "text": "Mild Splendour of the various-vested Night!",
      "boundingBox": {
        "centerX": 0.445742,
        "centerY": 0.107093,
        "width": 0.851279,
        "height": 0.115269,
        "rotation": 0.003223,
        "pixelCoords": { "x": 30, "y": 33, "width": 1265, "height": 77 }
      },
      "contentLanguage": null,
      "lines": [
        {
          "text": "Mild Splendour of the various-vested Night!",
          "boundingBox": {
            "centerX": 0.445742,
            "centerY": 0.107093,
            "width": 0.851279,
            "height": 0.115269,
            "rotation": 0.003223,
            "pixelCoords": { "x": 30, "y": 33, "width": 1265, "height": 77 }
          },
          "words": [
            { "text": "Mild", "boundingBox": { "centerX": 0.066876, "centerY": 0.105125, "width": 0.09354, "height": 0.113772, "rotation": 0.003223, "pixelCoords": { "x": 30, "y": 32, "width": 139, "height": 76 } } },
            { "text": "Splendour", "boundingBox": { "centerX": 0.226699, "centerY": 0.106229, "width": 0.192463, "height": 0.115269, "rotation": 0.003223, "pixelCoords": { "x": 194, "y": 32, "width": 286, "height": 77 } } }
            // ... 7 more words
          ]
        }
      ]
    }
    // ... 2 more paragraphs
  ]
}

FAQ

Do I need to start the Actor before using the API? No. Standby mode auto-starts the Actor when it receives a request. There's no manual step required.

Can I use both modes at the same time? Yes. Batch runs and Standby API requests are independent. You can run a batch while also making API calls.

What's the difference between Standby and a normal run? A normal run processes an image and saves the result to a dataset. Standby mode keeps the Actor alive as an HTTP server, returning results directly in the response. Use Standby for real-time requests. Use batch for one-off processing with dataset output.

What image formats are supported? JPEG, PNG, WebP, BMP, TIFF, HEIC, and GIF.

How accurate is the text detection? Uses the same OCR engine as Google Lens in Chrome. Accuracy is high for printed text in good lighting. Handwriting and very low-resolution images may produce partial results.

What languages are detected? Language detection is automatic. The language hint improves accuracy for specific languages but doesn't limit detection. The detected language is returned in the output.

How does the bounding box data work? Each text element (paragraph, line, word) includes normalized coordinates (0-1 range) and pixel coordinates. Normalized coordinates are relative to the image dimensions. Pixel coordinates give absolute positions for direct use in image annotation or cropping.

What is the rotation field? The rotation angle in radians for text that isn't perfectly horizontal. Only included when the rotation is significant (> 0.001 radians).

Can I send base64 images instead of URLs? Yes, via the Standby mode POST endpoint. Include imageBase64 in the JSON body instead of imageUrl. Supports raw base64 or data URI format (data:image/png;base64,...).

What happens if the image fails to process? The Actor returns an error result with the imageUrl and error field set. Failed images are not charged.

Is there a rate limit? No hard rate limit. In Standby mode, concurrent requests are handled by thread-safe sessions.

Legal Compliance

This Actor processes publicly accessible images provided by the user. Users are responsible for ensuring they have the rights to process the images they submit and must comply with applicable data protection regulations (GDPR, CCPA).

Image to Text OCR — Extract Text from Images

junipr/image-to-text

Extract text from images with OCR, confidence scores, language options, page/image metadata, and automation-ready text exports.

junipr

Google Lens Search API - Reverse Image Search & OCR

zen-studio/google-lens-visual-search

Reverse image search via Google Lens. Returns visual matches, AI descriptions, related links, related searches, and OCR text with bounding boxes. Four modes from fast OCR-only to full all-tabs extraction.

Zen Studio

211

Google Lens | AI Mode | Reverse image search | Translation+OCR

borderline/google-lens

Google Lens | Reverse image search | AI Mode🌟 Seamlessly identify text, translate in real time 🌐, recognize and classify objects 🎁, reverse search images 🔍, and extract detailed structured data 📚. It’s fast, reliable, and affordable—your essential tool for all visual intelligence needs! 🚀

borderline

1.3K

3.9

Google Lens Search: Reverse Image Finder & OCR

getascraper/google-lens-visual-search

Reverse image search via Google Lens. Get visual matches, exact duplicates, AI descriptions, related links, and parallel OCR text with bounding boxes. Bypasses blocks natively using residential proxies.

GetAScraper

Google Lens OCR API: Sub-second Image to Text

getascraper/google-lens-ocr

Extract text from any image with exact word-level bounding boxes and pixel coordinates. Powered by the official Google Lens engine for sub-second, multi-language OCR under 500ms. No browser required.

GetAScraper

TikTok Slideshow Downloader

maximedupre/tiktok-slideshow-downloader

Download photos from public TikTok slideshow URLs. Save each image to Apify storage with source links, author data, captions, post stats, file metadata, and dataset exports.

Maxime Dupré

Image Scraper

rapidtech1898/image-scraper

Extract image links from any website quickly and easily. Enter a URL and the scraper collects all available image URLs in seconds. Perfect for designers, marketers, and developers who need fast access to image sources without manual searching.

Max Pohler

111

1.0

TikTok Video Downloader

maximedupre/tiktok-video-downloader

Download videos and audio from public TikTok video URLs. Save each media file to Apify storage with source links, author data, captions, file metadata, and dataset exports.

Maxime Dupré

Video Subtitle & Caption Extractor

khadinakbar/video-subtitle-extractor

Extract subtitles, captions, and AI transcripts from any video URL across 1000+ platforms (YouTube, Vimeo, TikTok, Instagram, X/Twitter, Facebook, Twitch, TED, Bilibili). Native captions first, Whisper AI fallback when none. JSON, SRT, VTT, text, or LLM-ready markdown.