Under maintenance

Pricing

from $0.01 / 1,000 results

Try for free

Go to Apify Store

Image To Image Localization Actor

Under maintenance

Try for free

Image to Image Text Translation Actor Translate text within images while preserving the original layout, styling, and visual appearance. This Actor uses Google Cloud Vision API for text detection and Lingo.dev or Gemini AI for high-quality translation.

Pricing

from $0.01 / 1,000 results

Rating

0.0

(0)

Developer

Agung Sidharta So

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

Image to Image Text Translation Actor

Translate text within images while preserving the original layout, styling, and visual appearance. This Actor uses Google Cloud Vision API for text detection and Lingo.dev for high-quality translation with brand voice preservation.

Features

🔍 Smart Text Detection: Uses Google Cloud Vision API to detect text with precise bounding boxes
🌍 70+ Languages: Supports all major languages with regional variants
🎨 Visual Preservation: Maintains original colors, fonts, and layout
📏 Adaptive Sizing: Automatically adjusts font size and background for different text lengths
🔤 Alphanumeric Filtering: Only translates readable text (letters, numbers, spaces)
📁 Batch Processing: Process up to 100 images in a single run
🔄 Dual Translation Options: Choose between Gemini AI (default, direct image translation) or Lingo.dev (brand voice preservation)

How It Works

Text Detection: Scans the image using Google Cloud Vision API (Lingo.dev mode only)
Smart Filtering: Only processes alphanumeric text - skips symbols, decorations (Lingo.dev mode only)
Translation: Choose between Gemini AI (default, direct image translation) or Lingo.dev (brand voice preservation with text detection)
Color Analysis: Detects original text colors and background patterns (Lingo.dev mode only)
Adaptive Rendering: Adjusts font size and background based on translation length (Lingo.dev mode only)
AI Quality Check: Gemini analyzes result for text overlaps and applies fixes (Lingo.dev mode with GEMINI_API_KEY)
Image Composition: Overlays translated text while preserving visual integrity (Lingo.dev mode)

Input

Field	Type	Required	Description
`imageUrls`	Array	Yes	Array of image URLs to translate (up to 100 images per run)
`targetLanguage`	String	Yes	Target language (dropdown selection)
`translationProvider`	String	No	Translation service: Gemini AI (default) or Lingo.dev (brand voice)
`fontSize`	Integer	No	Override font size (8-72px, Lingo.dev only, default: auto-detect)
`fontFamily`	String	No	Font family (Lingo.dev only: Arial, Helvetica, Times New Roman, Courier New)

Supported Languages

The Actor supports 70+ languages including:

Major Languages: English, Spanish, French, German, Chinese, Japanese, Arabic, Russian
Regional Variants: en-US, en-GB, es-ES, es-MX, zh-CN, zh-TW, etc.
Specialized: Bavarian, Neapolitan, Tamazight, and more

Environment Variables

Set these as secret environment variables in the Apify Console:

Variable	Required	Description
`GEMINI_API_KEY`	Conditional*	Gemini API key (required if using Gemini provider)
`LINGO_API_KEY`	Conditional*	Your Lingo.dev API key (required if using Lingo.dev provider)
`GOOGLE_CLOUD_CREDENTIALS_JSON`	Conditional**	Google Cloud credentials as JSON string (required for Lingo.dev provider)

*Required based on selected translation provider (Gemini is default) **Only required when using Lingo.dev provider

Setup Instructions

1. Get API Keys

Gemini API:

Get API key from Google AI Studio
Set as GEMINI_API_KEY environment variable

Lingo.dev API Key (Optional):

Sign up at Lingo.dev
Create a new project
Generate an API key
Set as LINGO_API_KEY environment variable

Google Cloud Vision API (Optional - for Lingo.dev provider):

Create a project in Google Cloud Console
Enable the Vision API
Create a service account with Vision API permissions
Download the JSON key file
Set as GOOGLE_CLOUD_CREDENTIALS_JSON environment variable

Gemini API (Optional - for Lingo.dev AI enhancement):

Get API key from Google AI Studio
Set as GEMINI_API_KEY environment variable for AI image editing with Lingo.dev

2. Configure Environment Variables

In the Apify Console:

Go to your Actor settings
Add environment variables based on your chosen provider:
- For Gemini (default): GEMINI_API_KEY
- For Lingo.dev: LINGO_API_KEY and GOOGLE_CLOUD_CREDENTIALS_JSON
- Optional: Add GEMINI_API_KEY when using Lingo.dev for AI-powered text overlap fixes

3. Run the Actor

Select target language from dropdown
Choose translation provider (Gemini AI is default, or Lingo.dev for brand voice)
Provide one or more image URLs (up to 100)
Optionally adjust font settings (Lingo.dev only)
Run the Actor

Output

The Actor produces:

Dataset Items

{
  "originalUrl": "https://example.com/image.jpg",
  "originalType": "url",
  "originalFileName": null,
  "translatedImageUrl": "https://api.apify.com/v2/key-value-stores/.../translated-image.png",
  "targetLanguage": "es-ES",
  "processingTime": 3.2
}

Key-Value Store

Translated Image: PNG file with translated text overlaid

Use Cases

Marketing Localization: Translate ads, banners, and promotional materials
E-commerce: Localize product images for international markets
Documentation: Translate screenshots and diagrams
Social Media: Adapt visual content for different regions
Website Localization: Translate UI elements and graphics

Limitations

Only processes alphanumeric text (letters, numbers, spaces)
Requires clear, readable text in images
Works best with horizontal text layouts
Very complex backgrounds may affect color detection

Performance

Processing Time: 5-8 seconds per image with Gemini, 8-12 seconds with Lingo.dev (including AI analysis)
Batch Processing: Up to 100 images per run
Supported Formats: JPEG, PNG, WebP, GIF
Max Image Size: 10MB
Concurrent Runs: Up to 100 (depending on plan)

Error Handling

Common errors and solutions:

Error	Cause	Solution
"No text detected"	Image has no readable text	Use images with clear text
"LINGO_API_KEY required"	Missing API key	Set environment variable
"Google Cloud credentials not found"	Missing credentials	Set Google Cloud environment variable
"No alphanumeric text found"	Only symbols/decorations detected	Use images with letters/numbers

Example Usage

Via Apify Console

Open the Actor in Apify Console
Add one or more image URLs
Select "Spanish (Spain)" as target language
Choose translation provider (Gemini AI is default)
Click "Start"

Via API

const { ApifyApi } = require('apify-client');

const client = new ApifyApi({
    token: 'your-apify-token'
});

const run = await client.actor('your-actor-id').call({
    imageUrls: [
        'https://example.com/image1.jpg',
        'https://example.com/image2.jpg'
    ],
    targetLanguage: 'es-ES',
    translationProvider: 'gemini' // default, or 'lingo'
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item, i) => {
    console.log(`Image ${i + 1}:`, item.translatedImageUrl);
});

Support

Documentation: Apify Docs
Community: Apify Discord
Issues: Create an issue in this repository

License

This Actor is licensed under the Apache 2.0 License.

image to image

evoort-solutions-llc/image-to-image

Evoort Solutions LLC

Image To Text

calm_necessity/image-to-text

Image to Text Actor analyzes images and generates detailed text descriptions of scenes, objects, and visual context. Upload an image and receive a human-readable explanation of what the image contains. Ideal for accessibility, content understanding, and automation workflows.

Taher Ali Badnawarwala

Google Images Scraper

hooli/google-images-scraper

Scrape image details from images.google.com. Add your query and number of images and extract image details such as image URL, image source, description, image dimensions, thumbnail, and more. Export scraped data, run the scraper via API, schedule and monitor runs, or integrate with other tools.

Hooli

3.5K

4.4

Image to Prompt Generator 🎨 ✨

easyapi/image-to-prompt-generator

Transform any image into detailed text descriptions using AI. Perfect for content creators, SEO specialists, and developers who need automated image-to-text conversion. Powered by Phot.ai's advanced image recognition technology.

EasyApi

Google Images Scraper 🖼️ - Cheapest

scrapestorm/google-images-scraper---cheapest

Easily collect image results from Google Images 🔥 the world’s most-used visual search engine. Just enter any keyword to extract key details like image URL, title, alt text, source website, resolution & result position 🔍 Perfect for datasets, research, AI training or automated image sourcing 📊🖼️

Storm_Scraper

Midjourney Bot

imageaibot/midjourney-bot

This Actor integrates Discord Midjourney-related APIs, including text-to-image, image operations (such as upscaling or extending an already generated image), image blending, image-to-text, task query, and more .website: [Midjourney中国官网](https://mj.bandeyu.com)

ImageAI Bot

5.0

Google Images Scraper 🖼️ - Cheapest (Pay Per results) 🔥

scrapestorm/google-images-scraper---cheapest-pay-per-results

Storm_Scraper

Bulk Image Downloader

onescales/bulk-image-downloader

The Bulk Image Downloader is a powerful Apify actor that extracts and downloads images from web pages or processes direct image URLs in bulk. Whether you need to download a single image or thousands of images from multiple websites, this tool handles it all efficiently.

One Scales

550

5.0

Image Scraper

rapidtech1898/image-scraper

Extract image links from any website quickly and easily. Enter a URL and the scraper collects all available image URLs in seconds. Perfect for designers, marketers, and developers who need fast access to image sources without manual searching.