Pricing

$2.90/month + usage

Image Text Extractor

Extract text from images using OCR (Optical Character Recognition) via direct URLs or uploaded JSON/CSV files. Works with multiple languages and automatically enriches your structured file with the text found inside images.

Developer

M3Web

Last modified: a year ago

🖼️ Image Text Extractor

✅ Features

Accepts image URLs either:
- Directly through startUrls, or
- From uploaded .json or .csv files
Applies OCR (Optical Character Recognition) to each image and extracts:
- extractedText: Full raw text detected
- paragraphs: Text split into readable blocks
- urls: Any links found within the image text
Supports Tesseract OCR with multiple languages (e.g. English, German, Spanish)
Saves results in Apify Key-Value Store with a shareable download link
Logs are clean and easy to follow
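
To make the `paragraphs` and `urls` fields more concrete, here is a minimal sketch of how such values could be derived from raw OCR text. The Actor's actual splitting and link-detection rules are not documented here, so the blank-line rule, the URL pattern, and the helper names `split_paragraphs` and `find_urls` are all assumptions for illustration.

```python
import re

# Assumed pattern: anything starting with http:// or https:// up to the next whitespace.
URL_PATTERN = re.compile(r"https?://\S+")


def split_paragraphs(text: str) -> list[str]:
    """Split raw OCR text into blocks separated by one or more blank lines."""
    blocks = re.split(r"\n\s*\n", text)
    # Collapse line breaks inside each block and drop empty blocks.
    return [" ".join(block.split()) for block in blocks if block.strip()]


def find_urls(text: str) -> list[str]:
    """Return links found in the text, with trailing punctuation removed."""
    return [match.rstrip(".,;:!?)\"'") for match in URL_PATTERN.findall(text)]


raw = "SUMMER SALE\nUp to 50% off\n\nShop now at https://example.com/sale."
print(split_paragraphs(raw))
print(find_urls(raw))
```

OCR output is often noisy, so a real pipeline would likely need a more forgiving URL pattern than this one.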

📥 Input

This Actor accepts these input fields:

Field	Type	Description
`Image URLs`	`array`	(Optional) One or more direct image URLs to process
`Upload a structured file`	`file`	(Optional) Upload a `.json` or `.csv` file that contains image URLs
`Field name for image URL`	`string`	The name of the column or field in your file that holds the image URLs
`language`	`string`	Choose the OCR language from the dropdown (default is English)

👇 Explaining `Field name for image URL` in simple terms

If you're uploading a .json or .csv file, you need to tell the Actor which part of each item contains the image URL. This is what the Field name for image URL is for:

🔢 In a CSV file, each column has a name (like "image_url" or "photo"). You should type in the exact column name where the image URL is located.

Example:

title,image_url
Product 1,https://example.com/image1.jpg
Product 2,https://example.com/image2.jpg

In this case, you'd set Field name for image URL to image_url.
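
If you want to check the column name before uploading, a short script like the following can help. It is only a sketch of the lookup idea using Python's standard `csv` module, not the Actor's own code, and `urls_from_csv` is a hypothetical helper name.

```python
import csv
import io

CSV_TEXT = """title,image_url
Product 1,https://example.com/image1.jpg
Product 2,https://example.com/image2.jpg
"""


def urls_from_csv(csv_text: str, field_name: str) -> list[str]:
    """Return the non-empty values of the column called field_name."""
    reader = csv.DictReader(io.StringIO(csv_text))
    # The header match is exact: "Image_URL" is not the same column as "image_url".
    if field_name not in (reader.fieldnames or []):
        raise KeyError(f"Column {field_name!r} not found; available: {reader.fieldnames}")
    return [row[field_name].strip() for row in reader if row[field_name]]


print(urls_from_csv(CSV_TEXT, "image_url"))
```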

🧱 In a JSON file, each object has named fields. Enter the exact name of the field that stores the image link.

Example:

[
  { "name": "Item A", "photo": "https://example.com/photo.jpg" }
]

Here, you'd set Field name for image URL to photo.

💬 You can also use dot notation to reach inside nested fields. For example, if your JSON file looks like this:

[
  { "assets": { "image": "https://example.com/image.jpg" } }
]

Then set Field name for image URL to assets.image.
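
Dot notation can be pictured as walking through the object one key at a time. The sketch below shows that idea in plain Python. `resolve_path` is a hypothetical helper, and the Actor's real lookup may treat edge cases (missing keys, non-object values) differently.

```python
from typing import Any


def resolve_path(record: dict, path: str) -> Any:
    """Follow a dotted path such as "assets.image" through nested dicts."""
    current: Any = record
    for key in path.split("."):
        if not isinstance(current, dict) or key not in current:
            return None  # the path does not exist in this record
        current = current[key]
    return current


items = [{"assets": {"image": "https://example.com/image.jpg"}}]
print(resolve_path(items[0], "assets.image"))
print(resolve_path(items[0], "assets.thumbnail"))
```

A path without dots, such as photo, works the same way and simply reads the top-level field.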

🔢 Multiple Images in One Row

If your .json or .csv file contains more than one image URL per item, you can still process them all! Simply point to the field that holds an array of URLs.

Example .json input:

{
  "title": "Product Set",
  "images": [
    "https://example.com/photo1.jpg",
    "https://example.com/photo2.jpg"
  ]
}

Set Field name for image URL to images — the Actor will automatically process all image URLs inside that array.

This also works with dot notation for nested arrays:

{
  "media": {
    "photos": [
      "https://example.com/one.jpg",
      "https://example.com/two.jpg"
    ]
  }
}

In this case, set Field name for image URL to media.photos.
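
One way to think about this: whatever the field holds, a single URL or an array of URLs, it ends up as a flat list of images to process. The following sketch shows that normalization step. `collect_urls` is a hypothetical helper, and skipping non-string entries is an assumption, not documented Actor behavior.

```python
def collect_urls(value) -> list[str]:
    """Turn a field value (a single URL or an array of URLs) into a list of URLs."""
    if isinstance(value, str):
        return [value] if value.strip() else []
    if isinstance(value, (list, tuple)):
        # Keep only non-empty strings; other entries are skipped.
        return [item for item in value if isinstance(item, str) and item.strip()]
    return []  # missing field or unsupported type


record = {
    "media": {
        "photos": [
            "https://example.com/one.jpg",
            "https://example.com/two.jpg",
        ]
    }
}
print(collect_urls(record["media"]["photos"]))
print(collect_urls("https://example.com/photo.jpg"))
```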

🌍 OCR Language

The Actor supports many languages besides English. In the input form, a dropdown menu labeled language lets you select the language of the text in your images (e.g. German, French, Spanish). The default is English.

This helps the OCR engine correctly detect and read the text in your image.
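
For background, Tesseract identifies languages by three-letter codes such as eng or deu. The sketch below maps a few language names to those codes. The exact values the dropdown sends are not documented here, so the lowercase names and the helper `to_tesseract_code` are assumptions.

```python
# Tesseract language codes for a few common languages.
TESSERACT_CODES = {
    "english": "eng",
    "german": "deu",
    "french": "fra",
    "spanish": "spa",
}


def to_tesseract_code(language: str, default: str = "eng") -> str:
    """Map a human-readable language name to a Tesseract code, falling back to English."""
    return TESSERACT_CODES.get(language.strip().lower(), default)


print(to_tesseract_code("German"))
print(to_tesseract_code("Klingon"))
```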

📤 Output

After processing, you'll receive:

A structured CSV or JSON file with enriched data:
- extractedText: All text found in each image
- paragraphs: Text broken into readable chunks
- urls: Any links found inside the image text
🔗 A downloadable link to your processed file saved in Apify's Key-Value Store
📊 OCR results are also pushed to an Apify Dataset (optional)

🚀 Example Use Cases

Extracting text from screenshot-based Google Ads
Enriching scraped product data with visible text
Identifying links or CTAs from image banners

🤖 Behind the Scenes

This Actor uses:

Tesseract.js for OCR
Sharp for image preprocessing (grayscale, normalize)
In-memory parsing and stringifying for both JSON and CSV
Output is clean and downloadable, with clear logs and no clutter
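
To illustrate what the grayscale and normalize steps do before OCR, here is a toy version in pure Python working on a short list of pixels. This is not how Sharp implements them: the Rec. 601 luminance weights and the simple min-max stretch are assumptions chosen to show the idea, and both function names are hypothetical.

```python
def to_grayscale(pixels: list[tuple[int, int, int]]) -> list[int]:
    """Convert (r, g, b) pixels to 0-255 luminance using Rec. 601 weights."""
    return [round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in pixels]


def normalize(gray: list[int]) -> list[int]:
    """Stretch values linearly so the darkest becomes 0 and the brightest 255."""
    lo, hi = min(gray), max(gray)
    if hi == lo:
        return list(gray)  # flat image: nothing to stretch
    return [round((value - lo) * 255 / (hi - lo)) for value in gray]


# A low-contrast strip: mid-grey text on a slightly lighter background.
pixels = [(100, 100, 100), (120, 120, 120), (200, 200, 200)]
gray = to_grayscale(pixels)
print(gray)
print(normalize(gray))
```

Stretching the contrast this way widens the gap between text and background, which generally makes characters easier for an OCR engine to separate.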

💡 Tip

Want to extract thousands of image ads from Google’s Ad Transparency Center? Combine this with a crawler that scrapes adstransparency.google.com, then feed that structured JSON into this Actor. Boom — text from image ads, at scale.
