Bulk Image Downloader (Scrape, Download & WebP Optimize)

Pricing

from $0.50 / 1,000 results

Scrape and download images from any webpage in bulk. Filter by minimum width/height, limit images per page, optionally download files to Key-Value Store, and convert to WebP to save space. Outputs preview + download links with dimensions and format.

Rating

0.0

(0)

Developer

Logiover Data

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

18 hours ago

Last modified

🖼️ Bulk Image Downloader: Image Scraper, Downloader & WebP Optimizer

Scrape and download images from any webpage in bulk.
Filter out tiny icons by size, cap the number of images per page, optionally download image files to Apify Key-Value Store, and convert everything to WebP to reduce storage and bandwidth.

If you need a reliable bulk image downloader, image scraper, webpage image extractor, or WebP converter pipeline, this Actor is designed for fast automation and clean output.


✅ What this Actor does

Given one or more target URLs, the Actor:

  • Extracts image URLs from each page
  • Filters images by minimum width/height
  • Limits the number of images per page (maxImages)
  • Optionally downloads images to Key-Value Store (for direct file access)
  • Optionally converts images to WebP for compression/optimization
  • Outputs a dataset with preview, download link, dimensions, and format
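
The filtering and per-page cap described above can be illustrated with a minimal sketch. This is not the Actor's actual implementation; the `ImageCandidate` record and the `select_images` helper are hypothetical names introduced here, and the defaults simply mirror the documented ones (200 x 200, 50 images).

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class ImageCandidate:
    """Hypothetical record for one image discovered on a page."""
    image_url: str
    width: int
    height: int


def select_images(
    candidates: Iterable[ImageCandidate],
    min_width: int = 200,
    min_height: int = 200,
    max_images: int = 50,
) -> List[ImageCandidate]:
    """Keep images that meet both minimums, stopping at the per-page cap."""
    selected: List[ImageCandidate] = []
    for candidate in candidates:
        if len(selected) >= max_images:
            break  # per-page limit reached
        if candidate.width >= min_width and candidate.height >= min_height:
            selected.append(candidate)
    return selected


page_images = [
    ImageCandidate("https://example.com/icon.png", 32, 32),
    ImageCandidate("https://example.com/hero.jpg", 1920, 1280),
    ImageCandidate("https://example.com/card.jpg", 640, 480),
]
kept = select_images(page_images, max_images=1)
print([image.image_url for image in kept])
```

With `max_images=1`, the icon is rejected by the size filter and only the first qualifying image is kept.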

✨ Key Features

  • Bulk Image Extraction from any webpage
  • Size Filtering (min width/height) to remove icons & thumbnails
  • Download Mode (stores images in Key-Value Store)
  • URL-Only Mode (skip downloads, return image URLs only)
  • WebP Optimization (compress images to WebP to save space)
  • Per-Page Limits to control cost and keep runs predictable
  • Proxy Support for image-heavy sites and throttling protection

🛠 How to Use

  1. Add one or more Target URLs
  2. Set filters:
    • Min Width / Min Height (remove small images/icons)
    • Max Images per Page (limit extraction)
  3. Choose whether to:
    • Download images to Key-Value Store
    • Convert to WebP for optimization
  4. (Recommended) Enable Proxy Configuration
  5. Run the Actor and export results as JSON/CSV/Excel

⚙️ Input Configuration

startUrls (required)

List of pages to scrape images from.

minWidth / minHeight

Minimum image size in pixels. Helps remove icons and UI assets.

  • Default: 200 x 200

maxImages

Max images per page to scrape.

  • Default: 50

downloadImages

If true, images are downloaded to Key-Value Store and a direct downloadUrl is provided.
If false, the Actor returns image URLs only.

  • Default: true

convertToWebp

If true, images are converted to WebP to save space.

  • Default: true

proxyConfiguration

Proxies are recommended on image-heavy sites to avoid rate limiting.
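
As a rough illustration of how these fields fit together, the sketch below builds an input object on the client side using the defaults documented above. The `build_input` helper is a hypothetical convenience, not part of the Actor; the Actor is assumed to apply its own defaults when a field is omitted.

```python
from typing import Any, Dict, List

# Defaults as documented in the input reference above.
DEFAULTS: Dict[str, Any] = {
    "minWidth": 200,
    "minHeight": 200,
    "maxImages": 50,
    "downloadImages": True,
    "convertToWebp": True,
}

# Fields that have no documented default but are still accepted.
OPTIONAL_FIELDS = {"proxyConfiguration"}


def build_input(start_urls: List[str], **overrides: Any) -> Dict[str, Any]:
    """Merge caller overrides over the documented defaults."""
    if not start_urls:
        raise ValueError("startUrls is required and must not be empty")
    unknown = set(overrides) - set(DEFAULTS) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown input fields: {sorted(unknown)}")
    actor_input: Dict[str, Any] = {"startUrls": [{"url": url} for url in start_urls]}
    actor_input.update(DEFAULTS)
    actor_input.update(overrides)
    return actor_input


url_only = build_input(
    ["https://example.com/gallery"],
    downloadImages=False,
    proxyConfiguration={"useApifyProxy": True},
)
print(url_only)
```

Rejecting unknown keys early catches typos such as `maxImage` before a run is started.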


✅ Example Input (JSON)

{
  "startUrls": [{ "url": "https://unsplash.com/s/photos/nature" }],
  "minWidth": 200,
  "minHeight": 200,
  "downloadImages": true,
  "convertToWebp": true,
  "maxImages": 50,
  "proxyConfiguration": { "useApifyProxy": true }
}
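
An input like the one above could be submitted programmatically. The sketch below assumes the Apify API v2 synchronous endpoint `run-sync-get-dataset-items` and bearer-token authentication; the Actor ID and token are placeholders you must replace, and `build_run_request` and `run_actor` are hypothetical helper names. Check the Apify API reference before relying on it.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, actor_input: dict) -> urllib.request.Request:
    """Prepare a POST request that runs an Actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )


def run_actor(request: urllib.request.Request, timeout: float = 300.0) -> list:
    """Send the request and decode the JSON array of dataset items."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


example_input = {
    "startUrls": [{"url": "https://unsplash.com/s/photos/nature"}],
    "minWidth": 200,
    "minHeight": 200,
    "downloadImages": True,
    "convertToWebp": True,
    "maxImages": 50,
    "proxyConfiguration": {"useApifyProxy": True},
}

# Placeholder values: substitute the real Actor ID and your own API token.
request = build_run_request("YOUR_USERNAME~YOUR_ACTOR_NAME", "YOUR_API_TOKEN", example_input)
print(request.get_method(), request.full_url)
# items = run_actor(request)  # uncomment to perform the network call
```

Building the request separately from sending it makes the payload easy to inspect before any credits are spent.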

📦 Output Dataset (Image Dataset)

Each dataset item contains:

  • sourceUrl: the page where the image was discovered
  • imageUrl: the original image URL (preview)
  • downloadUrl: direct download link (when downloadImages=true)
  • width: image width (px)
  • height: image height (px)
  • format: file format (jpg/png/webp/etc.)

Output Example

{
  "sourceUrl": "https://unsplash.com/s/photos/nature",
  "imageUrl": "https://images.unsplash.com/photo-xxxx",
  "downloadUrl": "https://api.apify.com/v2/key-value-stores/XXXX/records/image-1700000000000.webp",
  "width": 1920,
  "height": 1280,
  "format": "webp"
}
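
Once exported as JSON, items of this shape are straightforward to post-process. The following sketch groups items by source page and reports the largest image per page; the sample data is made up for illustration and `group_by_source` is a hypothetical helper.

```python
import json
from collections import defaultdict
from typing import Dict, List

# Illustrative export; real runs produce one object per image.
exported = """
[
  {"sourceUrl": "https://example.com/a", "imageUrl": "https://example.com/a/1.jpg",
   "width": 1920, "height": 1280, "format": "webp"},
  {"sourceUrl": "https://example.com/a", "imageUrl": "https://example.com/a/2.jpg",
   "width": 800, "height": 600, "format": "webp"},
  {"sourceUrl": "https://example.com/b", "imageUrl": "https://example.com/b/1.png",
   "width": 640, "height": 480, "format": "webp"}
]
"""


def group_by_source(items: List[dict]) -> Dict[str, List[dict]]:
    """Collect dataset items under the page they were discovered on."""
    grouped: Dict[str, List[dict]] = defaultdict(list)
    for item in items:
        grouped[item["sourceUrl"]].append(item)
    return dict(grouped)


items = json.loads(exported)
by_page = group_by_source(items)
for source, images in by_page.items():
    # Rank by pixel area to find the highest-resolution image on the page.
    largest = max(images, key=lambda image: image["width"] * image["height"])
    print(source, len(images), largest["imageUrl"])
```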

📊 Dataset View (Downloaded Images)

This Actor includes a dataset table view:

  • Preview (image)
  • Download Link
  • Width
  • Format

Great for quickly validating image quality before exporting.

🔥 Pro Tips (speed, quality & cost control)

  1. Filter aggressively to avoid junk

Set minWidth and minHeight to at least 200 (the default). For high-quality images only, try minWidth: 800 and minHeight: 600.

  2. Use maxImages for predictable cost

Some pages load hundreds of images. Set maxImages to 20–50 to keep run time and cost predictable.

  3. Use URL-only mode for huge crawls

If you only need links at first, set downloadImages: false, then download only the selected images later.

  4. Use WebP conversion for storage savings

WebP often reduces file size significantly while preserving good visual quality, which makes it well suited to datasets and pipelines.
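
The URL-only workflow from the tips above could look roughly like this: run with downloadImages: false, pick the items you want, then fetch them yourself. This is a sketch under assumptions: it presumes URL-only items still carry width, height, and format, and `local_name` and `download_selected` are hypothetical helpers that run outside the Actor.

```python
import os
import urllib.request
from typing import Dict, List


def local_name(item: Dict, index: int) -> str:
    """Build a numbered file name, using the reported format as the extension."""
    extension = (item.get("format") or "bin").lower()
    return f"image-{index:04d}.{extension}"


def download_selected(items: List[Dict], target_dir: str, timeout: float = 30.0) -> List[str]:
    """Fetch each item's imageUrl into target_dir and return the saved paths."""
    os.makedirs(target_dir, exist_ok=True)
    saved: List[str] = []
    for index, item in enumerate(items, start=1):
        path = os.path.join(target_dir, local_name(item, index))
        with urllib.request.urlopen(item["imageUrl"], timeout=timeout) as response:
            data = response.read()
        with open(path, "wb") as handle:
            handle.write(data)
        saved.append(path)
    return saved


# Illustrative items from a URL-only run (no downloadUrl field).
url_only_items = [
    {"imageUrl": "https://example.com/hero.jpg", "width": 1920, "height": 1280, "format": "jpg"},
    {"imageUrl": "https://example.com/thumb.png", "width": 300, "height": 200, "format": "png"},
]

# Second pass: keep only images that meet the high-quality thresholds.
selected = [item for item in url_only_items if item["width"] >= 800 and item["height"] >= 600]
print([local_name(item, number) for number, item in enumerate(selected, start=1)])
# download_selected(selected, "images")  # uncomment to fetch the files
```

Note that files fetched this way are stored in their original format; WebP conversion only applies when the Actor performs the download.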

🧯 Troubleshooting

Some images are missing width/height

Some sites lazy-load images or omit dimensions. The Actor extracts what is available.
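
When processing results, it can help to separate items with complete dimensions from those without before applying any size-based logic. The sketch below assumes a missing dimension shows up as an absent key or a null value, which the source does not specify; `split_by_dimensions` is a hypothetical helper.

```python
from typing import Dict, List, Tuple


def split_by_dimensions(items: List[Dict]) -> Tuple[List[Dict], List[Dict]]:
    """Return (items with both width and height, items missing either one)."""
    complete: List[Dict] = []
    incomplete: List[Dict] = []
    for item in items:
        has_width = isinstance(item.get("width"), int)
        has_height = isinstance(item.get("height"), int)
        (complete if has_width and has_height else incomplete).append(item)
    return complete, incomplete


results = [
    {"imageUrl": "https://example.com/a.jpg", "width": 1200, "height": 800},
    {"imageUrl": "https://example.com/b.jpg", "width": None, "height": None},
    {"imageUrl": "https://example.com/c.jpg"},
]
complete, incomplete = split_by_dimensions(results)
print(len(complete), "with dimensions,", len(incomplete), "without")
```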

Download links are missing

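Ensure downloadImages is set to true. In URL-only mode (downloadImages: false), the Actor returns image URLs only, so no downloadUrl is produced.

Getting blocked / throttled

  • Enable the proxy: set proxyConfiguration.useApifyProxy to true
  • Reduce concurrency (if your implementation exposes it)
  • Lower maxImages for heavy pages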

🔍 SEO Keywords (what this Actor targets)

  • bulk image downloader
  • image scraper
  • extract images from website
  • webpage image downloader
  • download images from url
  • webp converter
  • image optimization pipeline
  • image dataset generator

πŸ—Ί Roadmap

Planned improvements:

  • Deduplication across pages (hash-based)
  • Domain-wide crawling mode (follow internal links)
  • Filename templates and folder-like output structure
  • Additional optional optimization formats (AVIF)

Support & Feedback

Open an issue on the Actor page and include:

  • Your target URLs
  • Expected image types (product images, gallery, etc.)
  • Your size/format requirements