This Apify actor takes a list of URLs and downloads the full HTML content of each page. You can configure proxy settings and, optionally, a selector to wait for on each page before its HTML is captured.

✅ Use Cases

📄 Download HTML content from multiple websites

🕷️ Archive web pages for offline analysis

📊 Extract raw HTML for custom parsing

🔍 Monitor website changes over time

📥 Input Configuration

You can customize the actor using the following input fields:

{
  "requestListSources": [
    {
      "url": "https://apify.com"
    }
  ],
  "proxyConfiguration": {
    "useApifyProxy": true
  },
  "handlePageTimeoutSecs": 60,
  "maxRequestRetries": 1,
  "useChrome": false
}

🧾 Fields Explained

| Field | Type | Description |
| --- | --- | --- |
| requestListSources | array | Required. Array of URLs to download. Each item can have optional userData with waitForSelector. |
| proxyConfiguration | object | Proxy settings: choose no proxy, Apify Proxy, or custom proxy URLs. |
| handlePageTimeoutSecs | integer | Optional. Maximum time, in seconds, to spend processing one page (default: 60). |
| maxRequestRetries | integer | Optional. How many retries before giving up (default: 1). |
| useChrome | boolean | Optional. Use real Chrome browser instead of Chromium (default: false). |
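The field rules above can be checked locally before an input is submitted. The following is a minimal sketch, not the actor's own validation logic: the helper name `normalize_input` is hypothetical, and the rules that the array must be non-empty and the integers non-negative are assumptions rather than documented behavior.

```python
from typing import Any

# Defaults as documented for the optional fields.
DEFAULTS: dict[str, Any] = {
    "handlePageTimeoutSecs": 60,
    "maxRequestRetries": 1,
    "useChrome": False,
}


def normalize_input(raw: dict[str, Any]) -> dict[str, Any]:
    """Validate an input object and fill in the documented defaults.

    Hypothetical helper: the actor itself may validate differently.
    """
    sources = raw.get("requestListSources")
    # Assumption: "Required" is read as "must be a non-empty array".
    if not isinstance(sources, list) or not sources:
        raise ValueError("requestListSources must be a non-empty array")
    for item in sources:
        if not isinstance(item, dict) or not isinstance(item.get("url"), str):
            raise ValueError(f"each source needs a string 'url': {item!r}")

    # Values given by the caller override the defaults.
    result = {**DEFAULTS, **raw}
    for key in ("handlePageTimeoutSecs", "maxRequestRetries"):
        value = result[key]
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(value, bool) or not isinstance(value, int) or value < 0:
            raise ValueError(f"{key} must be a non-negative integer")
    if not isinstance(result["useChrome"], bool):
        raise ValueError("useChrome must be a boolean")
    return result
```

Calling `normalize_input({"requestListSources": [{"url": "https://apify.com"}]})` returns the same object with `handlePageTimeoutSecs`, `maxRequestRetries`, and `useChrome` filled in from the defaults.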

📤 Output

The actor returns a dataset containing HTML content for each URL. Each record includes the original URL, final URL (after redirects), page title, and full HTML content.

🧩 Sample Output

[
  {
    "url": "https://apify.com",
    "loadedUrl": "https://apify.com/",
    "title": "Apify - Web Scraping & Data Extraction | Apify",
    "html": "<!DOCTYPE html>\n<html lang=\"en\">\n<head>\n<meta charset=\"utf-8\">\n..."
  }
]
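Records in this shape are easy to post-process. As a rough sketch (the `summarize` helper and its output keys are hypothetical, not part of the actor), the following reduces each record to a short summary with a SHA-256 digest of the HTML, which can be compared between runs to spot page changes. It treats a trailing-slash difference between `url` and `loadedUrl` as "not redirected", which is an assumption about what counts as a redirect.

```python
import hashlib
import json


def summarize(records):
    """Reduce dataset records to small summaries (hypothetical helper)."""
    rows = []
    for rec in records:
        html = rec.get("html") or ""
        encoded = html.encode("utf-8")
        loaded = rec.get("loadedUrl") or rec["url"]
        rows.append({
            "url": rec["url"],
            # Ignore a trailing-slash difference when flagging redirects.
            "redirected": rec["url"].rstrip("/") != loaded.rstrip("/"),
            "title": rec.get("title"),
            "html_bytes": len(encoded),
            # Digest of the HTML; a changed digest means the page changed.
            "sha256": hashlib.sha256(encoded).hexdigest(),
        })
    return rows


# A record shaped like the sample output (HTML shortened for the example).
sample = json.loads("""
[
  {
    "url": "https://apify.com",
    "loadedUrl": "https://apify.com/",
    "title": "Apify - Web Scraping & Data Extraction | Apify",
    "html": "<!DOCTYPE html><html lang=\\"en\\"></html>"
  }
]
""")

for row in summarize(sample):
    print(row["url"], row["redirected"], row["html_bytes"], row["sha256"][:12])
```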

🔒 Proxy Configuration

This actor supports flexible proxy configuration:

No proxy

Apify Proxy (the default), including residential IPs via the RESIDENTIAL group

Custom proxy URLs

Default proxy settings:

{
  "useApifyProxy": true
}
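The three proxy modes map onto different `proxyConfiguration` objects. The sketch below builds one for each mode. The `useApifyProxy` and `apifyProxyGroups` keys appear in this README, while the `proxyUrls` key for custom proxies is an assumption based on the usual Apify proxy input convention and is not confirmed here. The `proxy_config` helper and its mode names are hypothetical.

```python
def proxy_config(mode, groups=None, proxy_urls=None):
    """Build a proxyConfiguration object for one of three modes.

    mode: "none", "apify", or "custom" (hypothetical labels).
    """
    if mode == "none":
        return {"useApifyProxy": False}
    if mode == "apify":
        config = {"useApifyProxy": True}
        if groups:
            # For example ["RESIDENTIAL"] to request residential IPs.
            config["apifyProxyGroups"] = list(groups)
        return config
    if mode == "custom":
        if not proxy_urls:
            raise ValueError("custom mode needs at least one proxy URL")
        # Assumption: custom proxies are passed under the "proxyUrls" key.
        return {"useApifyProxy": False, "proxyUrls": list(proxy_urls)}
    raise ValueError(f"unknown proxy mode: {mode!r}")


print(proxy_config("apify", groups=["RESIDENTIAL"]))
```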

🚀 How to Use

Open the actor in Apify Console

Click "Try actor" or create a new task

Add URLs to the requestListSources array

Configure proxy settings if needed

Run the actor

Download HTML content in JSON, CSV, or XML format
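The same run can also be started without the Console. The following is a sketch using only the Python standard library against the Apify API's synchronous "run actor and get dataset items" endpoint. The actor ID `username~actor-name` is a placeholder because this README does not state the actor's ID, and the token is whatever API token your account provides.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id, token, run_input):
    """Prepare a POST that runs the actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(actor_id, token, run_input, timeout=300):
    """Send the request and parse the JSON array of records."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Placeholder values: replace with a real actor ID and API token.
    records = run_actor(
        "username~actor-name",
        "YOUR_APIFY_TOKEN",
        {"requestListSources": [{"url": "https://apify.com"}]},
    )
    print(len(records), "records downloaded")
```

A synchronous call suits short URL lists. For long lists, starting the run asynchronously and reading the dataset afterwards is likely the safer choice, since the request can otherwise time out.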

⚙️ Advanced Input Example

{
  "requestListSources": [
    {
      "url": "https://example.com",
      "userData": {
        "waitForSelector": ".content-loaded"
      }
    },
    {
      "url": "https://another-site.com"
    }
  ],
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  },
  "handlePageTimeoutSecs": 120,
  "maxRequestRetries": 3,
  "useChrome": true
}
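When the URL list is long, writing `requestListSources` by hand gets tedious. As an illustration, the sketch below generates the array from a list of URLs and an optional mapping of URL to selector, attaching `userData.waitForSelector` only where a selector is given. The `build_sources` helper is hypothetical and not part of the actor.

```python
import json


def build_sources(urls, wait_for=None):
    """Build requestListSources; wait_for maps a URL to a selector."""
    wait_for = wait_for or {}
    sources = []
    for url in urls:
        item = {"url": url}
        selector = wait_for.get(url)
        if selector:
            # Only URLs with a selector get a userData block.
            item["userData"] = {"waitForSelector": selector}
        sources.append(item)
    return sources


sources = build_sources(
    ["https://example.com", "https://another-site.com"],
    wait_for={"https://example.com": ".content-loaded"},
)
print(json.dumps({"requestListSources": sources}, indent=2))
```

The printed object matches the `requestListSources` part of the advanced example above.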

🛠️ Tech Stack

🧩 Apify SDK — for actor and data handling

🕷️ Crawlee — for robust crawling and scraping

🌐 Puppeteer — for browser automation and rendering dynamic content

⚙️ Node.js — fast, scalable backend environment
