Screenshot & HTML file from Url avatar

Screenshot & HTML file from Url

Pricing

from $1.00 / 1,000 results

Go to Apify Store
Screenshot & HTML file from Url

Screenshot & HTML file from Url

From 1$/1000 results. Capture website screenshots &/or full-page HTML in one run, from $1/1000 URLs. PNG, JPEG & PDF — full-page, custom viewport, lazy-load scroll, cookie-banner hiding, batch mode. HTML files open correctly in any browser. REST API ready. No watermark.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

Alexandre Manguis

Alexandre Manguis

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

20 hours ago

Last modified

Share

From $1 per 1 000 URLs — capture a full-page screenshot and the complete HTML source in a single run. No watermark. REST API ready.

Screenshot preview


What it does

Open each URL in a real browser, wait for the page to fully load, then save two complementary outputs to Apify storage:

Visual capture — screenshot or PDF

OptionDetails
FormatsPNG · JPEG · PDF
Full-pageCaptures the entire scrollable page, not just the viewport
Custom viewportSet width × height in pixels
JPEG qualityAdjustable from 30 to 100 %

Page archive — HTML source

When Save HTML source is enabled, the actor also saves the full page HTML as a .html file. All relative links (stylesheets, images, fonts, scripts) are automatically resolved against the original URL, so the file opens and renders correctly in any browser with no extra setup.

Both outputs are linked in the same dataset row. Use screenshots only, HTML only, or both together.


Capture controls

OptionDetails
Lazy-load scrollScrolls to the bottom before capture to trigger deferred content
CSS hidingSuppresses cookie banners, chat widgets, or any CSS selector before the shot
Delay before captureExtra wait time after load for animations or late-rendered content
Wait-untilChoose between page load, DOM ready, or full network idle
RetriesAutomatic retry on failure — configurable per URL
ConcurrencyRun multiple captures in parallel

Best use cases

  • Visual regression testing — compare pages before and after a deploy
  • Competitive monitoring — daily snapshots of competitor landing pages
  • Content archiving — preserve pages as timestamped PNG, PDF & HTML records
  • Lead enrichment — attach a screenshot and raw HTML to each prospect's profile
  • Reporting & documentation — auto-generate visual proof for QA or client reports
  • Data extraction pipelines — feed the saved HTML into a downstream scraper
  • Thumbnail generation — batch-produce previews for link directories or dashboards

Quick start

  1. Click Try for free (or call the API).
  2. Paste one or more URLs into the Page URLs field.
  3. Choose PNG, JPEG, or PDF as the output format.
  4. Toggle Save HTML source on if you also want the HTML file.
  5. Click Start — results appear in the Dataset and Key-Value Store tabs within seconds.

Input

FieldTypeDefaultDescription
urlsarrayOne or more URLs to capture
formatpng / jpeg / pdfpngOutput file format
fullPagebooleantrueCapture the entire page height
saveHtmlbooleanfalseAlso save the page HTML source as a .html file
viewportWidthinteger1280Viewport width in px
viewportHeightinteger720Viewport height in px
jpegQualityinteger90JPEG quality (30–100 %)
waitUntilload / domcontentloaded / networkidleloadWhen navigation is considered complete
delayBeforeScreenshotMsinteger0Extra wait after page load (ms)
scrollToBottombooleanfalseScroll to trigger lazy content before capture
delayAfterScrollMsinteger1000Wait after scroll (ms)
selectorsToHidestringCSS selectors to hide before capture (one per line)
navigationTimeoutSecsinteger60Max time to load each page (s)
maxRequestRetriesinteger1Retries on failure
maxConcurrencyinteger1Parallel captures
proxyConfigurationobjectdisabledOptional proxy (Apify Proxy or custom URLs)

Output

Dataset (one row per URL)

FieldDescription
oktrue if the capture succeeded
urlFinal URL after redirects
titlePage <title>
statusCodeHTTP status code
screenshotUrlDirect download link for the screenshot or PDF
htmlUrlDownload link for the HTML source (when saveHtml is enabled)
formatpng, jpeg, or pdf
viewportWidth / viewportHeightViewport used
fullPageWhether full-page mode was active
capturedAtISO timestamp
durationMsCapture duration in ms
errorMessageError detail if ok is false

Key-Value Store

Files are stored with a deterministic key derived from the URL. The screenshotUrl and htmlUrl columns in the dataset link directly to each file.

If a URL fails after all retries the actor still writes a dataset row with ok: false and an errorMessage, so you never lose track of what failed.


Tips

Hiding cookie banners Use selectorsToHide to remove overlays before capture:

#cookie-banner
.gdpr-overlay
[data-cookienotice]

Full-page vs. viewport-only Set fullPage: false to capture only what is visible without scrolling — useful for above-the-fold thumbnails.

Slow or JS-heavy pages Switch waitUntil to networkidle and add 1 000–3 000 ms to delayBeforeScreenshotMs.

Lazy-loaded images Enable scrollToBottom and raise delayAfterScrollMs to 2 000–5 000 ms.

Feeding HTML into a scraper Enable saveHtml, grab the htmlUrl from the dataset, and parse the downloaded file downstream — no need to re-fetch the page.

Scaling to thousands of URLs Increase maxConcurrency (up to 5) and consider enabling Apify Proxy to avoid rate-limits.


Proxy settings

Proxies are disabled by default. Enable the Proxies section to:

  • Use Apify Proxy (residential or datacenter)
  • Supply your own proxy URLs

Troubleshooting

SymptomFix
Blank or all-white screenshotIncrease delayBeforeScreenshotMs or switch waitUntil to networkidle
Cookie overlay visibleAdd the overlay selector to selectorsToHide
Lazy images missingEnable scrollToBottom
Timeout errorIncrease navigationTimeoutSecs or add a proxy
PDF is viewport-onlyPDF format ignores fullPage; adjust viewportWidth and viewportHeight
HTML file looks brokenMake sure you are opening the .html file via a browser (not a text editor); all styles load from the live site

Keywords

website screenshot api · capture webpage · screenshot url · full page screenshot · webpage to pdf · save html page · html archiver · batch screenshot · visual monitoring · website thumbnail · web page archiving