RAG Web Browser
Pricing
from $3.99 / 1,000 results
Enter Google Search keywords or a URL of a specific web page. Supports advanced search operators. Examples: san francisco weather, https://www.cnn.com, function calling site:openai.com. Leave empty only when using URLs (optional bulk) below.
Skip Google Search and scrape these URLs directly. If set, the search term above is ignored. Default: []
The maximum number of top organic Google Search results whose web pages will be extracted. If the query is a URL, this field is ignored and only that page is fetched.
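The precedence between the search term, the URL list, and the result limit can be sketched as follows. This is a minimal illustration, not the Actor's code: the names `Plan`, `is_url`, and `resolve_input` are hypothetical, and raising an error when neither a query nor URLs are given is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional
from urllib.parse import urlparse


@dataclass
class Plan:
    mode: str                   # "direct" (no search) or "search"
    targets: List[str]          # URLs to fetch, or a single search query
    max_results: Optional[int]  # only meaningful in "search" mode


def is_url(text: str) -> bool:
    """Treat only absolute http(s) addresses as URLs."""
    parsed = urlparse(text.strip())
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def resolve_input(query: str, urls: List[str], max_results: int) -> Plan:
    # A non-empty URL list wins: the search term is ignored entirely.
    if urls:
        return Plan("direct", list(urls), None)
    # A query that is itself a URL is fetched directly; the limit is ignored.
    if is_url(query):
        return Plan("direct", [query.strip()], None)
    # Assumption: an empty query with no URLs is rejected.
    if not query.strip():
        raise ValueError("Provide a search query, a URL, or a list of URLs")
    return Plan("search", [query.strip()], max_results)
```

For example, `resolve_input("function calling site:openai.com", [], 3)` yields a search plan, while passing any URL list yields a direct plan regardless of the query.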
Select one or more formats to which the target web pages will be extracted and saved in the resulting dataset. Default: ["markdown"]
Overrides the default Apify Proxy group used for fetching Google Search results.
The maximum number of times the Actor will retry fetching Google Search results on error. If the last attempt fails, the entire search step fails.
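The retry limit for the search step might behave like the loop below. This is a sketch under one assumption: the limit counts retries in addition to the first attempt. `fetch_with_retries` and its `fetch` argument are hypothetical names, not part of the Actor.

```python
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def fetch_with_retries(fetch: Callable[[], T], max_retries: int) -> T:
    """Call fetch once, then retry up to max_retries more times on error."""
    last_error: Optional[Exception] = None
    for _ in range(max_retries + 1):
        try:
            return fetch()
        except Exception as err:  # any error counts as a failed attempt
            last_error = err
    # The last attempt failed, so the whole search step fails.
    raise RuntimeError(
        f"search failed after {max_retries + 1} attempts"
    ) from last_error
```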
Apify Proxy settings for scraping target web pages. When enabled, requests start on datacenter proxies. If a site blocks the request, the Actor automatically escalates to residential proxies (up to 3 retries, then stays on residential for the rest of the run). Default: { "useApifyProxy": true }
Raw HTTP is fast and works for most static sites. Browser (Playwright) mode is not available in this Python build; if selected, Raw HTTP is used instead.
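The proxy escalation described for the proxy settings can be read in more than one way. The sketch below takes one reading: each blocked datacenter request is retried on a residential proxy, and after three such escalations the run goes straight to residential. `ProxyEscalator` and the `fetch(url, tier)` callable are hypothetical, and the Actor's actual bookkeeping may differ.

```python
from typing import Callable

MAX_ESCALATIONS = 3  # from "up to 3 retries" in the description


class ProxyEscalator:
    def __init__(self, fetch: Callable[[str, str], bool]):
        # fetch(url, tier) is a hypothetical callable; it returns False
        # when the site blocked the request on that proxy tier.
        self.fetch = fetch
        self.escalations = 0

    @property
    def sticky(self) -> bool:
        """True once the run has switched to residential for good."""
        return self.escalations >= MAX_ESCALATIONS

    def get(self, url: str) -> str:
        """Return the tier that served url, or raise if every tier was blocked."""
        if not self.sticky:
            if self.fetch(url, "datacenter"):
                return "datacenter"
            self.escalations += 1  # blocked: escalate this request
        if self.fetch(url, "residential"):
            return "residential"
        raise RuntimeError(f"blocked on all proxy tiers: {url}")
```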
CSS selectors for elements removed from the DOM before conversion to text or Markdown. Set to a non-matching selector like dummy_keep_everything to disable removal.
How to transform HTML after element removal. None keeps the cleaned page; Readable text extracts the main article body.
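Element removal followed by text extraction can be illustrated with the Python standard library. This sketch is deliberately simplified: it matches plain tag names rather than full CSS selectors, assumes well-formed HTML, and does not attempt main-article detection. `TagStripper`, `html_to_text`, and the default tag list are hypothetical.

```python
from html.parser import HTMLParser

# Elements without a closing tag; they never change nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagStripper(HTMLParser):
    def __init__(self, remove_tags):
        super().__init__(convert_charrefs=True)
        self.remove_tags = set(remove_tags)
        self.skip_depth = 0  # > 0 while inside a removed element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.skip_depth or tag in self.remove_tags:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when it is outside every removed element.
        if not self.skip_depth and data.strip():
            self.chunks.append(data.strip())


def html_to_text(html, remove_tags=("nav", "footer", "script", "style")):
    parser = TagStripper(remove_tags)
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)
```

Passing a name that matches nothing, such as `("dummy_keep_everything",)`, keeps all text, which mirrors the way removal is disabled in the field description.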
Per-page retry budget after the first failure, before the page is skipped or proxy escalation continues.
Maximum seconds to wait for dynamic content when using Browser mode. Ignored for Raw HTTP (the default scraping tool in this build).
Remove cookie consent banners before extraction. Slightly increases processing time.