Web Scraper

Pricing

$35.00 / 1,000 results

Developed by Futurize Rush

Maintained by Community

A simple web scraper that extracts titles, paragraphs, links, images, tables, and more from websites. It supports custom CSS selectors and batch collection. For larger workloads, try Apify's Web Content Crawler.

Last modified

3 days ago

Start URLs

startUrls (array, required)

Enter one or more URLs to scrape.

Max Requests

maxRequestsPerCrawl (integer, optional)

Maximum number of pages to scrape (to avoid running too long)

Default value of this property is 10
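
Because pricing is per result, `maxRequestsPerCrawl` also works as a rough spending cap. A minimal sketch of the arithmetic follows, assuming each scraped page yields exactly one billed result (the listing does not confirm this). The helper name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Listed price: $35.00 per 1,000 results.
PRICE_PER_1000_RESULTS = Decimal("35.00")


def estimate_cost(results: int) -> Decimal:
    """Estimate the charge in USD for a given number of results."""
    if results < 0:
        raise ValueError("results must be non-negative")
    return PRICE_PER_1000_RESULTS * results / 1000


# Assumption: one result per page, so the default cap of 10 pages
# bounds a run at roughly 10 results.
default_run_cost = estimate_cost(10)
print(default_run_cost)
```

Under that assumption, a run at the default cap would cost about $0.35, and raising the cap to 1,000 pages would cost about $35.00.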

Max Concurrency

maxConcurrency (integer, optional)

Number of pages to scrape simultaneously (1 = stable but slow, 5 = fast)

Default value of this property is 3
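
The three fields above can be assembled and sanity-checked before a run. This is a sketch only: the helper `build_core_input` is hypothetical, and it assumes `startUrls` accepts plain URL strings (some Apify actors expect objects with a `url` key instead, and the listing does not say which).

```python
from urllib.parse import urlparse


def build_core_input(start_urls, max_requests=10, max_concurrency=3):
    """Build the three core input fields, using the documented defaults."""
    if not start_urls:
        raise ValueError("startUrls is required and must not be empty")
    for url in start_urls:
        parsed = urlparse(url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"not an absolute http(s) URL: {url!r}")
    if max_requests < 1 or max_concurrency < 1:
        raise ValueError("limits must be positive integers")
    return {
        # Assumption: plain URL strings; the actor may expect objects instead.
        "startUrls": list(start_urls),
        "maxRequestsPerCrawl": max_requests,
        "maxConcurrency": max_concurrency,
    }


core_input = build_core_input(["https://example.com/", "https://example.org/docs"])
```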

Smart Mode

smartMode (boolean, optional)

Automatically optimize speed and save costs when enabled

Default value of this property is true

Enable Cache

cacheEnabled (boolean, optional)

Remember scraped pages to avoid duplicate scraping

Default value of this property is true

🎨 Custom Extraction Rules

extractionRules (object, optional)

Use CSS selectors to specify which elements to extract. The built-in examples include common selectors that you can modify directly.

Default value of this property is {}
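
The listing does not document the exact shape of `extractionRules`. One plausible layout, shown here purely as an assumption, maps an output field name to a CSS selector. The field names, selectors, and the `check_rules` helper are all illustrative.

```python
# Assumed shape: output field name -> CSS selector string.
# The listing only says the object holds CSS selectors, so these keys are made up.
extraction_rules = {
    "headline": "h1",
    "price": ".product-price",
    "specs": "table.specs td",
}


def check_rules(rules):
    """Reject rules whose names or selectors are not non-empty strings."""
    for name, selector in rules.items():
        if not isinstance(name, str) or not name.strip():
            raise ValueError(f"invalid field name: {name!r}")
        if not isinstance(selector, str) or not selector.strip():
            raise ValueError(f"invalid selector for {name!r}: {selector!r}")
    return rules


check_rules(extraction_rules)
```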

📌 Custom Tags

customData (object, optional)

Add your tags to each data record (e.g., source, category, etc.)

Default value of this property is {}
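
How the tags end up in the output is not specified. If they are merged into each record as top-level fields, which is an assumption, the effect would resemble the sketch below. The record fields and tag values are invented for illustration.

```python
# Hypothetical tags; any keys and values could be used.
custom_data = {"source": "competitor-watch", "category": "pricing"}

# Hypothetical scraped record; the actor's real output fields are not documented here.
record = {"url": "https://example.com/", "title": "Example Domain"}

# Assumption: tags are merged in as top-level fields of each record.
tagged_record = {**record, **custom_data}
print(tagged_record)
```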

⏳ Wait for Element

waitForSelector (string, optional)

Wait for a specific element to appear before extracting (for dynamic websites)

📜 Auto Scroll to Bottom

scrollToBottom (boolean, optional)

Automatically scroll the page to load more content (for infinite scroll websites)

Default value of this property is false

⏱️ Page Load Timeout

pageLoadTimeoutSecs (integer, optional)

Maximum time to wait for a page to load, in seconds

Default value of this property is 25
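
For a JavaScript-heavy page, the three options above are likely to be set together. A sketch of such a fragment follows. The selector `.product-card` and the 60-second timeout are hypothetical values chosen for illustration, not recommendations from the listing.

```python
# Input fragment for a dynamic, infinite-scroll page.
dynamic_page_options = {
    # Hypothetical selector: an element that appears only after client-side rendering.
    "waitForSelector": ".product-card",
    # Scroll so that lazily loaded items are requested (documented default: False).
    "scrollToBottom": True,
    # Raised from the documented default of 25 to allow for slow rendering.
    "pageLoadTimeoutSecs": 60,
}

# Basic sanity checks on the fragment.
assert isinstance(dynamic_page_options["scrollToBottom"], bool)
assert dynamic_page_options["pageLoadTimeoutSecs"] > 0
```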

🚫 Block Resource Types

blockResources (array, optional)

Block unnecessary resources (images, CSS, fonts, etc.) to speed up scraping. By default, no resources are blocked, to ensure complete extraction.

Default value of this property is []
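
Putting the documented defaults together, a complete run input might be built as sketched below. The helpers `make_input` and `run_actor` are hypothetical. `run_actor` relies on the `apify-client` package and an `APIFY_TOKEN` environment variable, and the actor ID must be supplied by the caller because the listing does not state it.

```python
import copy
import os

# Defaults as documented above; waitForSelector has no documented default.
DEFAULT_INPUT = {
    "maxRequestsPerCrawl": 10,
    "maxConcurrency": 3,
    "smartMode": True,
    "cacheEnabled": True,
    "extractionRules": {},
    "customData": {},
    "scrollToBottom": False,
    "pageLoadTimeoutSecs": 25,
    "blockResources": [],
}


def make_input(start_urls, **overrides):
    """Combine required URLs, documented defaults, and caller overrides."""
    unknown = set(overrides) - set(DEFAULT_INPUT) - {"waitForSelector"}
    if unknown:
        raise KeyError(f"unknown input fields: {sorted(unknown)}")
    # Deep copy so the shared default dicts and lists are never mutated.
    run_input = copy.deepcopy(DEFAULT_INPUT)
    run_input.update(overrides)
    # Assumption: startUrls accepts plain URL strings.
    run_input["startUrls"] = list(start_urls)
    return run_input


def run_actor(actor_id, run_input):
    """Start the actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so the rest of this sketch works without the package.
    from apify_client import ApifyClient

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


run_input = make_input(["https://example.com/"], maxConcurrency=1)
```

Calling `run_actor("<username>/<actor-name>", run_input)` with the actor's real ID would then start a billed run.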