RegExp Scraper avatar

RegExp Scraper

Try for free

30 minutes trial then $25.00/month - No credit card required now

Go to Store
RegExp Scraper

RegExp Scraper

ib4ngz/regexp-scraper
Try for free

30 minutes trial then $25.00/month - No credit card required now

This actor scrapes data from a list of provided URLs using regular expressions for precise and customizable pattern matching. It can handle both static and dynamic web pages and supports depth-based crawling to explore links and extract data from multiple levels of the web.

Start URLs

startUrlsarrayRequired

URLs to start with.

Maximum Depth

maxDepthintegerOptional

The maximum depth for crawling.

Default value of this property is 1

Regex Patterns

patternsstringRequired

Patterns to search in the HTML content. Each pattern should be on a new line.

Crawler Type

crawlerTypeEnumOptional

Select the type of crawler to use.

Value options:

"Crawlee + Cheerio": string"Crawlee + Puppeteer + Chrome": string

Default value of this property is "Crawlee + Cheerio"

Proxy configuration

proxyConfigurationobjectOptional

Select proxies to be used by your crawler.

Minimum Concurrency

minConcurrencyintegerOptional

The minimum number of concurrent requests or pages being processed.

Default value of this property is 1

Maximum Concurrency

maxConcurrencyintegerOptional

The maximum number of concurrent requests or pages being processed.

Default value of this property is 10

Developer
Maintained by Community

Actor Metrics

  • 1 monthly user

  • 1 star

  • >99% runs succeeded

  • Created in Jan 2025

  • Modified 15 hours ago