RegExp Scraper avatar

RegExp Scraper

Try for free

30 minutes trial then $25.00/month - No credit card required now

Go to Store
RegExp Scraper

RegExp Scraper

ib4ngz/regexp-scraper
Try for free

30 minutes trial then $25.00/month - No credit card required now

This actor scrapes data from a list of provided URLs using regular expressions for precise and customizable pattern matching. It can handle both static and dynamic web pages and supports depth-based crawling to explore links and extract data from multiple levels of the web.

Developer
Maintained by Community

Actor Metrics

  • 1 monthly user

  • No reviews yet

  • 1 bookmark

  • >99% runs succeeded

  • Created in Jan 2025

  • Modified 2 months ago

Start URLs

startUrlsarrayRequired

URLs to start with.

Maximum Depth

maxDepthintegerOptional

The maximum depth for crawling.

Default value of this property is 1

Regex Patterns

patternsstringRequired

Patterns to search in the HTML content. Each pattern should be on a new line.

Crawler Type

crawlerTypeEnumOptional

Select the type of crawler to use.

Value options:

"Crawlee + Cheerio": string"Crawlee + Puppeteer + Chrome": string

Default value of this property is "Crawlee + Cheerio"

Proxy configuration

proxyConfigurationobjectOptional

Select proxies to be used by your crawler.

Minimum Concurrency

minConcurrencyintegerOptional

The minimum number of concurrent requests or pages being processed.

Default value of this property is 1

Maximum Concurrency

maxConcurrencyintegerOptional

The maximum number of concurrent requests or pages being processed.

Default value of this property is 10