RegExp Scraper
30-minute trial, then $25.00/month. No credit card required now.
This actor scrapes data from a list of provided URLs using regular expressions for precise and customizable pattern matching. It can handle both static and dynamic web pages and supports depth-based crawling to explore links and extract data from multiple levels of the web.
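To make "depth-based crawling" concrete, here is a minimal sketch of a depth-limited, breadth-first traversal. It is not the Actor's implementation: the `linkGraph` object, the `crawl` function, and the `maxDepth` parameter are hypothetical names chosen for illustration, and the "site" is an in-memory map rather than real HTTP requests.

```javascript
// Hypothetical in-memory "site": each URL maps to the links found on that page.
const linkGraph = {
  'https://example.com/': ['https://example.com/a', 'https://example.com/b'],
  'https://example.com/a': ['https://example.com/c'],
  'https://example.com/b': [],
  'https://example.com/c': ['https://example.com/d'],
  'https://example.com/d': [],
};

// Breadth-first traversal that stops following links once maxDepth is reached.
function crawl(startUrl, maxDepth) {
  const visited = new Set([startUrl]);
  const queue = [{ url: startUrl, depth: 0 }];
  const order = [];

  while (queue.length > 0) {
    const { url, depth } = queue.shift();
    order.push({ url, depth });

    // Pages at the depth limit are visited but their links are not followed.
    if (depth >= maxDepth) continue;

    for (const next of linkGraph[url] ?? []) {
      if (!visited.has(next)) {
        visited.add(next);
        queue.push({ url: next, depth: depth + 1 });
      }
    }
  }
  return order;
}

const pages = crawl('https://example.com/', 1);
console.log(pages);
```

With a depth limit of 1, the sketch visits the start page and its two direct links, and never reaches the pages that are two or more links away.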
You can access the RegExp Scraper programmatically from your own JavaScript applications by using the Apify API. To use the Apify API, you’ll need an Apify account and your API token, found in the Integrations settings in Apify Console.
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrls": [
        {
            "url": "https://apify.com"
        }
    ],
    "patterns": "(?<=href=[\"'])([^\"']+)",
    "crawlerType": "Crawlee + Cheerio"
};

// Run the Actor and wait for it to finish
const run = await client.actor("ib4ngz/regexp-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs
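Before starting a run, it can help to check locally what the `patterns` value from the input above matches. The sketch below applies the same expression to a small HTML string in plain JavaScript. The sample markup is made up, and the global flag is an assumption added so that `matchAll` returns every match; how the Actor itself compiles and applies the pattern is not documented here.

```javascript
// The same pattern as the "patterns" input, written as a JavaScript regex literal.
// The lookbehind (?<=href=["']) requires an opening href quote before the match.
// Assumption: the global flag is added so that matchAll can return every match.
const hrefPattern = /(?<=href=["'])([^"']+)/g;

// Hypothetical sample markup, used only for illustration.
const sampleHtml = `
  <a href="https://apify.com/store">Store</a>
  <a href='/pricing'>Pricing</a>
  <a class="plain">No link</a>
`;

// Collect the first capture group of every match.
const links = [...sampleHtml.matchAll(hrefPattern)].map((match) => match[1]);

console.log(links);
// → [ 'https://apify.com/store', '/pricing' ]
```

Both double-quoted and single-quoted `href` values are captured, and relative links such as `/pricing` are returned as written rather than resolved against the page URL.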
RegExp Scraper API in JavaScript
The Apify API client for JavaScript is the official library for using the RegExp Scraper API from JavaScript or TypeScript. It provides convenience functions and automatic retries on errors.
Install the apify-client
npm install apify-client
Actor Metrics
1 monthly user
1 star
>99% runs succeeded
Created in Jan 2025
Modified 19 hours ago