You can access the ScraperCodeGenerator programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

Python

JavaScript

CLI

OpenAPI

HTTP

MCP

1from apify_client import ApifyClient
2
3# Initialize the ApifyClient with your Apify API token
4# Replace '<YOUR_API_TOKEN>' with your token.
5client = ApifyClient("<YOUR_API_TOKEN>")
6
7# Prepare the Actor input
8run_input = {
9    "targetUrl": "https://books.toscrape.com/",
10    "userGoal": "Get me a list of all the books on the first page. For each book, I want its title, price, star rating, and whether it is in stock.",
11    "actors": [
12        {
13            "name": "cheerio-scraper",
14            "enabled": True,
15            "input": {
16                "maxRequestRetries": 3,
17                "requestTimeoutSecs": 30,
18                "maxPagesPerCrawl": 1,
19                "pageFunction": """async function pageFunction(context) {
20    const { request, log, $ } = context;
21    try {
22        const title = $('title').text() || '';
23        const html = $('html').html() || '';
24        return {
25            url: request.url,
26            title: title,
27            html: html
28        };
29    } catch (error) {
30        log.error('Error in pageFunction:', error);
31        return {
32            url: request.url,
33            title: '',
34            html: ''
35        };
36    }
37}""",
38                "proxyConfiguration": { "useApifyProxy": True },
39            },
40        },
41        {
42            "name": "web-scraper",
43            "enabled": False,
44            "input": {
45                "maxRequestRetries": 3,
46                "requestTimeoutSecs": 30,
47                "maxPagesPerCrawl": 1,
48                "pageFunction": """async function pageFunction(context) {
49    const { request, log, page } = context;
50    try {
51        const title = await page.title();
52        const html = await page.content();
53        return {
54            url: request.url,
55            title: title,
56            html: html
57        };
58    } catch (error) {
59        log.error('Error in pageFunction:', error);
60        return {
61            url: request.url,
62            title: '',
63            html: ''
64        };
65    }
66}""",
67                "proxyConfiguration": { "useApifyProxy": True },
68            },
69        },
70        {
71            "name": "website-content-crawler",
72            "enabled": True,
73            "input": {
74                "maxCrawlPages": 1,
75                "crawler": "playwright",
76                "proxyConfiguration": { "useApifyProxy": True },
77            },
78        },
79        {
80            "name": "playwright-scraper",
81            "enabled": False,
82            "input": {
83                "maxRequestRetries": 2,
84                "requestTimeoutSecs": 45,
85                "maxPagesPerCrawl": 1,
86                "pageFunction": """async function pageFunction(context) {
87    const { request, log, page } = context;
88    try {
89        const title = await page.title();
90        const html = await page.content();
91        return {
92            url: request.url,
93            title: title,
94            html: html
95        };
96    } catch (error) {
97        log.error('Error in pageFunction:', error);
98        return {
99            url: request.url,
100            title: '',
101            html: ''
102        };
103    }
104}""",
105                "proxyConfiguration": { "useApifyProxy": True },
106            },
107        },
108    ],
109}
110
111# Run the Actor and wait for it to finish
112run = client.actor("ohlava/scrapercodegenerator").call(run_input=run_input)
113
114# Fetch and print Actor results from the run's dataset (if there are any)
115print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
116for item in client.dataset(run["defaultDatasetId"]).iterate_items():
117    print(item)
118
119# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

ScraperCodeGenerator API in Python

The Apify API client for Python is the official library that allows you to use ScraperCodeGenerator API in Python, providing convenience functions and automatic retries on errors.

Install the apify-client

$pip install apify-client

Other API clients include:

ScraperCodeGenerator API in JavaScript

ScraperCodeGenerator API through CLI

ScraperCodeGenerator OpenAPI definition

ScraperCodeGenerator API

Scraping Page

flavorful_quintuplet/scraping-page

Scraping Page Based on Your Website

Nomeri Sadulak

AI Newsletter Agent

louisdeconinck/ai-newsletter-agent

An intelligent agent that automatically generates curated newsletters on any topic by collecting and aggregating content from multiple sources.

Louis Deconinck

5.0

website content crawler

akash9078/website-content-crawler

Powerful website content crawler tool to extract, analyze, and index web pages automatically. Streamline data collection with fast, accurate web scraping technology.

Akash Kumar Naik

Universal AI GPT Scraper

louisdeconinck/ai-gpt-scraper

Transform any website into structured data with AI-powered extraction. This versatile tool combines advanced web scraping with intelligent content analysis to deliver clean, customized JSON output - perfect for automating data collection from any web source.

Louis Deconinck

127

5.0

Custom Web Scraping Service

delicious_zebu/custom-web-scraping-service

Discover professional, tailored web scraping solutions. Input anything to learn about our services, pricing, and process!

ВAH

HTML Scraper pro

scrapingxpert/html-scraper-pro

The HTML Scraper Pro is a powerful tool designed to extract the HTML source code and metadata from websites. It uses advanced web scraping techniques to retrieve the full HTML content of web pages,page title and HTTP status code.This tool is ideal for data extraction, website analysis, and archiving

scrapingxpert

170

5.0

Website Content Crawler Fast

timelody/website-content-crawler-fast

Scraping data from every single web page.

timelody

5.0

📧✨ Extract Emails, Socials and Contacts from Any Website

logical_scrapers/extract-email-from-any-website

(fastest) An advanced Actor for extracting email addresses, social links and contact details from websites. This tool is perfect for web scraping, contact collection, and lead generation.

Goldmine

866

5.0

Intelligent Website Scrapper

happitap/intelligent-website-scrapper

An intelligent website scraper that uses LangChain and LLM to extract and process content based on high-level goals like summarization, product extraction, service extraction, and FAQ extraction.

HappiTap

Ultimate Walmart Scraper

eneiromatos/ultimate-walmart-scraper

This is the ultimate web scraping tool for extracting the most relevant data points from products on Walmart.com! Developed by an expert software developer, this powerful scraper is a fast and reliable tool for all your web scraping needs.