Pricing

Pay per usage

Go to Store

Extended GPT Scraper

Try for free

Developed by

Jakub Drobník

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

4.6 (4)

Pricing

Pay per usage

Total users

1.5K

Monthly users

Runs succeeded

99%

Last modified

6 months ago

Lead generation

Open source

You can access the Extended GPT Scraper programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

Python

JavaScript

CLI

OpenAPI

HTTP

MCP

1from apify_client import ApifyClient
2
3# Initialize the ApifyClient with your Apify API token
4# Replace '<YOUR_API_TOKEN>' with your token.
5client = ApifyClient("<YOUR_API_TOKEN>")
6
7# Prepare the Actor input
8run_input = {
9    "startUrls": [{ "url": "https://news.ycombinator.com/" }],
10    "instructions": """Gets the post with the most points from the page and returns it as JSON in this format: 
11postTitle
12postUrl
13pointsCount""",
14    "model": "gpt-3.5-turbo",
15    "includeUrlGlobs": [],
16    "excludeUrlGlobs": [],
17    "linkSelector": "a[href]",
18    "initialCookies": [],
19    "proxyConfiguration": { "useApifyProxy": True },
20    "targetSelector": "",
21    "removeElementsCssSelector": "script, style, noscript, path, svg, xlink",
22    "skipGptGlobs": [],
23    "schema": {
24        "type": "object",
25        "properties": {
26            "title": {
27                "type": "string",
28                "description": "Page title",
29            },
30            "description": {
31                "type": "string",
32                "description": "Page description",
33            },
34        },
35        "required": [
36            "title",
37            "description",
38        ],
39    },
40    "schemaDescription": "",
41}
42
43# Run the Actor and wait for it to finish
44run = client.actor("drobnikj/extended-gpt-scraper").call(run_input=run_input)
45
46# Fetch and print Actor results from the run's dataset (if there are any)
47print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
48for item in client.dataset(run["defaultDatasetId"]).iterate_items():
49    print(item)
50
51# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

Extended GPT Scraper API in Python

The Apify API client for Python is the official library that allows you to use Extended GPT Scraper API in Python, providing convenience functions and automatic retries on errors.

Install the apify-client

$pip install apify-client

Other API clients include:

Extended GPT Scraper API in JavaScript

Extended GPT Scraper API through CLI

Extended GPT Scraper OpenAPI definition

Extended GPT Scraper API

GPT Scraper

drobnikj/gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Jakub Drobník

4.4

GPT Browser

anchor/gpt-browser

A GPT browser to use OpenAI prompt on any website. Put a list of URLs and a prompt, then the GPT agent will give you the answer you need. Fast, easy, and not limited with OpenAI ChatGPT restrictions. The best way to search and use GPT on large number of websites. Upload Excel or CSV. Screenshots 📸

Anchor

Auto GPT

lukaskrivka/auto-gpt

Run Auto GPT sessions directly on Apify. No OpenAI account or API token is required! Store parsed thoughts into datasets for later analysis.

Lukáš Křivka

199

Universal AI GPT Scraper

louisdeconinck/ai-gpt-scraper

Transform any website into structured data with AI-powered extraction. This versatile tool combines advanced web scraping with intelligent content analysis to deliver clean, customized JSON output - perfect for automating data collection from any web source.

Louis Deconinck

5.0

GPT-2 text generation

jirimoravcik/gpt2-text-generation

This actor uses the GPT-2 language model to generate text.

Jiří Moravčík

248

GPT Search

tri_angle/gpt-search

Send queries to ChatGPT and retrieve structured answers with full source citations. Easily integrate into your tools or workflows for flexible, scalable AI-powered solutions.

Tri⟁angle

AI Web Agent

apify/ai-web-agent

Use natural language prompts to browse the web, click on elements, fill and submit forms, extract data, and take screenshots using the OpenAI API.

Apify

1.4K

4.2

RAG Web Browser

apify/rag-web-browser

Web browser for OpenAI Assistants, RAG pipelines, or AI agents, similar to a web browser in ChatGPT. It queries Google Search, scrapes the top N pages, and returns their content as Markdown for further processing by an LLM. It can also scrape individual URLs. Supports Model Context Protocol (MCP).

Apify

3.9K

4.3

Actors MCP Server

apify/actors-mcp-server

⚠️ Legacy: This Actor is outdated. For the latest features and full documentation, visit https://mcp.apify.com. Easily connect any Apify Actor to AI agents using Anthropic’s Model Context Protocol (MCP) with our actively maintained MCP server.

Apify

1.7K

4.7

Backlink Building Agent

daniil.poletaev/backlink-building-agent

The Backlink Building Agent automates backlink outreach by finding relevant pages & websites, extracting contacts from these websites, and then crafting personalized outreach sequences based on the content to these partners. These sequences can be used on email, LinkedIn, Twitter, & WhatsApp.