1from apify_client import ApifyClient
2
3
4
# Client for the Apify API.
# NOTE(review): token is hard-coded as a placeholder — consider loading it
# from an environment variable instead of committing it to source.
client = ApifyClient("<YOUR_API_TOKEN>")


# Input for the "drobnikj/extended-gpt-scraper" actor run (see the call below).
run_input = {
    # Page(s) where the crawl starts.
    "startUrls": [{ "url": "https://news.ycombinator.com/" }],
    # Natural-language instructions for the GPT model; the actor is asked to
    # return the top-scoring post as JSON with the three listed fields.
    "instructions": """Gets the post with the most points from the page and returns it as JSON in this format:
postTitle
postUrl
pointsCount""",
    # OpenAI model the actor uses for extraction.
    "model": "gpt-3.5-turbo",
    # URL glob filters for which discovered links to follow / skip
    # (empty here, so no include/exclude filtering).
    "includeUrlGlobs": [],
    "excludeUrlGlobs": [],
    # CSS selector used to discover links on each page.
    "linkSelector": "a[href]",
    # Cookies pre-set before crawling (none).
    "initialCookies": [],
    # Route requests through the Apify proxy pool.
    "proxyConfiguration": { "useApifyProxy": True },
    # Restrict scraping to a sub-element of the page (empty = whole page).
    "targetSelector": "",
    # Elements stripped from the HTML before it is sent to the model.
    "removeElementsCssSelector": "script, style, noscript, path, svg, xlink",
    # URL globs for pages that should skip the GPT step (none).
    "skipGptGlobs": [],
    # JSON schema the model's structured output must conform to.
    "schema": {
        "type": "object",
        "properties": {
            "title": {
                "type": "string",
                "description": "Page title",
            },
            "description": {
                "type": "string",
                "description": "Page description",
            },
        },
        "required": [
            "title",
            "description",
        ],
    },
    # Optional prose description of the schema (unused here).
    "schemaDescription": "",
}
42
43
# Start the actor and wait for the run to finish.
run = client.actor("drobnikj/extended-gpt-scraper").call(run_input=run_input)

# Actor.call() returns None when the run could not be completed (e.g. the
# wait timed out or starting the run failed). The original code would then
# crash with an opaque TypeError on the subscript below — fail loudly with
# a clear message instead.
if run is None:
    raise RuntimeError("Actor run did not finish successfully.")

# Point the user at the stored results, then stream and print each item
# from the run's default dataset.
dataset_id = run["defaultDatasetId"]
print(f"💾 Check your data here: https://console.apify.com/storage/datasets/{dataset_id}")
for item in client.dataset(dataset_id).iterate_items():
    print(item)
50
51