Pricing

Pay per usage

Go to Store

Extended GPT Scraper

Try for free

Developed by

Jakub Drobník

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

4.6 (4)

Pricing

Pay per usage

Total users

1.4k

Monthly users

Runs succeeded

98%

Issue response

1.4 days

Last modified

4 months ago

Lead generation

Open source

You can access the Extended GPT Scraper programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

Python

JavaScript

CLI

OpenAPI

HTTP

MCP

# Set API token
$API_TOKEN=<YOUR_API_TOKEN>

# Prepare Actor input
$cat > input.json << 'EOF'
<{
<  "startUrls": [
<    {
<      "url": "https://news.ycombinator.com/"
<    }
<  ],
<  "instructions": "Gets the post with the most points from the page and returns it as JSON in this format: \npostTitle\npostUrl\npointsCount",
<  "model": "gpt-3.5-turbo",
<  "includeUrlGlobs": [],
<  "excludeUrlGlobs": [],
<  "linkSelector": "a[href]",
<  "initialCookies": [],
<  "proxyConfiguration": {
<    "useApifyProxy": true
<  },
<  "targetSelector": "",
<  "removeElementsCssSelector": "script, style, noscript, path, svg, xlink",
<  "skipGptGlobs": [],
<  "schema": {
<    "type": "object",
<    "properties": {
<      "title": {
<        "type": "string",
<        "description": "Page title"
<      },
<      "description": {
<        "type": "string",
<        "description": "Page description"
<      }
<    },
<    "required": [
<      "title",
<      "description"
<    ]
<  },
<  "schemaDescription": ""
<}
<EOF

# Run the Actor using an HTTP API
# See the full API reference at https://docs.apify.com/api/v2
$curl "https://api.apify.com/v2/acts/drobnikj~extended-gpt-scraper/runs?token=$API_TOKEN" \
<  -X POST \
<  -d @input.json \
<  -H 'Content-Type: application/json'

Extended GPT Scraper API

Below, you can find a list of relevant HTTP API endpoints for calling the Extended GPT Scraper Actor. For this, you’ll need an Apify account. Replace <YOUR_API_TOKEN> in the URLs with your Apify API token, which you can find under Integrations in Apify Console. For details, see the API reference.

Run Actor

POST

https://api.apify.com/v2/acts/drobnikj~extended-gpt-scraper/runs?token=<YOUR_API_TOKEN>

Note: By adding the method=POST query parameter, this API endpoint can be called using a GET request and thus used in third-party webhooks. Please refer to our Run Actor API documentation.

Run Actor synchronously and get dataset items

POST

https://api.apify.com/v2/acts/drobnikj~extended-gpt-scraper/run-sync-get-dataset-items?token=<YOUR_API_TOKEN>

Note: This endpoint supports both POST and GET request methods. However, only the POST method allows you to pass input data. For more information, please refer to our Run Actor synchronously and get dataset items API documentation.

Get Actor

GET

https://api.apify.com/v2/acts/drobnikj~extended-gpt-scraper?token=<YOUR_API_TOKEN>

For more information, please refer to our Get Actor API documentation.

Actors can be used to scrape web pages, extract data, or automate browser tasks. Use the Extended GPT Scraper API programmatically via the Apify API.

You can choose from:

Extended GPT Scraper API in Python

Extended GPT Scraper API in JavaScript

Extended GPT Scraper API through CLI

Extended GPT Scraper OpenAPI definition

You can start Extended GPT Scraper with the Apify API by sending an HTTP POST request to the Run Actorendpoint. An Actor’s input and its content type can be passed as a payload of the POST request, and additional options can be specified using URL query parameters. The Extended GPT Scraper is identified within the API by its ID, which is the creator’s username and the name of the Actor.

When the Extended GPT Scraper run finishes you can list the data from its default dataset(storage) via the API or you can preview the data directly on Apify Console.

GPT Scraper

drobnikj/gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Jakub Drobník

5.8k

GPT Browser

anchor/gpt-browser

A GPT browser to use OpenAI prompt on any website. Put a list of URLs and a prompt, then the GPT agent will give you the answer you need. Fast, easy, and not limited with OpenAI ChatGPT restrictions. The best way to search and use GPT on large number of websites. Upload Excel or CSV. Screenshots 📸

Anchor

🔍 GPT Search [Private API]

openapi/gpt-search-private-api

Use OpenAI's GPT4o Search mode via API! No cookie or proxy is required. Fast, cheap and reliable.

Open API

Universal AI GPT Scraper

louisdeconinck/ai-gpt-scraper

Transform any website into structured data with AI-powered extraction. This versatile tool combines advanced web scraping with intelligent content analysis to deliver clean, customized JSON output - perfect for automating data collection from any web source.

Louis Deconinck

Auto GPT

lukaskrivka/auto-gpt

Run Auto GPT sessions directly on Apify. No OpenAI account or API token is required! Store parsed thoughts into datasets for later analysis.

Lukáš Křivka

194

ChatGPT

pertosh/chatgpt

You can use this Actor to transform scraped results, such as reviews from restaurants, by rephrasing the sentences. Additionally, translation is also supported. You can also use it to generate new website descriptions, keywords, and other similar metadata.

Alper

133

Website Social Scraper Api

oussemafr/website-social-scraper-api

You will get access to the Website Contact Details - Get Contact Info Efficiently!

Oussema FRIKHA

178

Pinecone GPT Chatbot

tri_angle/pinecone-gpt-chatbot

Pinecone GPT Chatbot combines OpenAI's GPT models with Pinecone's database to generate insightful responses. Its interactive chatbot interface presents precise and comprehensive answers to user queries. Benefit from semantic understanding, efficient workflows, and enriched knowledge integration!

Tri⟁angle

Extract-any-webpage-content-for-llm

ai-developer/extract-any-webpage-content-for-llm

Fast and easy way to extract data from any webpage and are LLM friendly. The tool lets you easily extract content from any website. Ideal for researchers, marketers, and developers.