Pricing

$9.00 / 1,000 pages

Try for free

Go to Store

GPT Scraper

Try for free

Developed by

Jakub Drobník

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

4.4 (7)

Pricing

$9.00 / 1,000 pages

108

Total users

Monthly users

105

Runs succeeded

99%

Issues response

2.3 days

Last modified

6 months ago

Lead generation

Back to issues Create new issue

Not able to integrate with other actor

Closed

jen_brown_done opened this issue

When trying to integrate, using a placeholder to feed urls into gpt, it's not working. Would it be possible to see an example?

Prukáš Lůša (lukas.prusa)

Hi, thanks for opening this issue!

Can you please explain the issue more in-depth? What do you mean by "placeholder to feed urls into gpt? Where are the URLs coming from, and where do you want to pass them to?

jen_brown_done

I'm trying to follow this tutorial to send a set of scraped urls to gpt scraper, it mentions resource.defaultdatasetid, but I'm unsure of where to input that when integrating with the gpt scraper https://blog.apify.com/connecting-scrapers-apify-integration/

Prukáš Lůša (lukas.prusa)

Right I see, yes that is the regular way to go about Actor2Actor integrations. Unfortunately we do not want to pollute the input schema of this Actor with this field and the dataset also has to be in the correct format.

A much easier solution would be to simply use the Forward Dataset to Actor or Task utility Actor. Basically you will use that one for the integration and it will call GPT scraper inside of it. Use this input to send the URLs over to this scraper:

{
    "datasetFieldName": {YOUR_URL_FIELD_NAME},
    "datasetId": {{resource.defaultDatasetId}},
    "format": "requestListSources",
    "inputOverride": {
        {YOUR_INPUT_FOR_GPT_SCRAPER_WITHOUT_START_URLS}
    },
    "targetFieldName": "startUrls",
    "targetId": "paOtbjvyUiNsr1Qms",
    "targetType": "ACTOR"
}

And that's it, just input in your stuff for the placeholder variables and it should work :)

Let me know if this helps, thanks and happy scraping!

Add comment

Extended GPT Scraper

drobnikj/extended-gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Jakub Drobník

1.5K

4.6

GPT Browser

anchor/gpt-browser

A GPT browser to use OpenAI prompt on any website. Put a list of URLs and a prompt, then the GPT agent will give you the answer you need. Fast, easy, and not limited with OpenAI ChatGPT restrictions. The best way to search and use GPT on large number of websites. Upload Excel or CSV. Screenshots 📸

Anchor

Auto GPT

lukaskrivka/auto-gpt

Run Auto GPT sessions directly on Apify. No OpenAI account or API token is required! Store parsed thoughts into datasets for later analysis.

Lukáš Křivka

199

🔍 GPT Search [Private API]

openapi/gpt-search-private-api

Use OpenAI's GPT4o Search mode via API! No cookie or proxy is required. Fast, cheap and reliable.

Open API

5.0

GPT Search

tri_angle/gpt-search

Send queries to ChatGPT and retrieve structured answers with full source citations. Easily integrate into your tools or workflows for flexible, scalable AI-powered solutions.

Tri⟁angle

ChatGPT

pertosh/chatgpt

You can use this Actor to transform scraped results, such as reviews from restaurants, by rephrasing the sentences. Additionally, translation is also supported. You can also use it to generate new website descriptions, keywords, and other similar metadata.

Alper

147

OpenAI Vector Store Integration

jiri.spilka/openai-vector-store-integration

The Apify OpenAI Vector Store integration uploads data from Apify Actors to the OpenAI Vector Store linked to OpenAI Assistant.

Jiří Spilka

180

4.8

Universal AI GPT Scraper

louisdeconinck/ai-gpt-scraper

Transform any website into structured data with AI-powered extraction. This versatile tool combines advanced web scraping with intelligent content analysis to deliver clean, customized JSON output - perfect for automating data collection from any web source.