GPT Scraper avatar
GPT Scraper

Pricing

$9.00 / 1,000 pages

Go to Store
GPT Scraper

GPT Scraper

drobnikj/gpt-scraper

Developed by

Jakub Drobník

Maintained by Apify

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

4.4 (7)

Pricing

$9.00 / 1,000 pages

86

Monthly users

144

Runs succeeded

>99%

Response time

3.2 days

Last modified

3 months ago

JB

Not able to integrate with other actor

Closed
jen_brown_done opened this issue
6 months ago

When trying to integrate, using a placeholder to feed urls into gpt, it's not working. Would it be possible to see an example?

lukas.prusa avatar

Hi, thanks for opening this issue!

Can you please explain the issue more in-depth? What do you mean by "placeholder to feed urls into gpt? Where are the URLs coming from, and where do you want to pass them to?

JB

jen_brown_done

6 months ago

I'm trying to follow this tutorial to send a set of scraped urls to gpt scraper, it mentions resource.defaultdatasetid, but I'm unsure of where to input that when integrating with the gpt scraper https://blog.apify.com/connecting-scrapers-apify-integration/

lukas.prusa avatar

Right I see, yes that is the regular way to go about Actor2Actor integrations. Unfortunately we do not want to pollute the input schema of this Actor with this field and the dataset also has to be in the correct format.

A much easier solution would be to simply use the Forward Dataset to Actor or Task utility Actor. Basically you will use that one for the integration and it will call GPT scraper inside of it. Use this input to send the URLs over to this scraper:

1{
2    "datasetFieldName": {YOUR_URL_FIELD_NAME},
3    "datasetId": {{resource.defaultDatasetId}},
4    "format": "requestListSources",
5    "inputOverride": {
6        {YOUR_INPUT_FOR_GPT_SCRAPER_WITHOUT_START_URLS}
7    },
8    "targetFieldName": "startUrls",
9    "targetId": "paOtbjvyUiNsr1Qms",
10    "targetType": "ACTOR"
11}

And that's it, just input in your stuff for the placeholder variables and it should work :)

Let me know if this helps, thanks and happy scraping!

Pricing

Pricing model

Pay per result 

This Actor is paid per result. You are not charged for the Apify platform usage, but only a fixed price for each dataset of 1,000 items in the Actor outputs.

Price per 1,000 items

$9.00