GPT Scraper avatar
GPT Scraper

Pricing

$9.00 / 1,000 pages

Go to Store
GPT Scraper

GPT Scraper

Developed by

Jakub Drobník

Jakub Drobník

Maintained by Apify

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

4.4 (7)

Pricing

$9.00 / 1,000 pages

99

Total users

5.8k

Monthly users

168

Runs succeeded

>99%

Last modified

4 months ago

JoseJet avatar

Feature Request: Include information about the ratio between Generated and Sent Content to the dataset

Open

Pepa <b>J</b> (JoseJet) opened this issue
5 months ago

For websites with a lot of content, the information in sentContent doesn't include all the information from the page.

Would it be possible to introduce some indicator of not the content from whole page was used?

Possibly some ratio of value between 0 and 1 - like 0.64 = 64% of the generated markdown was sent/used for the prompt.

This would help us to investigate issues related to the content was cutoff in the middle and therefore right results were not provided to the dataset.

lukas.prusa avatar

Hi, thanks for the suggestion!

This seems like a reasonable feature to be added into the scraper :) We will add it in.

I will keep you updated here, thanks!