
GPT Scraper
Pricing
$9.00 / 1,000 pages

GPT Scraper
Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.
4.4 (7)
Pricing
$9.00 / 1,000 pages
99
Total users
5.8k
Monthly users
168
Runs succeeded
>99%
Last modified
4 months ago
Feature Request: Include information about the ratio between Generated and Sent Content to the dataset
Open
For websites with a lot of content, the information in sentContent
doesn't include all the information from the page.
Would it be possible to introduce some indicator of not the content from whole page was used?
Possibly some ratio of value between 0
and 1
- like 0.64
= 64% of the generated markdown was sent/used for the prompt.
This would help us to investigate issues related to the content was cutoff in the middle and therefore right results were not provided to the dataset.

Hi, thanks for the suggestion!
This seems like a reasonable feature to be added into the scraper :) We will add it in.
I will keep you updated here, thanks!