Extended GPT Scraper

drobnikj/extended-gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Rate limit needs to be handled.

Closed

maroon_herb opened this issue

I've been bumping up against the rate limit, unfortunately. Also, it seems OpenAI isn't currently accepting requests for an increased GPT-4 rate limit, and my queries don't work well with GPT-3.5, so I'm a bit stuck at the moment.

When I try to beef up the resources for the scraper, it quickly hits the rate limit and just keeps on trying, until it finally gives up and moves on to the next record. The process just keeps repeating, and I'm not getting any valuable output.

I also tried the opposite approach: scaling the machine spec down until I stopped hitting the rate limit. That only happened at 512 MB RAM, but at that size I ran into memory limits and the run just crashes. So, that's another dead end.

My suggestion on how to resolve this is:

Read and use the rate-limit values that OpenAI returns in its response headers. If a response includes "x-ratelimit-limit-requests: X", pace the requests to at most X per minute.
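To make the pacing idea concrete, here is a minimal sketch in Python. It assumes the header value is a plain integer and that it represents a per-minute cap, as described above; the function name `request_interval_seconds` is made up for illustration and is not part of the Actor or of any OpenAI library.

```python
from typing import Mapping, Optional


def request_interval_seconds(headers: Mapping[str, str]) -> Optional[float]:
    """Return the minimum spacing (in seconds) between requests.

    Hypothetical helper: assumes `x-ratelimit-limit-requests` is an integer
    number of requests allowed per minute. Returns None when the header is
    missing or cannot be interpreted, so the caller can fall back to a default.
    """
    # Header names are case-insensitive, so normalize before the lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    raw = lowered.get("x-ratelimit-limit-requests")
    if raw is None:
        return None
    try:
        limit = int(raw)
    except ValueError:
        return None
    if limit <= 0:
        return None
    return 60.0 / limit
```

With a limit of 60 this gives one request per second; the scraper would sleep for at least that long between calls.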

How about adding TPM (tokens per minute) and RPM (requests per minute) limits to the input menu? This could help us stay under the limit, and users might want some control so they can set a rate lower than the maximum allowed for a variety of reasons.
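One possible shape for such a limiter is a sliding 60-second window over both request count and token count. The sketch below is only an illustration under that assumption: the class name `MinuteRateLimiter`, its parameters `max_rpm` and `max_tpm`, and the injectable `clock` are all hypothetical and do not exist in the Actor today.

```python
import time
from collections import deque
from typing import Callable, Deque, Tuple


class MinuteRateLimiter:
    """Hypothetical sliding-window limiter for user-supplied RPM and TPM caps."""

    def __init__(self, max_rpm: int, max_tpm: int,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.max_rpm = max_rpm
        self.max_tpm = max_tpm
        self._clock = clock  # injectable so the limiter can be tested without sleeping
        self._events: Deque[Tuple[float, int]] = deque()  # (timestamp, tokens)

    def _prune(self, now: float) -> None:
        # Drop entries that have left the 60-second window.
        while self._events and now - self._events[0][0] >= 60.0:
            self._events.popleft()

    def wait_time(self, tokens: int) -> float:
        """Seconds to wait until a request using `tokens` fits under both caps."""
        if tokens > self.max_tpm:
            raise ValueError("a single request exceeds the TPM cap")
        now = self._clock()
        self._prune(now)
        requests_used = len(self._events)
        tokens_used = sum(t for _, t in self._events)
        wait = 0.0
        # Release the oldest entries one by one until the new request fits.
        for timestamp, used in self._events:
            if requests_used < self.max_rpm and tokens_used + tokens <= self.max_tpm:
                break
            wait = timestamp + 60.0 - now
            requests_used -= 1
            tokens_used -= used
        return max(wait, 0.0)

    def record(self, tokens: int) -> None:
        """Register a request that was just sent."""
        self._events.append((self._clock(), tokens))
```

A caller would sleep for `wait_time(estimated_tokens)` seconds, send the request, and then call `record(...)` with the token count. The token estimate itself is left to the caller.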

Lastly, when a rate limit response comes back, let's take a break from making any new requests for the duration of 'x-ratelimit-reset-requests' and 'x-ratelimit-reset-tokens'. If the rate limit issues persist, try an exponential backoff strategy until things look better.
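Roughly, the pause could be computed like this. The sketch assumes the reset headers carry duration strings such as "1s", "6m0s", or "250ms"; the helper names `parse_reset` and `pause_seconds`, and the `base` and `cap` defaults, are invented for illustration. It returns whichever is longer: the reset time reported by the headers or an exponential backoff based on how many rate-limit responses have arrived in a row.

```python
import re
from typing import Mapping

# Assumed duration format: one or more number+unit pairs, e.g. "1s", "6m0s", "250ms".
_DURATION = re.compile(r"(?:\d+(?:\.\d+)?(?:ms|h|m|s))+")
_PART = re.compile(r"(\d+(?:\.\d+)?)(ms|h|m|s)")
_UNIT_SECONDS = {"h": 3600.0, "m": 60.0, "s": 1.0, "ms": 0.001}


def parse_reset(value: str) -> float:
    """Convert a duration string such as '6m0s' into seconds."""
    value = value.strip()
    if not _DURATION.fullmatch(value):
        raise ValueError(f"unrecognized duration: {value!r}")
    return sum(float(number) * _UNIT_SECONDS[unit]
               for number, unit in _PART.findall(value))


def pause_seconds(headers: Mapping[str, str], attempt: int,
                  base: float = 1.0, cap: float = 60.0) -> float:
    """How long to pause after a rate-limit response.

    `attempt` counts consecutive rate-limit responses, starting at 0.
    `base` and `cap` are hypothetical tuning knobs for the backoff.
    """
    lowered = {name.lower(): value for name, value in headers.items()}
    header_wait = 0.0
    for name in ("x-ratelimit-reset-requests", "x-ratelimit-reset-tokens"):
        raw = lowered.get(name)
        if raw is None:
            continue
        try:
            header_wait = max(header_wait, parse_reset(raw))
        except ValueError:
            pass  # ignore values in an unexpected format
    # Exponential backoff: base, 2*base, 4*base, ... capped at `cap`.
    backoff = min(cap, base * (2 ** attempt))
    return max(header_wait, backoff)
```

Adding random jitter to the backoff would help when several workers hit the limit at once; it is left out here to keep the sketch short.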

designer_filter

same issue here

Jakub Drobník (drobnikj)

Hey Markus,

The new input fields make sense, and I'm considering adding them in some form.

I will let you know when it is done.

Pavlína Vencovská (paja)

Hi, a little update: it's in our backlog, but unfortunately it's not a priority at the moment. Nevertheless, some rate limit management is a good idea, and we hope to get back to it in the foreseeable future. Thanks for the suggestion and your patience!

Pavlína Vencovská (paja)

Hi again, I'm going to close the issue for now, but we're still hoping to get back to it. It might take some time though. Happy Scraping in the meantime!

Developer

Jakub Drobník

Actor metrics

83 monthly users
21 stars
98.8% runs succeeded
1.4 days response time
Created in Jun 2023
Modified 2 months ago

Categories

Lead generation

Developer tools
