Pinecone Integration
This integration transfers data from Apify Actors to Pinecone and is a good starting point for question-answering, search, or RAG use cases.
It has started failing, saying the chunk size is too big, but it is not.
It is just one Actor, but it keeps happening. I think the API is trying to process all 500 chunks in one go.
jiri.spilka
Hi, thank you for using the Pinecone integration.
The batching logic is handled by the official Pinecone implementation, which computes text embeddings in batches of 1,000.
I'm not sure why your chunk_size is set to 40,000 characters. In most cases, embeddings won't effectively capture the meaning of such long text. If you reduce it to 20,000 (which is still quite long), it should work.
I can also adjust the batch size (if you want), but I suspect the chunk_size might have been an unintentional copy-paste mistake.
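To make the two settings easier to tell apart, here is a minimal sketch of how chunk_size and batch size interact. It is not the integration's or Pinecone's actual code: split_into_chunks and batched are hypothetical helpers, and the fixed-width character split is an assumption made for illustration (real text splitters usually break on separators and may overlap chunks).

```python
from typing import Iterator, List


def split_into_chunks(text: str, chunk_size: int) -> List[str]:
    """Hypothetical helper: cut text into pieces of at most chunk_size characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]


def batched(items: List[str], batch_size: int) -> Iterator[List[str]]:
    """Hypothetical helper: yield consecutive groups of at most batch_size items."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]


# Toy numbers so the example runs instantly; the values discussed in this
# thread are chunk_size=40,000 characters and batches of 1,000 chunks.
chunks = split_into_chunks("a" * 95, chunk_size=40)
batches = list(batched(chunks, batch_size=2))

print([len(c) for c in chunks])   # [40, 40, 15]
print([len(b) for b in batches])  # [2, 1]

# With the numbers from this thread: 500 chunks and a batch size of 1,000.
thread_batches = list(batched(["x"] * 500, batch_size=1_000))
print(len(thread_batches))        # 1
```

chunk_size bounds the length of each individual text, while the batch size bounds how many texts are grouped together. Under this sketch, 500 chunks with a batch size of 1,000 end up in a single batch.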
Hope this helps!
Jiri