PDF to RAG Markdown Chunks for Embeddings

Pricing

from $3.00 / 1,000 pages parsed

Convert PDFs into token-bounded Markdown chunks for RAG, embeddings, and vector databases (Pinecone, Chroma, Weaviate, Qdrant). Set maxTokens and overlap, and get clean chunks, each with its page number, token count, and a SHA-256 content hash for deduplication. The output is a JSON dataset ready for any LLM pipeline.
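To make the chunking idea concrete, here is a minimal Python sketch of token-bounded chunking with overlap and a SHA-256 content hash. It is an illustration only, not the Actor's actual implementation: the function name `chunk_text`, the output field names (`page`, `tokenCount`, `contentHash`, `text`), and the whitespace-based token count are all assumptions. A real pipeline would likely count tokens with the tokenizer of its embedding model.

```python
import hashlib


def chunk_text(text, page, max_tokens=200, overlap=20):
    """Split text into windows of at most max_tokens tokens.

    Assumption: a "token" is approximated by a whitespace-separated word.
    Consecutive windows share `overlap` tokens.
    """
    if not 0 <= overlap < max_tokens:
        raise ValueError("overlap must be >= 0 and smaller than max_tokens")

    tokens = text.split()
    step = max_tokens - overlap  # how far each window advances
    chunks = []

    for start in range(0, len(tokens), step):
        window = tokens[start:start + max_tokens]
        body = " ".join(window)
        chunks.append({
            "page": page,                 # hypothetical field name
            "tokenCount": len(window),    # hypothetical field name
            # Identical chunk text always yields the same hash, so the
            # hash can serve as a deduplication key.
            "contentHash": hashlib.sha256(body.encode("utf-8")).hexdigest(),
            "text": body,
        })
        # Stop once the window has reached the end of the text, so no
        # trailing chunk consists purely of already-covered tokens.
        if start + max_tokens >= len(tokens):
            break

    return chunks


if __name__ == "__main__":
    sample = " ".join(f"w{i}" for i in range(10))
    for chunk in chunk_text(sample, page=1, max_tokens=4, overlap=1):
        print(chunk["page"], chunk["tokenCount"], chunk["contentHash"][:12], chunk["text"])
```

Before upserting into a vector database, chunks whose `contentHash` has already been seen can be skipped, which avoids embedding repeated headers or boilerplate pages twice.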

Rating

0.0

(0)

Developer

Adam

Maintained by Community

Actor stats

Bookmarked: 0

Total users: 0

Monthly active users: 0

Last modified: 20 hours ago
