Website Content Crawler
No credit card required
Website Content Crawler
No credit card required
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
Do you want to learn more about this Actor?
Get a demoIt seems that the API endpoint does not work, also when I test it it gives an error.
Hi,
Thank you for using Website Content Crawler.
I’m sorry, but I’m unable to access the run ID you’ve attached. It seems you may have attached an Actor ID instead.
Could you please provide the correct run ID again?
Additionally, could you specify which API endpoint is not working and share the error message you’re encountering?
Thank you, Jiri
this is the run: h7S7tw7FmZF4Xu8b0 attached the error message I get when I click on "test" of the "get output" endpoint. The same output I get when I pull the data from a google sheet via app script.
Oh, I see. Thank you for providing more details.
The endpoint you mentioned: Returns the output of this run from its default key-value store.
In your case, the default Output record in the key-value store is empty because the data was saved to the dataset instead.
I believe you’re looking for the Get dataset items
endpoint. You can use the following URL:
https://api.apify.com/v2/datasets/ro4ZdTb3nCOJPKBk1/items?token=apify_api_*
. Please change your token to make it work.
I understand that the description is misleading, and I’ll raise this issue internally.
Please let me know if this resolves your problem.
Actor Metrics
4k monthly users
-
839 stars
>99% runs succeeded
1 days response time
Created in Mar 2023
Modified 17 hours ago