Japanese Website Content Crawler for RAG
Pricing
from $0.001 / result
Japanese Website Content Crawler for RAG
日本語のドキュメント、ヘルプセンター、ブログ、製品サイトをクロールし、RAG、ベクトルDB、LLMアプリ、社内検索に使いやすいMarkdown、テキスト、HTMLとして抽出します。
Japanese Website Content Crawler for RAG
Pricing
from $0.001 / result
日本語のドキュメント、ヘルプセンター、ブログ、製品サイトをクロールし、RAG、ベクトルDB、LLMアプリ、社内検索に使いやすいMarkdown、テキスト、HTMLとして抽出します。
You can access the Japanese Website Content Crawler for RAG programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.
$echo '{< "startUrls": [< {< "url": "https://www.digital.go.jp/resources/introduction-to-web-accessibility-guidebook"< }< ],< "sitemapUrls": [< "https://www.digital.go.jp/sitemap.xml"< ],< "excludeUrlGlobs": [< "**/search/**",< "**/login/**"< ]<}' |<apify call nezha/website-content-crawler-japan --silent --output-datasetThe Apify CLI is the official tool that allows you to use Japanese Website Content Crawler for RAG locally, providing convenience functions and automatic retries on errors.
Using installation script (macOS/Linux):
$curl -fsSL https://apify.com/install-cli.sh | bashUsing installation script (Windows):
$irm https://apify.com/install-cli.ps1 | iexUsing Homebrew:
$brew install apify-cliUsing npm:
$npm install -g apify-cliOther API clients include: