Multilingual Corpus Builder avatar

Multilingual Corpus Builder

Deprecated

Pricing

from $0.50 / 1,000 dataset items

Go to Apify Store
Multilingual Corpus Builder

Multilingual Corpus Builder

Deprecated

Scrapes web content in multiple languages, extracts clean text, detects language, scores quality, and outputs LLM-ready training data (JSONL). Perfect for multilingual AI training datasets, corpus linguistics research, and bilingual NLP pipelines.

Pricing

from $0.50 / 1,000 dataset items

Rating

0.0

(0)

Developer

Peter PANG

Peter PANG

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

a month ago

Last modified

Share

The Actor has no README.md file. Sad!