apify_animation_01_02

Machine learning

Generate large-scale datasets from the web for the training of your artificial intelligence models

Machine learning

The complexity of your problem and algorithms increases the size of the dataset needed. Don’t limit your ideas and use the vast amount of publicly available data on the Internet to feed and train your models.

Natural language processing

Build a program to process and analyze large amounts of “natural language data” such as reviews. For instance, our Yelp crawler checks the web for the latest reviews of selected restaurants. Or get reviews from the Google Play Store for your favorite app.

Try Yelp Review Extractor actor
Try Google Places Scraper actor
Try Google Play Store App Reviews actor

Natural language processing
Image recognition

Image recognition

Many of the latest technological innovations rely on image recognition. In order to train self-driving cars, diagnostics imaging software, or simply the face-unlock feature in our smartphones, you need a colossal number of images.

News aggregation

Upgrade and train your models by adding new data from crawling global news sources. Track public sentiment, identify relationships, spot fake news, and gather up-to-date intelligence.

Try Article Text Extractor actor
Read more

News aggregation