
Web Scraper
apify/web-scraper
Crawls arbitrary websites using the Chrome browser and extracts structured data from web pages using a provided JavaScript function. The Actor supports both recursive crawling and lists of URLs, and automatically manages concurrency for maximum performance.
77.5k
4.5/5

Legacy PhantomJS Crawler
apify/legacy-phantomjs-crawler
Replacement for the legacy Apify Crawler product with a backward-compatible interface. The Actor uses the PhantomJS headless browser to recursively crawl websites and extract data from them using front-end JavaScript code.
1.6k
5.0/5

Puppeteer Scraper
apify/puppeteer-scraper
Crawls websites with headless Chrome and the Puppeteer library using provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and lists of URLs. Supports logging in to websites.
5.6k
5.0/5

Cheerio Scraper
apify/cheerio-scraper
Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using Node.js code. Supports both recursive crawling and lists of URLs. This Actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.
6.6k
4.7/5

Vanilla JS Scraper
mstephen190/vanilla-js-scraper
Scrape the web using familiar JavaScript methods! Crawls websites using raw HTTP requests, parses the HTML with the JSDOM package, and extracts data from the pages using Node.js code. Supports both recursive crawling and lists of URLs. This Actor is a non-jQuery alternative to Cheerio Scraper.
445

Playwright Scraper
apify/playwright-scraper
Crawls websites with a headless Chromium, Chrome, or Firefox browser and the Playwright library using provided server-side Node.js code. Supports both recursive crawling and lists of URLs. Supports logging in to websites.
1.1k
4.3/5

JSDOM Scraper
apify/jsdom-scraper
Parses the HTML using the JSDOM library, providing the same DOM API as browsers do (e.g. `window`). It is able to process client-side JavaScript without using a real browser. Performance-wise, it stands somewhere between the Cheerio Scraper and the browser scrapers.
87
4.3/5

Actor Readme Generator
apify/actor-readme-generator
Generates READMEs for scrapers using ChatGPT, based on an Apify-approved template.
17
4.6/5

Airtable Exporter
jupri/airtable-exporter
💫 Export Dataset to Airtable
105
3.5/5

BeautifulSoup Scraper
apify/beautifulsoup-scraper
Crawls websites using raw HTTP requests. It parses the HTML with the BeautifulSoup library and extracts data from the pages using Python code. Supports both recursive crawling and lists of URLs. This Actor is a Python alternative to Cheerio Scraper.
788
4.4/5

Forward Dataset to Actor or Task
valek.josef/forward-dataset-to-actor-or-task
Forwards the contents of a specified dataset to a specified field on the input of another Actor or task.
5

OpenAI Vector Store Integration
jiri.spilka/openai-vector-store-integration
The Apify OpenAI Vector Store integration uploads data from Apify Actors to the OpenAI Vector Store linked to an OpenAI Assistant.
128
4.8/5

Pinecone Integration
apify/pinecone-integration
This integration transfers data from Apify Actors to a Pinecone vector database and is a good starting point for a question-answering, search, or RAG use case.
234
4.6/5

Chroma Integration
apify/chroma-integration
This integration transfers data from Apify Actors to a Chroma vector database and is a good starting point for a question-answering, search, or RAG use case.
1
4.5/5

Qdrant Integration
apify/qdrant-integration
Transfer data from Apify Actors to a Qdrant vector database.
29
4.5/5

PGVector Integration
apify/pgvector-integration
This integration transfers data from Apify Actors to a PostgreSQL database (with the PGVector extension).
7
4.6/5

Weaviate Integration
apify/weaviate-integration
This integration transfers data from Apify Actors to a Weaviate vector database and is a good starting point for a question-answering, search, or RAG use case.
4
4.7/5

Milvus Integration
apify/milvus-integration
This integration transfers data from Apify Actors to a Milvus/Zilliz database and is a good starting point for a question-answering, search, or RAG use case.
8
4.5/5

OpenSearch Integration
apify/opensearch-integration
Transfer data from Apify Actors to Amazon OpenSearch Service. This Actor is a good starting point for building question-answering systems, search functionality, or Retrieval-Augmented Generation (RAG) use cases.
3
4.4/5

SuperScraper API
apify/super-scraper-api
Generic REST API for scraping websites: send a URL and get back HTML. This Actor is a drop-in replacement for the ScrapingBee, ScrapingAnt, and ScraperAPI services. It is also open source!
588
4.2/5