Pricing

Pay per usage

Try for free

Go to Apify Store

Legacy PhantomJS Crawler

Try for free

Developed by

Apify

Replacement for the legacy Apify Crawler product with a backward-compatible interface. The actor uses PhantomJS headless browser to recursively crawl websites and extract data from them using a piece of front-end JavaScript code.

5.0 (6)

Pricing

Pay per usage

Last modified

a year ago

Developer tools

Open source

You can access the Legacy PhantomJS Crawler programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

Python

JavaScript

CLI

OpenAPI

HTTP

MCP

$echo '{
<  "startUrls": [
<    {
<      "key": "START",
<      "value": "https://www.example.com/"
<    }
<  ],
<  "crawlPurls": [
<    {
<      "key": "MY_LABEL",
<      "value": "https://www.example.com/[.*]"
<    }
<  ],
<  "clickableElementsSelector": "a:not([rel=nofollow])",
<  "pageFunction": "function pageFunction(context) {\\n    // called on every page the crawler visits, use it to extract data from it\\n    var $ = context.jQuery;\\n    var result = {\\n        title: $('\''title'\'').text(),\\n        myValue: $('\''TODO'\'').text()\\n    };\\n    return result;\\n}\\n",
<  "interceptRequest": "function interceptRequest(context, newRequest) {\\n    // called whenever the crawler finds a link to a new page,\\n    // use it to override default behavior\\n    return newRequest;\\n}\\n"
<}' |
<apify call apify/legacy-phantomjs-crawler --silent --output-dataset

Legacy PhantomJS Crawler - Crawl websites, extract data API through CLI

The Apify CLI is the official tool that allows you to use Legacy PhantomJS Crawler locally, providing convenience functions and automatic retries on errors.

Install the Apify CLI

$npm i -g apify-cli
$apify login

Other API clients include:

Legacy PhantomJS Crawler API in Python

Legacy PhantomJS Crawler API in JavaScript

Legacy PhantomJS Crawler OpenAPI definition

Legacy PhantomJS Crawler API

JSDOM Scraper

apify/jsdom-scraper

Parses the HTML using the JSDOM library, providing the same DOM API as browsers do (e.g. `window`). It is able to process client-side JavaScript without using a real browser. Performance-wise, it stands somewhere between the Cheerio Scraper and the browser scrapers.

Apify

101

4.3

Cheerio Scraper

apify/cheerio-scraper

Crawls websites using raw HTTP requests, parses the HTML with the Cheerio library, and extracts data from the pages using a Node.js code. Supports both recursive crawling and lists of URLs. This actor is a high-performance alternative to apify/web-scraper for websites that do not require JavaScript.

Apify

9.5K

4.8

Vanilla JS Scraper

mstephen190/vanilla-js-scraper

Scrape the web using familiar JavaScript methods! Crawls websites using raw HTTP requests, parses the HTML with the JSDOM package, and extracts data from the pages using Node.js code. Supports both recursive crawling and lists of URLs. This actor is a non jQuery alternative to CheerioScraper.

Matthias Stephens

474

Puppeteer Scraper

apify/puppeteer-scraper

Crawls websites with the headless Chrome and Puppeteer library using a provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and list of URLs. Supports login to website.

Apify

8.9K

5.0

Send Legacy PhantomJS Crawler Results

drobnikj/send-crawler-results

This actor downloads results from Legacy PhantomJS Crawler task and sends them to email as attachments. It is designed to run from finish webhook.

Jakub Drobník

Web Scraper

apify/web-scraper

Crawls arbitrary websites using a web browser and extracts structured data from web pages using a provided JavaScript function. The Actor supports both recursive crawling and lists of URLs, and automatically manages concurrency for maximum performance.

Apify

93K

4.5

Stealth Scraper

lolio9/stealth-scraper

A stealthy, headless browser-based scraper that mimics human behavior to avoid detection. Automatically saves every visited HTML page and downloadable file, incrementally archiving progress. Perfect for large websites, internal networks, or compliance-sensitive environments.

Marcus

BeautifulSoup Scraper

apify/beautifulsoup-scraper

Crawls websites using raw HTTP requests. It parses the HTML with the BeautifulSoup library and extracts data from the pages using Python code. Supports both recursive crawling and lists of URLs. This Actor is a Python alternative to Cheerio Scraper.

Apify

884

4.2

Solana Account Tokens API

websift/solana-account-tokens-api

Effortlessly retrieve SOL and SPL token balances for multiple Solana wallets with the Solana Balance API. Get real-time token prices, metadata, and detailed account information. Supports up to 5 wallets per run with built-in error handling for seamless performance.

Jacob

Actor Inspector Agent

jakub.kopecky/actor-inspector-agent

Agent Actor Inspector 🕵️‍♂️: An Apify Actor that rates others on docs 📝, inputs 🔍, code 💻, functionality ⚙️, performance ⏱️, and uniqueness 🌟. Config with actorId array, run, and review results. Helps devs improve, ensures quality, and guides users.