# OnlineJobs.ph Scraper (`inatanli/onlinejobs-ph-scraper`) Actor

Extract job postings from OnlineJobs.ph — the largest marketplace for hiring Filipino virtual assistants and remote workers. Search by keyword, set how many pages to crawl, and get back a structured dataset ready for export.

- **URL**: https://apify.com/inatanli/onlinejobs-ph-scraper.md
- **Developed by:** [Ina Li](https://apify.com/inatanli) (community)
- **Categories:** Jobs
- **Stats:** 7 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $0.004 / result

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## OnlineJobs.ph Scraper

OnlineJobs.ph doesn't offer a public API, but this scraper acts as an unofficial **OnlineJobs.ph API** to help you extract the job data you need, when you need it, and at scale. Just type in a search keyword, set how many pages to crawl, and get clean, structured job listings exported in the format of your choice.

Built with [Crawlee](https://crawlee.dev/python/) (BeautifulSoup parser + httpx client) and deployed as an [Apify](https://apify.com/) Actor.

### What does OnlineJobs.ph Scraper do?

OnlineJobs.ph Scraper extracts job postings from [OnlineJobs.ph](https://www.onlinejobs.ph) — the largest marketplace for hiring Filipino virtual assistants and remote workers — and exports them into JSON, CSV, Excel, or HTML. You can then use this data in your own data projects, hiring pipelines, market reports, and applications.

- 🔎 Search **any keyword** (e.g. `virtual assistant`, `gtm`, `seo`) — or leave it blank to scrape all jobs
- 📄 Scrape job listings across **multiple search-result pages** (30 jobs per page)
- 🧾 Extract full job details: title, employment type, salary, last-updated date, description, and required skills
- 📤 Export data in multiple formats: JSON, CSV, Excel, or HTML
- 🔌 Connect to any AI chatbot or workflow using the Apify API, MCP server, webhooks, and SDKs (Python & Node.js)

### What data can you get from OnlineJobs.ph?

|                      |                         |                    |
| -------------------- | ----------------------- | ------------------ |
| 🧑‍💼 Job title         | ⏱ Employment type       | 💵 Salary          |
| 📅 Date last updated | 📝 Full job description | 🏷️ Required skills |
| 🔗 Job detail URL    |                         |                    |

> **Note:** `employerName` is **not** extracted — the employer section on OnlineJobs.ph is hidden behind a login wall for unauthenticated visitors.

### Why scrape OnlineJobs.ph?

- 🕵️ Research the Filipino remote-work and VA market by role, salary, and recency
- 📈 Monitor hiring trends and in-demand skills over time
- 💸 Benchmark salaries across roles and employment types
- 🗂 Build a custom database of available positions
- 🤺 Track competitors' job postings and hiring activity
- 📩 Feed an automated job-search or lead-generation pipeline

### How to use OnlineJobs.ph Scraper

OnlineJobs.ph Scraper is built for an easy start, even if you've never scraped before:

1. Create a free [Apify account](https://console.apify.com/sign-up) using your email.
2. Open OnlineJobs.ph Scraper.
3. Enter a **Job Keyword** and the **Max Pages** you want to crawl.
4. Click **Start** and wait for the data to be extracted.
5. Download your data in JSON, CSV, Excel, HTML, or XML.

### Input

The input is a simple JSON object. OnlineJobs.ph Scraper recognizes the following parameters:

| Parameter    | Type    | Required | Default | Description                                                         |
| ------------ | ------- | -------- | ------- | ------------------------------------------------------------------- |
| `jobKeyword` | string  | No       | —       | Search keyword (e.g. `virtual assistant`, `gtm`). Blank = all jobs. |
| `maxPages`   | integer | No       | `5`     | Maximum search-result pages to crawl (30 jobs per page).            |

#### Input example

```json
{
  "jobKeyword": "virtual assistant",
  "maxPages": 3
}
````

### Output

The results are wrapped into a dataset you can find in the **Storage** tab and access via the API. Each scraped job is a separate item in the dataset.

You can manage the results in any language (Python, PHP, Node.js). See the [API reference](https://docs.apify.com/api/v2) to learn more about getting results from this Actor.

#### Scraped OnlineJobs.ph job listings

The structure of each item looks like this:

```json
{
  "jobUrl": "https://www.onlinejobs.ph/jobseekers/job/Virtual-Assistant-1234567",
  "jobTitle": "Executive Virtual Assistant",
  "employmentType": "Full Time",
  "salary": "$4-6 per hour",
  "dateUpdated": "2026-05-31",
  "fullDescription": "We are looking for an experienced executive virtual assistant to manage calendars, handle email, and support day-to-day operations...",
  "skills": ["Email Management", "Calendar Management", "Customer Support"]
}
```

#### Output fields

| Field             | Type           | Description                                                 |
| ----------------- | -------------- | ----------------------------------------------------------- |
| `jobUrl`          | string (URL)   | Canonical job detail URL                                    |
| `jobTitle`        | string         | Job title                                                   |
| `employmentType`  | string         | `Full Time`, `Part Time`, `Gig`, or `Any`                   |
| `salary`          | string | null | Raw salary string (e.g. `$4-6 per hour`); `null` if absent  |
| `dateUpdated`     | string         | Date last updated, parsed to ISO `YYYY-MM-DD` when possible |
| `fullDescription` | string         | Job description, with `<ojfilter>` tags stripped            |
| `skills`          | string\[]       | Listed skills / top-skill labels                            |

### During the run

OnlineJobs.ph Scraper runs in two phases:

1. **Listing phase** — crawls each search-results page and collects job detail URLs.
2. **Detail phase** — visits each collected URL and extracts the structured fields above.

To respect the site's `robots.txt` (`Crawl-delay: 5`), the crawler runs **one request at a time** and caps its rate at **12 requests/minute**. Total requests per run are bounded by `maxPages` listing pages plus up to 30 detail pages each.

If a detail page can't be fetched, the job is still recorded (with empty fields) rather than silently dropped, so your dataset stays complete and predictable.

### Integrations

You can connect OnlineJobs.ph Scraper with almost any cloud service or web app. Apify offers integrations with Make, Zapier, Slack, Google Sheets, Google Drive, GitHub, and more, plus webhooks to trigger actions whenever a run finishes.

#### Using the Apify API

The [Apify API](https://docs.apify.com/api/v2) gives you programmatic access to run the Actor, schedule it, and fetch results. Use the [`apify-client`](https://docs.apify.com/api/client/js/) NPM package for Node.js or the [`apify-client`](https://docs.apify.com/api/client/python/) PyPI package for Python.

#### Using an MCP server

With the Apify API you can use this Actor through an MCP server and connect it to clients like Claude Desktop, or build your own. Read more about [setting up Apify Actors with MCP](https://docs.apify.com/platform/integrations/mcp).

### Is it legal to scrape OnlineJobs.ph?

This scraper only collects data that employers have chosen to publish publicly, and it honors the site's crawl-delay. It does not extract data behind the login wall (such as employer contact details).

# Actor input Schema

## `jobKeyword` (type: `string`):

Search keyword (e.g. virtual assistant, gtm). Leave blank to return all jobs.

## `maxPages` (type: `integer`):

Maximum number of search result pages to crawl (30 jobs per page).

## Actor input object example

```json
{
  "jobKeyword": "virtual assistant",
  "maxPages": 1
}
```

# Actor output Schema

## `results` (type: `string`):

Dataset items containing scraped job postings (jobUrl, jobTitle, employmentType, salary, dateUpdated, fullDescription, skills).

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "jobKeyword": "virtual assistant",
    "maxPages": 1
};

// Run the Actor and wait for it to finish
const run = await client.actor("inatanli/onlinejobs-ph-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "jobKeyword": "virtual assistant",
    "maxPages": 1,
}

# Run the Actor and wait for it to finish
run = client.actor("inatanli/onlinejobs-ph-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "jobKeyword": "virtual assistant",
  "maxPages": 1
}' |
apify call inatanli/onlinejobs-ph-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=inatanli/onlinejobs-ph-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "OnlineJobs.ph Scraper",
        "description": "Extract job postings from OnlineJobs.ph — the largest marketplace for hiring Filipino virtual assistants and remote workers. Search by keyword, set how many pages to crawl, and get back a structured dataset ready for export.",
        "version": "0.0",
        "x-build-id": "2LrOt3nLBxieyOBG5"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/inatanli~onlinejobs-ph-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-inatanli-onlinejobs-ph-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/inatanli~onlinejobs-ph-scraper/runs": {
            "post": {
                "operationId": "runs-sync-inatanli-onlinejobs-ph-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/inatanli~onlinejobs-ph-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-inatanli-onlinejobs-ph-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "jobKeyword": {
                        "title": "Job Keyword",
                        "type": "string",
                        "description": "Search keyword (e.g. virtual assistant, gtm). Leave blank to return all jobs."
                    },
                    "maxPages": {
                        "title": "Max Pages",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of search result pages to crawl (30 jobs per page).",
                        "default": 1
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
