# Jobstreet & JobsDB Jobs Scraper · SE-Asia Salary & Filters (`memo23/jobstreet-scraper`) Actor

Scrape Jobstreet & JobsDB jobs across SE-Asia (MY, PH, SG, ID, VN, TH, HK). Paste any keyword/location search URL — returns title, company, location, salary, classification, work type, listing date, teaser & apply link as clean JSON/CSV. Pure HTTP, no browser

- **URL**: https://apify.com/memo23/jobstreet-scraper.md
- **Developed by:** [Muhamed Didovic](https://apify.com/memo23) (community)
- **Categories:** Jobs, AI, Agents
- **Stats:** 16 total users, 14 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $0.90 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Jobstreet & JobsDB Jobs Scraper

Scrape job listings from **Jobstreet** (Malaysia, Philippines, Singapore, Indonesia, Vietnam) and **JobsDB** (Thailand, Hong Kong) — the SEEK-powered job boards of South-East Asia. Paste any search URL and get clean, structured rows: title, company, location, salary, classification, work type, listing date, teaser and apply link. Pure HTTP, no browser.

![How it works](https://raw.githubusercontent.com/muhamed-didovic/muhamed-didovic.github.io/main/assets/how-it-works-jobstreet.png)

### Why use this scraper

- **All of SE-Asia in one actor** — one row shape across Jobstreet MY/PH/SG/ID/VN and JobsDB TH/HK. Country and locale are detected automatically from the URL you paste.
- **Search-URL driven** — copy any search from the site (keywords + location + filters) and it's paginated for you up to your item cap.
- **Clean, flat output** — one row per job, salary text preserved, classification split into category + sub-category, CSV-friendly by default.
- **Pure HTTP, no browser** — fast and cheap; no Playwright, no headless Chrome.
- **Optional employer email enrichment** — discover each employer's website and harvest a contact email (opt-in, billed only on a hit).

### What it does

Give it one or more Jobstreet/JobsDB **search** URLs. For each, it reads the live results and emits one dataset row per job with all the fields below. Searches paginate automatically until you hit `Maximum items`.

### Supported inputs

| Input | Example | Supported |
|---|---|---|
| Keyword search | `jobstreet.com.my/developer-jobs` | ✅ |
| Keyword + location | `jobstreet.com.ph/accounting-jobs/in-Manila` | ✅ |
| Query-string search | `jobstreet.com.my/jobs?keywords=marketing&where=Kuala+Lumpur` | ✅ |
| JobsDB search | `th.jobsdb.com/jobs?keywords=developer` | ✅ |
| Single job URL `/job/{id}` | — | ❌ paste a search URL instead |

### Use cases

- **Labour-market & salary research** across SE-Asia markets.
- **Recruitment & sourcing** — monitor new postings by keyword, city, or classification.
- **Competitive hiring intelligence** — track which companies are hiring for what.
- **Aggregation & job-board feeds** — pull fresh listings into your own product.

### How it works

1. You paste Jobstreet/JobsDB search URLs.
2. The actor detects the country, language and region from the page, then queries the site's own search service and paginates through results.
3. Each job is normalised into one flat row and pushed to the dataset.

### Input configuration

| Field | Type | Description |
|---|---|---|
| `startUrls` | array | Jobstreet/JobsDB **search** URLs (auto-paginated). |
| `maxItems` | integer | Hard cap on jobs collected. Default 10000. |
| `flatten` | boolean | JSON-stringify arrays into `*_json` columns for CSV (default on). |
| `enrichEmails` | boolean | Opt-in employer contact-email enrichment (billed per email found). |
| `maxConcurrency` | integer | Parallel row processing (mainly for enrichment). Default 8. |
| `proxy` | object | Proxy config. Defaults to Apify Residential. |

### Output sample

```json
{
  "portal": "jobstreet",
  "jobId": "92683012",
  "title": "Junior Software Developer",
  "company": "SEAGULL COOLING TECHNOLOGIES (ASIA PACIFIC) SDN. BHD.",
  "location": "Pusat Bandar Damansara, Kuala Lumpur",
  "classification": "Information & Communication Technology",
  "subClassification": "Engineering - Software",
  "workType": "Full time",
  "salaryDisplay": "RM 3,500 – RM 5,000 per month",
  "listingDate": "2026-06-12T03:02:16Z",
  "listingDateLabel": "8d ago",
  "teaser": "Work with senior developers, project teams, and business stakeholders to design, develop…",
  "bulletPoints": [],
  "canonicalUrl": "https://www.jobstreet.com.my/job/92683012",
  "scrapedAt": "2026-06-20T15:36:00.000Z"
}
````

### Key output fields

| Field | Description |
|---|---|
| `jobId` | Stable Jobstreet/JobsDB job id |
| `title` | Job title |
| `company` | Advertiser / employer name |
| `location` | Job location (city, region) |
| `classification` / `subClassification` | Category + sub-category |
| `workType` | Full time / Part time / Contract / Casual |
| `salaryDisplay` | Salary text as shown |
| `listingDate` / `listingDateLabel` | ISO date + relative label |
| `teaser` / `descriptionText` | Description snippet |
| `bulletPoints` | Highlight bullets (when present) |
| `canonicalUrl` | Link to the job on the site |
| `contactEmail` / `contactWebsite` | Employer contact (only when `enrichEmails` is on) |

### FAQ

**Does it return the full job description?** It returns the **teaser** (description snippet) plus all structured fields. The full HTML body is served by Jobstreet through an authenticated API and isn't included.

**Can I scrape a single `/job/{id}` URL?** No — paste a **search** URL (e.g. `jobstreet.com.my/developer-jobs`). Single-job detail is auth-walled.

**Which countries?** Jobstreet MY, PH, SG, ID, VN and JobsDB TH, HK — pass the matching country's search URL.

**Do I need a proxy?** Apify Residential (the default) is recommended for larger runs.

### Support

Found an issue or want a field added? Open an issue on the actor's Issues tab and we'll take a look.

### Additional services

Need employer contact emails? Turn on `enrichEmails` to append a best-effort contact email + website per employer (billed only when an email is found).

### Explore more scrapers

Check the publisher's profile for more job-board and directory scrapers (Seek, Naukri, StepStone, Glassdoor and more).

### ⚠️ Disclaimer

This actor collects only publicly available job-listing data for legitimate research, recruitment and aggregation use. Respect Jobstreet/JobsDB terms of service and applicable laws (including data-protection rules) in your jurisdiction. You are responsible for how you use the scraped data.

### SEO Keywords

jobstreet scraper, jobsdb scraper, jobstreet api, jobstreet malaysia jobs, jobstreet philippines, jobstreet singapore, jobstreet indonesia, jobsdb thailand, jobsdb hong kong, seek asia jobs, south east asia job scraper, job listings scraper, salary data scraper, recruitment data, hiring intelligence

# Actor input Schema

## `startUrls` (type: `array`):

Full Jobstreet/JobsDB search URLs. Each is paginated automatically until `Maximum items` is reached.

## `flatten` (type: `boolean`):

When enabled (default), arrays (e.g. `bulletPoints`) are JSON-stringified into `*_json` fields for CSV. Disable to keep the full nested JSON.

## `enrichEmails` (type: `boolean`):

If enabled, tries to find a contact email for each employer by discovering the company's website (Clearbit) and reading its contact/about pages. Adds contactEmail + contactWebsite columns plus a detailed emailEnrichment object. Best-effort, billed per contact email found; only charged when an email is returned, never for misses.

## `maxItems` (type: `integer`):

Hard cap on the number of jobs collected. Each search returns 30 results/page and paginates automatically; use this cap to control billing.

## `maxConcurrency` (type: `integer`):

Maximum number of rows processed in parallel (mainly relevant when employer-email enrichment is on). 6-12 is the sweet spot.

## `maxRequestRetries` (type: `integer`):

Number of retries before a failed request is given up.

## `proxy` (type: `object`):

Defaults to Apify Residential. Jobstreet/JobsDB serve data cleanly to good fingerprints; residential IPs are recommended for large runs.

## Actor input object example

```json
{
  "startUrls": [
    "https://www.jobstreet.com.my/developer-jobs",
    "https://www.jobstreet.com.ph/accounting-jobs",
    "https://www.jobstreet.com.my/jobs?keywords=marketing&where=Kuala+Lumpur"
  ],
  "flatten": true,
  "enrichEmails": false,
  "maxItems": 10000,
  "maxConcurrency": 8,
  "maxRequestRetries": 5,
  "proxy": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrls": [
        "https://www.jobstreet.com.my/developer-jobs",
        "https://www.jobstreet.com.ph/accounting-jobs",
        "https://www.jobstreet.com.my/jobs?keywords=marketing&where=Kuala+Lumpur"
    ],
    "proxy": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ]
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("memo23/jobstreet-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "startUrls": [
        "https://www.jobstreet.com.my/developer-jobs",
        "https://www.jobstreet.com.ph/accounting-jobs",
        "https://www.jobstreet.com.my/jobs?keywords=marketing&where=Kuala+Lumpur",
    ],
    "proxy": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}

# Run the Actor and wait for it to finish
run = client.actor("memo23/jobstreet-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "startUrls": [
    "https://www.jobstreet.com.my/developer-jobs",
    "https://www.jobstreet.com.ph/accounting-jobs",
    "https://www.jobstreet.com.my/jobs?keywords=marketing&where=Kuala+Lumpur"
  ],
  "proxy": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}' |
apify call memo23/jobstreet-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=memo23/jobstreet-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Jobstreet & JobsDB Jobs Scraper · SE-Asia Salary & Filters",
        "description": "Scrape Jobstreet & JobsDB jobs across SE-Asia (MY, PH, SG, ID, VN, TH, HK). Paste any keyword/location search URL — returns title, company, location, salary, classification, work type, listing date, teaser & apply link as clean JSON/CSV. Pure HTTP, no browser",
        "version": "0.0",
        "x-build-id": "PGGZOrneaFm0bqcyq"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/memo23~jobstreet-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-memo23-jobstreet-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/memo23~jobstreet-scraper/runs": {
            "post": {
                "operationId": "runs-sync-memo23-jobstreet-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/memo23~jobstreet-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-memo23-jobstreet-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "startUrls": {
                        "title": "Jobstreet / JobsDB search URLs",
                        "type": "array",
                        "description": "Full Jobstreet/JobsDB search URLs. Each is paginated automatically until `Maximum items` is reached.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "flatten": {
                        "title": "Flatten nested fields for CSV-friendly output",
                        "type": "boolean",
                        "description": "When enabled (default), arrays (e.g. `bulletPoints`) are JSON-stringified into `*_json` fields for CSV. Disable to keep the full nested JSON.",
                        "default": true
                    },
                    "enrichEmails": {
                        "title": "Enrich with employer contact emails (experimental, billed per email)",
                        "type": "boolean",
                        "description": "If enabled, tries to find a contact email for each employer by discovering the company's website (Clearbit) and reading its contact/about pages. Adds contactEmail + contactWebsite columns plus a detailed emailEnrichment object. Best-effort, billed per contact email found; only charged when an email is returned, never for misses.",
                        "default": false
                    },
                    "maxItems": {
                        "title": "Maximum items to scrape",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Hard cap on the number of jobs collected. Each search returns 30 results/page and paginates automatically; use this cap to control billing.",
                        "default": 10000
                    },
                    "maxConcurrency": {
                        "title": "Max concurrency",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of rows processed in parallel (mainly relevant when employer-email enrichment is on). 6-12 is the sweet spot.",
                        "default": 8
                    },
                    "maxRequestRetries": {
                        "title": "Max request retries",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Number of retries before a failed request is given up.",
                        "default": 5
                    },
                    "proxy": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Defaults to Apify Residential. Jobstreet/JobsDB serve data cleanly to good fingerprints; residential IPs are recommended for large runs."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
