# Daijob.com Scraper (`unfenced-group/daijob-scraper`) Actor

Scrape Japan's leading bilingual job board. Extract job titles, salaries in JPY, Japanese/English level requirements, location, and full descriptions. 8,000+ listings. No API key required.

- **URL**: https://apify.com/unfenced-group/daijob-scraper.md
- **Developed by:** [Unfenced Group](https://apify.com/unfenced-group) (community)
- **Categories:** Jobs
- **Stats:** 2 total users, 1 monthly user, 100.0% runs succeeded
- **User rating**: No ratings yet

## Pricing

from $1.49 / 1,000 results

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action that can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Daijob.com Scraper

![Daijob.com Scraper](https://i.imgur.com/LV6Bx3l.png)

Extract job listings from [Daijob.com](https://www.daijob.com), Japan's leading job board for bilingual and multilingual professionals. Retrieve structured data including full job descriptions, salary details, language requirements, and company information — no API key required.

---

### Why this scraper?

#### 📋 Complete bilingual job data
Captures all fields unique to Daijob: Japanese level (JLPT), English level (TOEIC), and other language requirements alongside the full job description.

#### 💴 Structured salary output
Salary ranges extracted as numeric fields (`salaryMin`, `salaryMax` in JPY) with currency and period — ready for filtering and analysis without string parsing.
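
As a minimal sketch of what the numeric salary fields make possible, the filter below keeps listings whose yearly JPY range overlaps a target band. The helper name `filter_by_salary` and the sample records are illustrative. It also assumes that listings without a stated salary carry `null` in `salaryMin` and `salaryMax`, which this page does not confirm.

```python
from typing import Iterable, List, Optional


def filter_by_salary(
    items: Iterable[dict],
    min_jpy: Optional[int] = None,
    max_jpy: Optional[int] = None,
) -> List[dict]:
    """Keep yearly JPY listings whose salary range overlaps [min_jpy, max_jpy]."""
    kept = []
    for item in items:
        low, high = item.get("salaryMin"), item.get("salaryMax")
        # Assumption: listings without a stated salary have None here; skip them.
        if low is None and high is None:
            continue
        # Only compare like with like: yearly amounts in JPY.
        if item.get("salaryCurrency") != "JPY" or item.get("salaryPeriod") != "YEAR":
            continue
        # If only one bound is present, treat the range as a single point.
        low = low if low is not None else high
        high = high if high is not None else low
        if min_jpy is not None and high < min_jpy:
            continue
        if max_jpy is not None and low > max_jpy:
            continue
        kept.append(item)
    return kept


# Hypothetical records shaped like the Actor's output.
sample = [
    {"id": "1", "salaryMin": 3000000, "salaryMax": 4000000, "salaryCurrency": "JPY", "salaryPeriod": "YEAR"},
    {"id": "2", "salaryMin": 8000000, "salaryMax": 10000000, "salaryCurrency": "JPY", "salaryPeriod": "YEAR"},
    {"id": "3", "salaryMin": None, "salaryMax": None, "salaryCurrency": None, "salaryPeriod": None},
]
matches = filter_by_salary(sample, min_jpy=3500000, max_jpy=6000000)
print([m["id"] for m in matches])
```

Overlap matching is a design choice: a posting advertised at 3–4 million JPY still matches a 3.5 million floor because its upper bound clears it.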

#### 🗾 Full location hierarchy
Location data delivered at three levels: country, prefecture (e.g. Tokyo, Kanagawa), and city for precise geographic filtering.

#### 📝 Descriptions in three formats
Every job description delivered as HTML, plain text, and Markdown — compatible with any downstream pipeline or LLM ingestion.

#### 🔁 Cross-run deduplication
Built-in 90-day deduplication store prevents re-processing listings already captured in earlier runs, reducing costs on scheduled scrapes.
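
The Actor's internal deduplication logic is not documented beyond the 90-day window, so the following is only a sketch of how such a store could behave. It keys on the job `id` and uses an in-memory dict as a stand-in for persistent storage. Both choices are assumptions, and `split_new_and_seen` is a hypothetical helper.

```python
from datetime import datetime, timedelta, timezone

TTL = timedelta(days=90)  # matches the documented 90-day window


def split_new_and_seen(items, seen, now=None):
    """Return (new_items, skipped_items) and record new ids in `seen`.

    `seen` maps job id -> datetime of first sighting; it stands in for a
    persistent store and is purely illustrative.
    """
    now = now or datetime.now(timezone.utc)
    # Drop entries older than the TTL so expired jobs count as new again.
    for job_id in [k for k, ts in seen.items() if now - ts > TTL]:
        del seen[job_id]
    new_items, skipped = [], []
    for item in items:
        if item["id"] in seen:
            skipped.append(item)
        else:
            seen[item["id"]] = now
            new_items.append(item)
    return new_items, skipped


seen_store = {}
day0 = datetime(2026, 1, 1, tzinfo=timezone.utc)
first, _ = split_new_and_seen([{"id": "a"}, {"id": "b"}], seen_store, now=day0)
second, skipped = split_new_and_seen(
    [{"id": "a"}, {"id": "c"}], seen_store, now=day0 + timedelta(days=1)
)
third, _ = split_new_and_seen([{"id": "b"}], seen_store, now=day0 + timedelta(days=91))
print(len(first), [i["id"] for i in second], [i["id"] for i in skipped], len(third))
```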

#### ⚡ Lightweight and fast
Runs at 256 MB memory. No browser required. Processes thousands of listings per run.

---

### Input parameters

| Parameter | Type | Default | Description |
|---|---|---|---|
| `keyword` | String | `""` | Job title, skill, or keyword to search for |
| `language` | Select | `english` | Job post language: `english`, `japanese`, or `both` |
| `maxItems` | Integer | `5` | Maximum number of results to return |
| `daysOld` | Integer | — | Only return jobs activated within this many days |
| `skipReposts` | Boolean | `false` | Skip jobs seen in previous runs |
| `startUrls` | Array | `[]` | Specific Daijob search or detail URLs to scrape |

**Using `startUrls`:** Paste any Daijob.com search result URL (with location or job-type filters already applied) to override the keyword search. You can also pass individual job detail URLs directly.
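
The parameters above can be assembled and sanity-checked before a run. The sketch below uses a hypothetical helper, `build_input`, that is not part of any Apify library. It enforces only the constraints stated on this page: the three `language` values and a `maxItems` of at least 1.

```python
ALLOWED_LANGUAGES = {"english", "japanese", "both"}


def build_input(*, max_items, keyword="", language="english", days_old=None,
                skip_reposts=False, start_urls=None):
    """Assemble an input object using the documented parameter names."""
    if language not in ALLOWED_LANGUAGES:
        raise ValueError(f"language must be one of {sorted(ALLOWED_LANGUAGES)}")
    if max_items < 1:
        raise ValueError("maxItems must be at least 1")
    run_input = {
        "keyword": keyword,
        "language": language,
        "maxItems": max_items,
        "skipReposts": skip_reposts,
        # startUrls entries are objects with a "url" key.
        "startUrls": [{"url": u} for u in (start_urls or [])],
    }
    if days_old is not None:  # omit the key entirely for "no limit"
        run_input["daysOld"] = days_old
    return run_input


tokyo = build_input(
    max_items=300,
    start_urls=["https://www.daijob.com/en/jobs/search_result?job_post_language=1&ac=118&page=1"],
)
print(tokyo)
```

The resulting dict can be passed as the run input to either client library or sent as the JSON body of a REST call.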

---

### Output schema

```json
{
    "id": "1523159",
    "url": "https://www.daijob.com/en/jobs/detail/1523159",
    "title": "[Executive Assistant] Supporting Director-Level Leadership",
    "companyName": "Cubastion Consulting K.K.",
    "accountType": "Employer",
    "jobTypes": [
        "General Affairs/HR/Legal - Trainer (Education/Training)",
        "Administrative - Secretary"
    ],
    "industry": "IT Consulting",
    "country": "Japan",
    "prefecture": "Kanagawa",
    "city": "Yokohama",
    "salaryMin": 3000000,
    "salaryMax": 4000000,
    "salaryCurrency": "JPY",
    "salaryPeriod": "YEAR",
    "salaryDescription": "Social Insurance\nCommuting / Transportation Allowance",
    "japaneseLevel": "Business Level (JLPT Level 2 or N2)",
    "englishLevel": "Business Conversation Level (TOEIC 735-860)",
    "otherLanguages": null,
    "contractType": "Full-time",
    "careerLevel": "Staff Level",
    "holidays": "Five-Day Workweek\nSummer Holidays\nWinter Holidays",
    "descriptionHtml": "<p>We are looking for a highly organized...</p>",
    "descriptionText": "We are looking for a highly organized...",
    "descriptionMarkdown": "We are looking for a highly organized...",
    "companyInfoText": "Cubastion is a next-generation technology company...",
    "requirementsText": "Required: 3–5 years of experience as an Executive Assistant...",
    "applyUrl": "https://www.daijob.com/en/member/gotoapply/1523159",
    "publishDate": "2026-04-21",
    "updateDate": "2026-04-22",
    "contentHash": "a1b2c3d4e5f6a7b8",
    "source": "daijob.com",
    "scrapedAt": "2026-04-23T10:00:00.000Z",
    "isRepost": false,
    "originalPublishDate": null,
    "originalUrl": null
}
````
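
The language-level fields arrive as free text, so numeric filtering needs a small parsing step. The sketch below pulls the TOEIC range and JLPT N-level out of strings shaped like the sample record. Whether every listing follows that wording is an assumption, so both hypothetical helpers return `None` when nothing matches.

```python
import re
from typing import Optional, Tuple


def toeic_range(english_level: Optional[str]) -> Optional[Tuple[int, int]]:
    """Extract (low, high) from text like 'TOEIC 735-860'; None if absent."""
    if not english_level:
        return None
    match = re.search(r"TOEIC\s*(\d{2,3})\s*-\s*(\d{2,3})", english_level)
    return (int(match.group(1)), int(match.group(2))) if match else None


def jlpt_level(japanese_level: Optional[str]) -> Optional[int]:
    """Extract the N-level from text like 'JLPT Level 2 or N2'; None if absent."""
    if not japanese_level:
        return None
    match = re.search(r"\bN([1-5])\b", japanese_level)
    return int(match.group(1)) if match else None


print(toeic_range("Business Conversation Level (TOEIC 735-860)"))
print(jlpt_level("Business Level (JLPT Level 2 or N2)"))
```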

***

### Examples

#### 1. Search for software engineers (English posts)

```json
{
    "keyword": "software engineer",
    "language": "english",
    "maxItems": 200
}
```

#### 2. All jobs posted in the last 7 days

```json
{
    "language": "both",
    "maxItems": 500,
    "daysOld": 7
}
```

#### 3. Tokyo-area jobs via start URL

```json
{
    "startUrls": [
        { "url": "https://www.daijob.com/en/jobs/search_result?job_post_language=1&ac=118&page=1" }
    ],
    "maxItems": 300
}
```

#### 4. Scheduled incremental scrape (skip known jobs)

```json
{
    "language": "english",
    "maxItems": 1000,
    "daysOld": 3,
    "skipReposts": true
}
```

***

### 💰 Pricing

**$1.49 per 1,000 results** — you only pay for successfully retrieved listings.
Failed retries and filtered reposts are never charged.

| Results | Cost |
|---|---|
| 100 | ~$0.15 |
| 1,000 | ~$1.49 |
| 10,000 | ~$14.90 |
| 100,000 | ~$149.00 |

> Flat-rate alternatives typically charge $29–$49/month regardless of usage.

Use the **Max results** cap to control your spend exactly.
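
The table above is simple arithmetic on the quoted $1.49 per 1,000 results, and the same arithmetic can size `maxItems` for a budget. The sketch below covers the per-result charge only; platform usage is not included. The helper names are illustrative.

```python
from decimal import Decimal

PRICE_PER_1000_USD = Decimal("1.49")  # per-result price quoted above


def estimate_result_cost(results: int) -> Decimal:
    """Estimated per-result charge in USD, rounded to cents."""
    if results < 0:
        raise ValueError("results must be non-negative")
    return (Decimal(results) * PRICE_PER_1000_USD / 1000).quantize(Decimal("0.01"))


def max_items_for_budget(budget_usd: str) -> int:
    """Largest maxItems whose per-result charge fits within the budget."""
    return int(Decimal(budget_usd) * 1000 // PRICE_PER_1000_USD)


for n in (100, 1000, 10000, 100000):
    print(n, estimate_result_cost(n))
print(max_items_for_budget("5.00"))
```

`Decimal` is used instead of floats so that a budget of exactly $1.49 maps to 1,000 results rather than 999.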

***

### Performance

| Run size | Approx. time | Memory |
|---|---|---|
| 100 jobs | ~2 min | 256 MB |
| 500 jobs | ~8 min | 256 MB |
| 2,000 jobs | ~30 min | 256 MB |
| 8,000+ (full) | ~2–3 hrs | 256 MB |

Performance varies by Apify server load and network latency to Japan.

***

### Known limitations

- Application URLs redirect via a Daijob member gateway — the direct employer apply URL is not exposed publicly.
- Location data for postings with multiple work sites lists all sites; the primary location is listed first.
- Japanese-language postings (`language: "japanese"`) return titles and descriptions in Japanese.

***

### Technical details

- **Source:** daijob.com — Japan's leading bilingual job board, established 1998
- **Memory:** 256 MB
- **Repost storage:** KeyValueStore `daijob-job-dedup`, 90-day TTL
- **Retry:** Automatic retry with exponential backoff, 3 attempts per request
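
The retry policy is described only as exponential backoff with 3 attempts. The sketch below shows that general pattern, not the Actor's actual code. The 1-second base delay, the doubling factor, and the absence of jitter are all assumptions.

```python
import time


def retry_with_backoff(func, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call func(); on exception wait base_delay * 2**n and retry, up to `attempts` calls."""
    for attempt in range(attempts):
        try:
            return func()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** attempt)


# Demo with a fake operation that fails twice, then succeeds.
calls = {"count": 0}
delays = []


def flaky():
    calls["count"] += 1
    if calls["count"] < 3:
        raise ConnectionError("temporary failure")
    return "ok"


# Record the delays instead of actually sleeping.
result = retry_with_backoff(flaky, sleep=delays.append)
print(result, calls["count"], delays)
```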

***

### Additional services

Need a custom Actor, additional filters, scheduled runs, or integration support?
Send an email to <info@unfencedgroup.nl> — we build on request.

***

*Built by [unfenced-group](https://apify.com/unfenced-group) · Issues? Open a ticket or send a message.*

# Actor input Schema

## `keyword` (type: `string`):

Job title, skill, or any keyword to search for.

## `language` (type: `string`):

Filter by the language of the job posting.

## `maxItems` (type: `integer`):

Maximum number of job listings to return.

## `daysOld` (type: `integer`):

Only return jobs activated within this many days. Leave empty for no limit.

## `skipReposts` (type: `boolean`):

Skip jobs already seen in a previous run (based on 90-day deduplication store).

## `startUrls` (type: `array`):

Optional list of Daijob.com search result or job detail URLs to scrape directly. Overrides keyword/language filters.

## Actor input object example

```json
{
  "keyword": "",
  "language": "english",
  "maxItems": 5,
  "skipReposts": false,
  "startUrls": []
}
```

# Actor output Schema

## `OUTPUT` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {};

// Run the Actor and wait for it to finish
const run = await client.actor("unfenced-group/daijob-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {}

# Run the Actor and wait for it to finish
run = client.actor("unfenced-group/daijob-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{}' |
apify call unfenced-group/daijob-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=unfenced-group/daijob-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Daijob.com Scraper",
        "description": "Scrape Japan's leading bilingual job board. Extract job titles, salaries in JPY, Japanese/English level requirements, location, and full descriptions. 8,000+ listings. No API key required.",
        "version": "0.0",
        "x-build-id": "kdXo2q5TiXL9awlp5"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/unfenced-group~daijob-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-unfenced-group-daijob-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/unfenced-group~daijob-scraper/runs": {
            "post": {
                "operationId": "runs-sync-unfenced-group-daijob-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/unfenced-group~daijob-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-unfenced-group-daijob-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "keyword": {
                        "title": "Keyword",
                        "type": "string",
                        "description": "Job title, skill, or any keyword to search for.",
                        "default": ""
                    },
                    "language": {
                        "title": "Job post language",
                        "enum": [
                            "english",
                            "japanese",
                            "both"
                        ],
                        "type": "string",
                        "description": "Filter by the language of the job posting.",
                        "default": "english"
                    },
                    "maxItems": {
                        "title": "Max results",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of job listings to return.",
                        "default": 5
                    },
                    "daysOld": {
                        "title": "Max age (days)",
                        "type": "integer",
                        "description": "Only return jobs activated within this many days. Leave empty for no limit."
                    },
                    "skipReposts": {
                        "title": "Skip reposts",
                        "type": "boolean",
                        "description": "Skip jobs already seen in a previous run (based on 90-day deduplication store).",
                        "default": false
                    },
                    "startUrls": {
                        "title": "Start URLs",
                        "type": "array",
                        "description": "Optional list of Daijob.com search result or job detail URLs to scrape directly. Overrides keyword/language filters.",
                        "default": [],
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
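
For projects that call the REST API directly, the `run-sync-get-dataset-items` path in the specification above can be turned into a request with the standard library alone. The sketch below only builds the request and does not send it. The helper name `build_sync_request` is illustrative, and the token is passed as the `token` query parameter as the spec describes.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "https://api.apify.com/v2"
ACTOR_PATH = "/acts/unfenced-group~daijob-scraper/run-sync-get-dataset-items"


def build_sync_request(token: str, run_input: dict) -> Request:
    """Build (but do not send) the POST request described in the spec."""
    url = f"{BASE_URL}{ACTOR_PATH}?{urlencode({'token': token})}"
    body = json.dumps(run_input).encode("utf-8")
    return Request(url, data=body, method="POST",
                   headers={"Content-Type": "application/json"})


request = build_sync_request(
    "<YOUR_API_TOKEN>", {"keyword": "software engineer", "maxItems": 5}
)
print(request.get_method(), request.full_url)
# To execute the run, pass `request` to urllib.request.urlopen and parse the response body.
```

Because this endpoint waits for the run to finish, it is best suited to small `maxItems` values; longer runs are better started through the `/runs` path and polled.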
