# RozeePk Jobs Scraper (`shahidirfan/rozeepk-jobs-scraper`) Actor

Extract RozeePk job listings in bulk with precision. Captures job titles, companies, salaries & full descriptions automatically. Ideal for job boards, market analysis & recruitment intelligence. Requires residential proxies for optimal scraping success.

- **URL**: https://apify.com/shahidirfan/rozeepk-jobs-scraper.md
- **Developed by:** [Shahid Irfan](https://apify.com/shahidirfan) (community)
- **Categories:** Jobs, AI, Automation
- **Stats:** 7 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Rozee.pk Jobs Scraper

Extract comprehensive job listings from Rozee.pk with rich structured output for hiring intelligence, labor-market analysis, and opportunity tracking. Collect role, company, location, salary, skills, experience, and advanced listing metadata in one run.

### Features

- **Rich Job Records** - Captures core job details plus extended listing metadata.
- **Flexible Search** - Filter by keyword and optionally narrow by location text.
- **Auto Pagination** - Automatically calculates pages from `results_wanted`.
- **Clean Output** - Returns analysis-ready JSON with normalized convenience fields.
- **QA-Friendly Defaults** - Ships with safe defaults for reliable automated runs.

### Use Cases

#### Job Market Research
Track role demand, hiring hotspots, and salary patterns across cities and industries in Pakistan.

#### Recruitment Intelligence
Monitor active openings and competitor hiring activity with structured records you can query.

#### Skills Trend Analysis
Analyze skill requirements, experience ranges, and job type patterns for workforce planning.

#### Data Enrichment Pipelines
Feed normalized job data into internal tools, dashboards, and scoring workflows.

---

### Input Parameters

| Parameter | Type | Required | Default | Description |
|-----------|------|----------|---------|-------------|
| `keyword` | String | No | `"software engineer"` | Search keyword. Use broad terms for larger datasets. |
| `location` | String | No | `""` | Optional location text filter (e.g., `Lahore`, `Karachi`). |
| `results_wanted` | Integer | No | `20` | Maximum number of job records to save. |
| `proxyConfiguration` | Object | No | Residential enabled | Proxy settings for reliability and stability. |

---

### Output Data

Each dataset item includes both normalized fields and extended listing fields.

| Field | Type | Description |
|-------|------|-------------|
| `job_id` | String | Unique listing identifier. |
| `url` | String | Direct listing URL. |
| `title` | String | Job title. |
| `company` | String | Hiring company name. |
| `location` | String | Normalized location text. |
| `salary` | String | Human-readable salary range/value when available. |
| `contract_type` | String | Employment type (for example, full-time or contract). |
| `experience` | String | Experience requirement text. |
| `skills` | Array | Listed skills for the role. |
| `description_html` | String/null | Rich description content. |
| `description_text` | String | Plain-text description. |
| `date_posted` | String | Posting date text. |
| `applyBy` | String | Apply-by date when provided. |
| `created_at` | String | Listing creation timestamp. |
| `modified_at` | String | Last modification timestamp. |
| `cityId_exact` | Array | Source city IDs. |
| `provinceId_exact` | Array | Source province IDs. |
| `countryId_exact` | Number | Source country ID. |
| `latlng` | Array | Source latitude/longitude values. |
| `industry_exact` | Number | Source industry category ID. |
| `careerLevel_Id` | Number | Source career-level ID. |
| `jobSource` | String | Source classification when provided. |
| `whereFrom` | String | Origin tag when provided. |
| `scraped_at` | String | ISO timestamp of extraction. |

---

### Usage Examples

#### Basic Search

```json
{
    "keyword": "software engineer",
    "results_wanted": 30
}
````

#### City-Filtered Search

```json
{
    "keyword": "data analyst",
    "location": "Lahore",
    "results_wanted": 50
}
```

#### Broad Collection

```json
{
    "keyword": "all",
    "results_wanted": 200
}
```

***

### Sample Output

```json
{
    "jid": "1823712",
    "permaLink": "software-engineer-ai-ml-lahore-jobs-1823712",
    "title": "Software Engineer (AI / ML)",
    "company": "Infotech",
    "city": "Lahore",
    "skills": ["Python", "TensorFlow", "Scikit-learn"],
    "applyBy": "2026-03-20T00:00:00Z",
    "created_at": "2026-03-01T10:11:12Z",
    "job_id": "1823712",
    "url": "https://www.rozee.pk/software-engineer-ai-ml-lahore-jobs-1823712",
    "location": "Lahore",
    "salary": "PKR 250,000 - 350,000",
    "contract_type": "Full Time/Permanent",
    "experience": "3 Years",
    "description_text": "We are looking for a skilled Software Engineer specializing in Artificial Intelligence and Machine Learning...",
    "date_posted": "Mar 01, 2026",
    "scraped_at": "2026-03-02T08:57:00.000Z"
}
```

***

### Tips for Best Results

#### Start With Focused Keywords

- Use specific role names for high relevance.
- Use broader terms for larger datasets.

#### Balance Volume and Speed

- Keep `results_wanted` near `20-100` for quick runs.
- Increase `results_wanted` when you need deeper coverage.

#### Use Reliable Proxies

- Residential proxy groups generally improve stability on larger runs.

***

### Integrations

Connect your dataset to:

- **Google Sheets** - Reporting and ad hoc analysis.
- **Airtable** - Searchable recruiting databases.
- **Slack** - New-listing notifications.
- **Webhooks** - Push updates to internal services.
- **Make** - Automated no-code workflows.
- **Zapier** - Trigger downstream business automations.

#### Export Formats

- **JSON** - Programmatic usage and APIs.
- **CSV** - Spreadsheet workflows.
- **Excel** - Business reporting.
- **XML** - System integrations.

***

### Frequently Asked Questions

#### How many jobs can I collect?

You can collect up to your configured `results_wanted`, subject to available results.

#### Can I filter by city?

Yes. Use the `location` field to keep only jobs matching your target location text.

#### Why are some fields missing in certain jobs?

Some listings do not publish every field. The actor only returns available values.

#### Does it handle pagination automatically?

Yes. It iterates result pages until limits are reached or no more jobs are available.

#### Is output ready for analytics?

Yes. Records include normalized core fields plus extended metadata for deeper analysis.

***

### Support

For issues or feature requests, use Apify Console support channels.

#### Resources

- [Apify Documentation](https://docs.apify.com/)
- [API Reference](https://docs.apify.com/api/v2)
- [Scheduling Runs](https://docs.apify.com/schedules)

***

### Legal Notice

This actor is intended for legitimate data collection. You are responsible for complying with website terms, applicable laws, and responsible data usage practices.

# Actor input Schema

## `keyword` (type: `string`):

Job search keyword (e.g., 'Software Engineer', 'Marketing Manager').

## `location` (type: `string`):

Filter by city or region (e.g., 'Lahore', 'Karachi', 'Islamabad').

## `results_wanted` (type: `integer`):

Maximum number of jobs to scrape.

## `proxyConfiguration` (type: `object`):

Proxy settings. Residential proxies recommended for best results.

## Actor input object example

```json
{
  "keyword": "software engineer",
  "location": "",
  "results_wanted": 20,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}
```

# Actor output Schema

## `overview` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "keyword": "software engineer",
    "location": "",
    "results_wanted": 20
};

// Run the Actor and wait for it to finish
const run = await client.actor("shahidirfan/rozeepk-jobs-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "keyword": "software engineer",
    "location": "",
    "results_wanted": 20,
}

# Run the Actor and wait for it to finish
run = client.actor("shahidirfan/rozeepk-jobs-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "keyword": "software engineer",
  "location": "",
  "results_wanted": 20
}' |
apify call shahidirfan/rozeepk-jobs-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=shahidirfan/rozeepk-jobs-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "RozeePk Jobs Scraper",
        "description": "Extract RozeePk job listings in bulk with precision. Captures job titles, companies, salaries & full descriptions automatically. Ideal for job boards, market analysis & recruitment intelligence. Requires residential proxies for optimal scraping success.",
        "version": "1.0",
        "x-build-id": "6L8RFoTYfR2YoS9cU"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/shahidirfan~rozeepk-jobs-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-shahidirfan-rozeepk-jobs-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/shahidirfan~rozeepk-jobs-scraper/runs": {
            "post": {
                "operationId": "runs-sync-shahidirfan-rozeepk-jobs-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/shahidirfan~rozeepk-jobs-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-shahidirfan-rozeepk-jobs-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "keyword": {
                        "title": "Keyword",
                        "type": "string",
                        "description": "Job search keyword (e.g., 'Software Engineer', 'Marketing Manager').",
                        "default": "software engineer"
                    },
                    "location": {
                        "title": "Location",
                        "type": "string",
                        "description": "Filter by city or region (e.g., 'Lahore', 'Karachi', 'Islamabad').",
                        "default": ""
                    },
                    "results_wanted": {
                        "title": "Maximum Jobs",
                        "minimum": 1,
                        "maximum": 1000,
                        "type": "integer",
                        "description": "Maximum number of jobs to scrape.",
                        "default": 20
                    },
                    "proxyConfiguration": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Proxy settings. Residential proxies recommended for best results.",
                        "default": {
                            "useApifyProxy": true,
                            "apifyProxyGroups": [
                                "RESIDENTIAL"
                            ]
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
