# Komatsu OEM Specs Scraper — Equipment Specifications (`rastriq/komatsu-oem-specs`) Actor

Scrape official Komatsu equipment specifications. Extract model details, operating weight, engine power, bucket capacity, dimensions, and performance data for excavators, dozers, loaders, dump trucks, and more. Build a complete Komatsu equipment database.

- **URL**: https://apify.com/rastriq/komatsu-oem-specs.md
- **Developed by:** [Rastriq — Structured data from the world](https://apify.com/rastriq) (community)
- **Categories:** Other, Developer tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $1.50 / 1,000 results

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with a capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows (PowerShell)
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Komatsu Equipment Specs API

Get **complete technical specifications** for 100+ Komatsu construction equipment models — delivered instantly from a pre-built database updated monthly.

### What data do you get?

For every Komatsu model (excavators, dozers, wheel loaders, trucks, graders, and more), you receive:

- **Model identification** — name, category, subcategory, official URL
- **Full spec sheet** — engine power, operating weight, hydraulic flow, bucket capacity, dimensions, ground pressure, travel speed, swing speed, and every other OEM specification
- **Structured values** — raw text plus parsed numeric value and unit, ready for analysis or comparison

Data is extracted directly from official [komatsu.com](https://www.komatsu.com) product pages.

### Who is this for?

- **Equipment dealers & rental companies** — compare specs across models for fleet decisions
- **Market intelligence platforms** — build equipment comparison tools and pricing models
- **Construction tech companies** — power search, recommendation, or matching features with real specs
- **Data analysts & researchers** — analyze trends in equipment capabilities across generations

### How it works

1. Click **Start** — results are served instantly from our cached database
2. Download in JSON, CSV, or Excel
3. That's it. No configuration needed.

The database refreshes monthly. If you need the absolute latest data, enable "Force fresh scrape" (`forceRefresh`). For most use cases the cached data matches what's on komatsu.com, since specs for existing models rarely change between refreshes.

### Equipment categories covered

| Category | Examples |
|----------|----------|
| Excavators | PC30MR-5 through PC4000-6 (mini to mining) |
| Dozers | D31PX-24 through D575A-3D |
| Wheel Loaders | WA150-8 through WA800-3 |
| Motor Graders | GD511A-1, GD655-7, GD675-7, GD825A-2 |
| Trucks | HD325-8 through 930E-5 (rigid + articulated) |
| Forklifts | FG10HX-20 through FG50HT-8 |
| Forestry | Harvester heads + log loaders |
| Mining | Blasthole drills, draglines, electric rope shovels |

### Sample output

```json
{
  "category": "excavators",
  "subcategory": "hydraulic-excavators",
  "model_slug": "pc200lc-11",
  "model_name": "PC200LC-11 Hydraulic Excavator",
  "section": "Engine",
  "spec_name": "Net Power",
  "raw_value": "110 kW / 148 HP",
  "value": "110",
  "unit": "kW / 148 HP",
  "url": "https://www.komatsu.com/en-us/products/equipment/excavators/hydraulic-excavators/pc200lc-11"
}
````
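Because each dataset item is a single spec row, a common first step is to regroup the rows into one record per model. The following is a minimal sketch, assuming rows shaped like the sample above. The helper name `group_specs_by_model` is hypothetical, and the second row is a placeholder rather than real Komatsu data:

```python
from collections import defaultdict


def group_specs_by_model(rows):
    """Regroup flat spec rows into {model_slug: {"Section / Spec name": raw_value}}."""
    models = defaultdict(dict)
    for row in rows:
        key = f'{row["section"]} / {row["spec_name"]}'
        models[row["model_slug"]][key] = row["raw_value"]
    return dict(models)


rows = [
    # Fields copied from the sample output above.
    {"model_slug": "pc200lc-11", "section": "Engine",
     "spec_name": "Net Power", "raw_value": "110 kW / 148 HP"},
    # Placeholder row (not real Komatsu data), only to show the grouping.
    {"model_slug": "pc200lc-11", "section": "Example section",
     "spec_name": "Example spec", "raw_value": "1 unit"},
]

grouped = group_specs_by_model(rows)
print(grouped["pc200lc-11"]["Engine / Net Power"])
```

Keying on `model_slug` rather than `model_name` is a design choice in this sketch: the slug also appears in the product URL, so it is likely to be the more stable identifier.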

### Pricing

**$1.50 per 1,000 results** (spec rows). A typical full run with all models produces ~2,000–5,000 rows depending on category selection.

Most runs complete in seconds (cached data). Fresh scrapes take 15–30 minutes.
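As a rough budgeting aid, the listed price can be turned into a per-run estimate. This sketch assumes the result price is the only charge and scales linearly with row count. The function name is hypothetical, and forced fresh scrapes are described elsewhere in this document as costing more, so treat the figures as a lower bound:

```python
PRICE_PER_1000_RESULTS_USD = 1.50  # listed price per 1,000 spec rows


def estimate_cost_usd(row_count, price_per_1000=PRICE_PER_1000_RESULTS_USD):
    """Estimate the cost of a run that returns `row_count` spec rows."""
    if row_count < 0:
        raise ValueError("row_count must be non-negative")
    return row_count * price_per_1000 / 1000


# The typical full-run range quoted above.
for row_count in (2000, 5000):
    print(f"{row_count} rows -> ${estimate_cost_usd(row_count):.2f}")
```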

### Tips

- **First run**: Leave all defaults and click Start. You'll get the full database instantly.
- **Filter by category**: Use the "Equipment categories" field to get only excavators, only trucks, etc.
- **Integrate via API**: Schedule monthly runs and pull results from the dataset API endpoint — perfect for keeping your systems in sync.
- **Cost control**: Set "Max models" to limit output for testing.
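The "Integrate via API" tip can be sketched as follows. The helper builds a URL for Apify's dataset items endpoint without sending any request. The function name and the dataset ID `abc123` are placeholders; in practice the ID would be the `defaultDatasetId` of a finished run:

```python
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"


def dataset_items_url(dataset_id, fmt="json", limit=None):
    """Build the URL for downloading a dataset's items in the given format."""
    params = {"format": fmt}
    if limit is not None:
        params["limit"] = limit
    return f"{API_BASE}/datasets/{dataset_id}/items?{urlencode(params)}"


# "abc123" is a placeholder for a real dataset ID.
print(dataset_items_url("abc123", fmt="csv", limit=100))
```

Depending on how your storage is configured, the request may also need your API token, for example in an `Authorization: Bearer <YOUR_API_TOKEN>` header.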

### FAQ

**How fresh is the data?**
The database updates monthly. Komatsu rarely changes specs for existing models — new models are added periodically.

**Can I get specs for discontinued models?**
Only models currently listed on komatsu.com are available. Historical models that have been removed from the site are not included.

**What format are the values in?**
Each spec includes the raw value (as shown on komatsu.com), a parsed numeric value, and the unit separately — so you can use them directly in calculations or comparisons.
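When a raw value carries more than one measurement, as in the sample row's `"110 kW / 148 HP"`, you may want every number and unit rather than only the first. This is a minimal sketch, assuming alternatives are separated by a slash surrounded by spaces. That assumption comes from the single sample row, and `split_raw_value` is a hypothetical helper, not part of the Actor:

```python
import re

# A number (with optional thousands separators and decimals) followed by a unit.
_MEASUREMENT = re.compile(r"^\s*(-?\d[\d,]*(?:\.\d+)?)\s*(.*?)\s*$")


def split_raw_value(raw_value):
    """Split e.g. "110 kW / 148 HP" into a list of (number, unit) tuples."""
    measurements = []
    # Split only on " / " so that units such as "km/h" stay intact.
    for part in re.split(r"\s+/\s+", raw_value):
        match = _MEASUREMENT.match(part)
        if match:
            number = float(match.group(1).replace(",", ""))
            measurements.append((number, match.group(2)))
    return measurements


print(split_raw_value("110 kW / 148 HP"))
```

Parts that do not start with a number are skipped, so purely textual specs yield an empty list instead of raising an error.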

**Is this official Komatsu data?**
Yes — all data is extracted directly from official komatsu.com product specification pages. We do not modify or interpret the values.

# Actor input Schema

## `categories` (type: `array`):

Select which Komatsu equipment types to include in results. Leave empty to get **all categories**.

Available: `excavators`, `dozers`, `wheel-loaders`, `motor-graders`, `trucks`, `forklifts`, `forestry`, `blasthole-drills`, `crushing`, `draglines`, `electric-rope-shovels`

## `maxModels` (type: `integer`):

Limit the number of equipment models returned. Leave empty for all available models (100+).

💡 Use a small number (5–10) for your first run to preview the data structure.

## `forceRefresh` (type: `boolean`):

By default, results come from our **pre-built database** (updated monthly). Enable this only if you need the absolute latest data from komatsu.com; a fresh scrape takes longer and costs more.

## `mode` (type: `string`):

Controls what the Actor does internally.

- **Full** (`full`): build the model index and scrape specs (default)
- **Index only** (`index_only`): only discover model URLs, without scraping specs
- **Scrape only** (`scrape_only`): scrape specs using a previously saved index

## `delayMs` (type: `integer`):

Milliseconds between requests when scraping fresh data. Only relevant if 'Force fresh scrape' is enabled.

## `proxyConfiguration` (type: `object`):

Proxy settings for fresh scrapes. Uses US-based proxies by default for best reliability.

## Actor input object example

```json
{
  "categories": [],
  "maxModels": 10,
  "forceRefresh": false,
  "mode": "full",
  "delayMs": 800,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyCountry": "US"
  }
}
```
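Checking an input object locally before starting a run can catch typos early. The sketch below mirrors the limits stated in this document: the category list, the `mode` enum, `maxModels` of at least 1, and `delayMs` between 100 and 10000. `build_run_input` is a hypothetical helper, and whether the Actor itself rejects out-of-range values is not confirmed here:

```python
ALLOWED_CATEGORIES = {
    "excavators", "dozers", "wheel-loaders", "motor-graders", "trucks",
    "forklifts", "forestry", "blasthole-drills", "crushing", "draglines",
    "electric-rope-shovels",
}
ALLOWED_MODES = {"full", "index_only", "scrape_only"}


def build_run_input(categories=(), max_models=None, force_refresh=False,
                    mode="full", delay_ms=800):
    """Return an input dict, raising ValueError for values outside the documented limits."""
    unknown = sorted(set(categories) - ALLOWED_CATEGORIES)
    if unknown:
        raise ValueError(f"Unknown categories: {unknown}")
    if max_models is not None and max_models < 1:
        raise ValueError("maxModels must be at least 1")
    if mode not in ALLOWED_MODES:
        raise ValueError(f"Unknown mode: {mode!r}")
    if not 100 <= delay_ms <= 10000:
        raise ValueError("delayMs must be between 100 and 10000")

    run_input = {
        "categories": list(categories),
        "forceRefresh": force_refresh,
        "mode": mode,
        "delayMs": delay_ms,
    }
    if max_models is not None:  # omit the key to request all models
        run_input["maxModels"] = max_models
    return run_input


print(build_run_input(categories=["excavators", "trucks"], max_models=5))
```

`proxyConfiguration` is left out of this sketch on purpose, so the Actor's own default applies.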

# Actor output Schema

## `dataset` (type: `string`):

Full dataset with all models and their technical specs (engine, weight, hydraulics, dimensions, etc.)

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "categories": [],
    "maxModels": 10,
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyCountry": "US"
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("rastriq/komatsu-oem-specs").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "categories": [],
    "maxModels": 10,
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyCountry": "US",
    },
}

# Run the Actor and wait for it to finish
run = client.actor("rastriq/komatsu-oem-specs").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "categories": [],
  "maxModels": 10,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyCountry": "US"
  }
}' |
apify call rastriq/komatsu-oem-specs --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=rastriq/komatsu-oem-specs",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Komatsu OEM Specs Scraper — Equipment Specifications",
        "description": "Scrape official Komatsu equipment specifications. Extract model details, operating weight, engine power, bucket capacity, dimensions, and performance data for excavators, dozers, loaders, dump trucks, and more. Build a complete Komatsu equipment database.",
        "version": "0.1",
        "x-build-id": "MAhKzC9HWclDtfjsO"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/rastriq~komatsu-oem-specs/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-rastriq-komatsu-oem-specs",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/rastriq~komatsu-oem-specs/runs": {
            "post": {
                "operationId": "runs-sync-rastriq-komatsu-oem-specs",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/rastriq~komatsu-oem-specs/run-sync": {
            "post": {
                "operationId": "run-sync-rastriq-komatsu-oem-specs",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "categories": {
                        "title": "Equipment categories",
                        "type": "array",
                        "description": "Select which Komatsu equipment types to include in results. Leave empty to get <b>all categories</b>.<br><br>Available: excavators, dozers, wheel-loaders, motor-graders, trucks, forklifts, forestry, blasthole-drills, crushing, draglines, electric-rope-shovels",
                        "default": [],
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxModels": {
                        "title": "Max models",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Limit the number of equipment models returned. Leave empty for all available models (~100+).<br><br>💡 Use a small number (5–10) for your first run to preview the data structure."
                    },
                    "forceRefresh": {
                        "title": "Force fresh scrape",
                        "type": "boolean",
                        "description": "By default, results come from our <b>pre-built database</b> (updated monthly). Enable this only if you need the absolute latest data from komatsu.com — it will take longer and cost more.",
                        "default": false
                    },
                    "mode": {
                        "title": "Run mode",
                        "enum": [
                            "full",
                            "index_only",
                            "scrape_only"
                        ],
                        "type": "string",
                        "description": "Controls what the Actor does internally.<br>• <b>Full</b> — build model index + scrape specs (default)<br>• <b>Index only</b> — only discover model URLs, don't scrape specs<br>• <b>Scrape only</b> — use a previously saved index",
                        "default": "full"
                    },
                    "delayMs": {
                        "title": "Request delay (ms)",
                        "minimum": 100,
                        "maximum": 10000,
                        "type": "integer",
                        "description": "Milliseconds between requests when scraping fresh data. Only relevant if 'Force fresh scrape' is enabled.",
                        "default": 800
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Proxy settings for fresh scrapes. Uses US-based proxies by default for best reliability.",
                        "default": {
                            "useApifyProxy": false
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
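If you prefer the raw REST API over a client library, the `run-sync-get-dataset-items` path from the specification above can be called with the Python standard library alone. This sketch only builds the request and does not send it. The helper name is hypothetical, and passing the token as a query parameter follows the `token` parameter declared in the specification:

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

ENDPOINT = ("https://api.apify.com/v2/acts/rastriq~komatsu-oem-specs"
            "/run-sync-get-dataset-items")


def build_run_sync_request(token, run_input):
    """Build (but do not send) the POST request for a synchronous run."""
    url = f"{ENDPOINT}?{urlencode({'token': token})}"
    body = json.dumps(run_input).encode("utf-8")
    return Request(url, data=body, method="POST",
                   headers={"Content-Type": "application/json"})


request = build_run_sync_request("<YOUR_API_TOKEN>",
                                 {"categories": [], "maxModels": 10})
print(request.get_method(), request.full_url)
# To send it, pass `request` to urllib.request.urlopen() and read the response.
```

Since the call blocks until the run finishes, it suits the cached mode better than a forced fresh scrape, which this document says can take 15–30 minutes.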
