Automobile Catalog — Luxury Car Taxonomy Scraper
Pricing
Pay per usage
Under maintenance
Scrapes make/model/generation/variant/specs taxonomy from automobile-catalog.com for luxury and classic car brands.
Developer
ScrappingLatam
Scrapes the complete make/model/generation/variant/specs taxonomy from automobile-catalog.com for 65 luxury and classic car brands — Porsche, Ferrari, Mercedes-Benz, Lamborghini, Aston Martin, Bentley, Rolls-Royce, McLaren, Bugatti, and more.
What does it do?
Runs a 3-phase pipeline:
- Generations — downloads all model generations per brand (e.g. Porsche 911 3rd gen, 1973–1989). ~1,500 rows. Fast (~5 min).
- Variants — for each generation, downloads all individual variants (e.g. 911 Carrera RS 2.7 Coupe). ~8,000–12,000 rows. Medium (~3–4 h).
- Specs — for each variant, downloads the full technical spec sheet (engine, power, torque, dimensions, weight, etc.). ~8,000–12,000 rows. Slow (~8–10 h).
Run each phase as a separate run, and set resume_from_kv=true on every run after the first so that the runs chain from the saved state.
Output fields
| Field | Description |
|---|---|
| _type | Row type: generation / variant / spec |
| make_slug | Brand slug (porsche, mercedes-benz) |
| model_slug | Model/generation slug |
| family_slug | Family slug (without generation suffix) |
| year_from / year_to | Production years |
| car_id | Stable numeric ID from automobile-catalog.com |
| variant_slug | Full variant slug |
| spec_url | Source URL |
| power_hp / power_kw / power_ps | Engine power in hp, kW, and PS |
| torque_nm | Torque in Nm |
| displacement_cm3 | Engine displacement in cm³ |
| transmission_type | manual / automatic |
| body_style | coupe, convertible, sedan, etc. |
| fuel_type | gasoline, diesel, electric, hybrid |
| accel_0_100_kmh_s | 0–100 km/h acceleration time in seconds |
| top_speed_kmh | Top speed in km/h |
| curb_weight_kg | Curb weight in kg |
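
Since every row carries a _type, a consumer will usually want to split the output by row type before working with it. The following is a minimal sketch, assuming the rows have already been downloaded as a list of dicts. The sample rows and their values are invented for illustration, and the "unknown" bucket is a hypothetical convention, not something the scraper emits.

```python
from collections import defaultdict

# Hypothetical sample rows: field names come from the table above,
# but the values are invented for illustration only.
SAMPLE_ITEMS = [
    {"_type": "generation", "make_slug": "porsche", "year_from": 1973, "year_to": 1989},
    {"_type": "variant", "make_slug": "porsche", "car_id": 1},
    {"_type": "spec", "make_slug": "porsche", "car_id": 1, "power_kw": 154},
]


def group_by_type(items):
    """Split rows into a dict keyed by their _type value."""
    groups = defaultdict(list)
    for item in items:
        # Rows without a _type are kept under "unknown" rather than dropped.
        groups[item.get("_type", "unknown")].append(item)
    return dict(groups)


groups = group_by_type(SAMPLE_ITEMS)
print({kind: len(rows) for kind, rows in groups.items()})
```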
Example input
    {
      "phase": "generations",
      "makes": ["porsche", "ferrari", "lamborghini"],
      "max_items": 0,
      "resume_from_kv": false,
      "delay_min_s": 2,
      "delay_max_s": 4
    }
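
When the same brands are run through all three phases, the inputs differ only in phase and resume_from_kv, so they can be generated rather than hand-edited. This is a rough sketch: build_input is a hypothetical helper, and its two sanity checks (known phase name, delay_min_s not above delay_max_s) are assumptions about sensible input, not the actor's documented validation.

```python
import json

PHASES = ("generations", "variants", "specs")


def build_input(phase, makes, resume_from_kv,
                delay_min_s=2, delay_max_s=4, max_items=0):
    """Return an input dict shaped like the example input above."""
    if phase not in PHASES:
        raise ValueError(f"unknown phase: {phase!r}")
    if delay_min_s > delay_max_s:
        raise ValueError("delay_min_s must not exceed delay_max_s")
    return {
        "phase": phase,
        "makes": list(makes),
        "max_items": max_items,
        "resume_from_kv": resume_from_kv,
        "delay_min_s": delay_min_s,
        "delay_max_s": delay_max_s,
    }


makes = ["porsche", "ferrari", "lamborghini"]
# The first run starts fresh; every later run resumes from saved state.
run_inputs = [
    build_input(phase, makes, resume_from_kv=(index > 0))
    for index, phase in enumerate(PHASES)
]
for run_input in run_inputs:
    print(json.dumps(run_input))
```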
Recommended run sequence
- Run 1: phase=generations, resume_from_kv=false → ~5 min
- Run 2: phase=variants, resume_from_kv=true → ~3–4 h
- Run 3: phase=specs, resume_from_kv=true → ~8–10 h
Each run saves progress to the Key-Value store. If a run times out, re-run with resume_from_kv=true and it picks up exactly where it stopped.
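
The resume behaviour can be pictured as a checkpoint that is written after every processed item. The sketch below is an illustration of that idea only, not the actor's actual code: a plain dict stands in for the Key-Value store, and the key name next_index, the run_phase function, and the max_steps parameter (used here to simulate a timeout) are all hypothetical.

```python
def run_phase(work_items, store, process, resume_from_kv, max_steps=None):
    """Process work_items in order, checkpointing the next index in `store`.

    `store` is any dict-like object standing in for the Key-Value store.
    """
    start = store.get("next_index", 0) if resume_from_kv else 0
    results = []
    for index in range(start, len(work_items)):
        if max_steps is not None and len(results) >= max_steps:
            break  # simulate a timeout partway through the run
        results.append(process(work_items[index]))
        store["next_index"] = index + 1  # checkpoint after every item
    return results


store = {}
work = ["a", "b", "c", "d", "e"]
# First run is cut short after two items; the second run resumes.
first = run_phase(work, store, str.upper, resume_from_kv=False, max_steps=2)
second = run_phase(work, store, str.upper, resume_from_kv=True)
print(first, second)
```

Writing the checkpoint after each item, rather than at the end of the run, is what lets an interrupted run continue without repeating completed work.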
Notes
- Uses Apify residential proxies to bypass Cloudflare Turnstile
- Rate-limited to 2–4 s between requests (configurable)
- Automatically retries on 429/503 with exponential backoff
- All data is pushed incrementally — no data is lost on timeout
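
The rate limiting and retry behaviour described in these notes can be sketched with the standard library alone. This is an illustrative approximation, not the actor's implementation: polite_fetch, the injected fetch callable, max_retries, and backoff_base_s are hypothetical names and defaults, and the proxy and Cloudflare handling are left out entirely.

```python
import random
import time

RETRY_STATUSES = {429, 503}


def polite_fetch(fetch, url, delay_min_s=2, delay_max_s=4,
                 max_retries=5, backoff_base_s=1.0, sleep=time.sleep):
    """Call fetch(url) -> (status, body) with a random delay before each
    request and exponential backoff after a 429 or 503 response."""
    for attempt in range(max_retries + 1):
        sleep(random.uniform(delay_min_s, delay_max_s))  # rate limiting
        status, body = fetch(url)
        if status not in RETRY_STATUSES:
            return status, body
        if attempt < max_retries:
            sleep(backoff_base_s * 2 ** attempt)  # base, 2x base, 4x base, ...
    raise RuntimeError(f"gave up on {url} after {max_retries + 1} attempts")


# Usage with a fake fetcher that fails twice, then succeeds.
# Sleeps are recorded in a list instead of actually waiting.
responses = iter([(429, ""), (503, ""), (200, "ok")])
waits = []
result = polite_fetch(lambda url: next(responses),
                      "https://example.com/page", sleep=waits.append)
print(result)
```

Passing the sleep function in as a parameter keeps the sketch testable without real delays.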