Automobile Catalog — Luxury Car Taxonomy Scraper avatar

Automobile Catalog — Luxury Car Taxonomy Scraper

Under maintenance

Pricing

Pay per usage

Go to Apify Store
Automobile Catalog — Luxury Car Taxonomy Scraper

Automobile Catalog — Luxury Car Taxonomy Scraper

Under maintenance

Scrapes make/model/generation/variant/specs taxonomy from automobile-catalog.com for luxury and classic car brands.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

ScrappingLatam

ScrappingLatam

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

20 hours ago

Last modified

Share

Scrapes the complete make/model/generation/variant/specs taxonomy from automobile-catalog.com for 65 luxury and classic car brands — Porsche, Ferrari, Mercedes-Benz, Lamborghini, Aston Martin, Bentley, Rolls-Royce, McLaren, Bugatti, and more.

What does it do?

Runs a 3-phase pipeline:

  1. Generations — downloads all model generations per brand (e.g. Porsche 911 3rd gen, 1973–1989). ~1,500 rows. Fast (~5 min).
  2. Variants — for each generation, downloads all individual variants (e.g. 911 Carrera RS 2.7 Coupe). ~8,000–12,000 rows. Medium (~3–4 h).
  3. Specs — for each variant, downloads the full technical spec sheet (engine, power, torque, dimensions, weight, etc.). ~8,000–12,000 rows. Slow (~8–10 h).

Run each phase separately using resume_from_kv=true to chain runs efficiently.

Output fields

FieldDescription
_typegeneration / variant / spec
make_slugBrand slug (porsche, mercedes-benz)
model_slugModel/generation slug
family_slugFamily slug (without generation suffix)
year_from / year_toProduction years
car_idStable numeric ID from automobile-catalog.com
variant_slugFull variant slug
spec_urlSource URL
power_hp / power_kw / power_psEngine power in 3 units
torque_nmTorque in Nm
displacement_cm3Engine displacement
transmission_typemanual / automatic
body_stylecoupe, convertible, sedan, etc.
fuel_typegasoline, diesel, electric, hybrid
accel_0_100_kmh_s0–100 km/h acceleration
top_speed_kmhTop speed
curb_weight_kgCurb weight in kg

Example input

{
"phase": "generations",
"makes": ["porsche", "ferrari", "lamborghini"],
"max_items": 0,
"resume_from_kv": false,
"delay_min_s": 2,
"delay_max_s": 4
}
Run 1: phase=generations, resume_from_kv=false → ~5 min
Run 2: phase=variants, resume_from_kv=true → ~3-4 h
Run 3: phase=specs, resume_from_kv=true → ~8-10 h

Each run saves progress to the Key-Value store. If a run times out, re-run with resume_from_kv=true and it picks up exactly where it stopped.

Notes

  • Uses Apify residential proxies to bypass Cloudflare Turnstile
  • Rate-limited to 2–4 s between requests (configurable)
  • Automatically retries on 429/503 with exponential backoff
  • All data is pushed incrementally — no data is lost on timeout