Italian  Companies Data Scraper avatar

Italian Companies Data Scraper

Pricing

from $1.00 / 1,000 results

Go to Apify Store
Italian  Companies Data Scraper

Italian Companies Data Scraper

Extract Italian company data by category from aziende.it. Get company name, revenue range, ATECO code, location and optional VAT, address, postal code and employees. Ideal for market research, CRM prep, BI datasets and B2B analysis.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

Francesco Ayrton Davoli

Francesco Ayrton Davoli

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

2

Monthly active users

9 days ago

Last modified

Categories

Share

Italian Company Data Scraper

Extract structured Italian company data from aziende.it by business category.

This Actor is built for users who need clean Italian business datasets for market research, B2B research, CRM preparation, territory mapping, competitive intelligence, SEO analysis, supplier discovery, investment scouting, and BI or AI data workflows.

The default mode is fast and lightweight. It collects company listing data such as company name, revenue range, ATECO code, province, city and source detail URL. An optional details mode opens each company page and enriches the dataset with VAT number, address, postal code and employee range when available. Because the details mode fetches one extra page per company, it takes noticeably more time and platform resources than listing mode — see Optional full-detail mode below before enabling it.

What you can scrape

Validated categories currently supported:

  • informatica
  • software
  • consulenza
  • marketing
  • edilizia
  • immobiliare
  • turismo
  • trasporti

Categories such as food, textile, restaurants and mechanics were tested but are not exposed as validated presets yet because the tested source URLs returned no compatible rows. You can still use Advanced category URLs if you have a specific aziende.it category URL you want to scrape.

Main use cases

  • Build Italian company datasets by business sector
  • Map companies by city, province, ATECO code or revenue range
  • Prepare CRM imports before email/contact enrichment
  • Identify companies in a target industry or region
  • Analyze market density and regional business distribution
  • Build lead research datasets without claiming direct email coverage
  • Feed dashboards, spreadsheets, BI tools, data warehouses or AI workflows
  • Research suppliers, competitors, agencies, transport companies, real estate firms or construction businesses

Fast listing mode

By default, Include company details is disabled.

This mode is much faster because it only reads category listing pages.

Listing fields include:

  • Company name
  • Requested category
  • Revenue range
  • Parsed revenue minimum in EUR
  • Parsed revenue maximum in EUR
  • ATECO code
  • Province
  • City
  • Source category label
  • Company detail page URL
  • Source category URL

Use this mode when you need thousands of records quickly.

Optional full-detail mode

Enable Include company details when you need richer registry-style fields.

Full-detail enrichment can add:

  • VAT number
  • Address
  • Postal code
  • Employee range
  • Tax code, when available
  • REA registry number, when available
  • Legal form, when available
  • Activity status, when available
  • Foundation date, when available

Full-detail mode is slower and more resource-intensive than listing mode, because every company requires its own additional detail-page request. In practice this means:

  • More time: runs take much longer — plan to raise the run timeout (see below).
  • More compute: longer runs consume more Apify compute units (and therefore more platform usage/cost).
  • More proxy traffic: one extra request per company increases proxy bandwidth usage.

Based on current tests, about 100 companies with details are pushed to the dataset in roughly 1 minute. The exact speed depends on source response time, proxy performance and Apify runtime conditions. As a rule of thumb: listing-only is the right choice for large datasets, and details mode is worth its extra cost only when you specifically need VAT number, address, postal code or employee range.

For full-detail runs, increase the Actor timeout in Run options:

  • 500 companies with details: use at least 900 seconds
  • 1,000 companies with details: use at least 1,800 seconds
  • 5,000 companies with details: use around 7,200 seconds

For larger full-detail runs, increasing memory from 1 GB to 2 GB can be useful.

Balanced multi-category runs

Use Maximum results per category when scraping multiple categories and you want a balanced dataset.

Example: if you select software, marketing and edilizia, and set:

"maxItemsPerCategory": 1000

then the Actor will try to collect up to 1,000 companies per requested category, while still respecting the overall Maximum results limit.

Set Maximum results per category to 0 to disable category-level balancing and use only the global Maximum results limit.

Input fields

Business category

Main business category to scrape.

Validated values:

  • informatica
  • software
  • consulenza
  • marketing
  • edilizia
  • immobiliare
  • turismo
  • trasporti

Additional categories

Optional extra categories to merge into the same dataset.

Example:

["software", "consulenza", "marketing"]

Maximum results per category

Optional category-level limit.

Use it when scraping more than one category and you want balanced results.

Examples:

  • 0: disabled
  • 100: up to 100 companies per category
  • 1000: up to 1,000 companies per category

Maximum results

Overall maximum number of companies to save.

This always acts as a global safety cap.

Include company details

Optional checkbox.

  • Disabled: fast listing-only dataset
  • Enabled: slower full-detail dataset with VAT number, address, postal code and employee range when available

Try website contact enrichment

Advanced optional setting.

If a real company website is found, the Actor can try to inspect it for public contact data. This is slower and often returns few results because aziende.it rarely exposes company websites.

Scrape all validated categories

Runs all validated preset categories.

Recommended only when you want a broad dataset. Use it with Maximum results per category to avoid one large category dominating the output.

Advanced category URLs

Optional direct aziende.it category URLs for users who want to bypass the preset category mapping.

Example: fast default run

{
"category": "informatica",
"categories": ["software"],
"maxItemsPerCategory": 0,
"maxItems": 5000,
"includeDetails": false,
"includeWebsiteContacts": false,
"proxyConfig": {
"useApifyProxy": true
}
}

Example: balanced multi-category run

{
"category": "software",
"categories": ["marketing", "edilizia", "immobiliare"],
"maxItemsPerCategory": 1000,
"maxItems": 4000,
"includeDetails": false,
"includeWebsiteContacts": false,
"proxyConfig": {
"useApifyProxy": true
}
}

Example: full-detail run

{
"category": "software",
"categories": ["consulenza"],
"maxItemsPerCategory": 250,
"maxItems": 500,
"includeDetails": true,
"includeWebsiteContacts": false,
"proxyConfig": {
"useApifyProxy": true
}
}

For this full-detail example, set the run timeout to at least 900 seconds.

Output schema

The dataset table uses English labels for international buyers. Raw field keys are source-compatible and may use Italian names internally.

Dataset labelRaw fieldModeDescription
Company nameragioneSocialeListingLegal/company name
Requested categorycategoryKeyListingUser-selected/preset category key
Revenue rangefatturatoListingRevenue range from the source
Revenue min EURfatturatoMinEurListingParsed lower revenue estimate
Revenue max EURfatturatoMaxEurListingParsed upper revenue estimate
ATECO codeatecoListingItalian business activity code
ProvinceprovinciaListingItalian province
CitycittaListingCity
Source category labelcategoriaListingCategory label shown by the source
Company detail pagedetailUrlListingCompany detail page URL
VAT numberpartitaIvaDetailsItalian VAT number, when detail mode is enabled
AddressindirizzoDetailsCompany address, when detail mode is enabled
Postal codecapDetailsItalian postal code, when detail mode is enabled
EmployeesdipendentiDetailsEmployee range, when detail mode is enabled
Tax codecodiceFiscaleDetailsTax code, when available
REAreaDetailsREA registry number, when available
PhonetelefonoRarePhone, rarely available from the source
EmailemailRareEmail, rarely available from the source
PECpecRareCertified email, rarely available from the source
WebsitesitoWebRareWebsite, rarely available from the source
Legal formformaGiuridicaDetailsLegal form, when available
Activity statusstatoAttivitaDetailsActivity status, when available
Foundation datedataCostituzioneDetailsFoundation/incorporation date, when available
Source category URLsourceCategoryUrlListingaziende.it category URL used as source
Details includeddetailScrapedSystemBoolean flag indicating whether detail enrichment was performed

Important limitation about contacts

This Actor should be positioned as an Italian company registry dataset scraper, not as an email scraper.

aziende.it rarely exposes real email, PEC, phone or website fields on company detail pages. These fields are included in the output schema because they may appear in some cases, but they are often null.

For email, PEC, phone or website discovery at scale, use a separate contact-enrichment workflow after exporting this dataset.

Performance guidance

Listing-only mode is the recommended default for large datasets.

Full-detail mode is useful when VAT number, address, postal code and employee range are required, but it is slower because every company requires an additional detail-page request.

Current tested speed for full-detail enrichment is roughly 100 enriched companies per minute pushed to the dataset. For large full-detail runs, always increase timeout and monitor retries.

Recommended product positioning:

Extract structured Italian company data by validated business category, including company name, revenue range, ATECO code, location and optional VAT/address/employee enrichment.

Avoid positioning this Actor as:

Italian business email scraper

The source is strong for company registry-style fields, not for contact emails.