Actor picture

DNB Companies Scraper

dtrungtin/dnb-companies-scraper

Filtering and extracting data about companies from https://www.dnb.com. Easily filtering results by different criteria such as revenue minimum, number of employees minimum, located in different countries, etc.

Free trial for 14 days

Then $30/month

No credit card required now

Author's avatarTin Duong
  • Modified
  • Users13
  • Runs266
Actor picture
DNB Companies Scraper

Free trial for 14 days

Then $30/month

DNB Companies Scraper

DNB Companies Scraper is an Apify actor for filtering and extracting data about companies from DNB. It allows you to extract all companies' information. It is build on top of Apify SDK and you can run it both on Apify platform and locally.

Input

Input is a JSON object with the following properties:

{
    "searchTerm": SEARCH_TERM,
    "revenueMin": REVENUE_MIN_FILTER,
    "numberOfEmployeesMin": NUMBER_OF_EMPLOYEES_MIN_FILTER,
    "yearStartFrom": YEAR_START_FROM_FILTER,
    "countryIn": COUNTRY_IN_FILTER,
    "industryIn": INDUSTRY_IN_FILTER,
    "maxItems": MAXIMUM_ITEMS,
    "proxyConfiguration": APIFY_PROXY_CONFIG,
}

Example:

{
  "searchTerm": "Apple",
  "revenueMin": 1000000,
  "numberOfEmployeesMin": 100,
  "yearStartFrom": 1980,
  "countryIn": "Australia, Germany, Russian, Spain, Japan, United, France",
  "industryIn": "Computer, Retail",
  "maxItems": 10,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyCountry": "US"
  }
}

Output

Output is stored in a dataset. Each item is an information about a company. Example:

{
  "url": "https://www.dnb.com/business-directory/company-profiles.apple_pty_limited.1b1106bd70f6fd219ede98d370ae92ce.html",
  "name": "APPLE PTY LIMITED",
  "description": "APPLE PTY LIMITED is located at 367 GEORGE STREET in SYDNEY, NSW and offers Other Electronic Parts and Equipment Merchant Wholesalers products and services.",
  "website": "www.apple.com.au",
  "numberOfEmployees": 4137,
  "telephone": "289878000",
  "addressLocality": "SYDNEY",
  "addressCountry": "Australia",
  "addressRegion": "NSW",
  "streetAddress": "367 GEORGE STREET",
  "postalCode": "2000",
  "type": "Parent",
  "role": "Proprietorship",
  "industry": [
    "Household Appliances and Electrical and Electronic Goods Merchant Wholesalers",
    "Professional and Commercial Equipment and Supplies Merchant Wholesalers",
    "Electronics and Appliance Stores",
    "Other Miscellaneous Store Retailers",
    "Electronic parts and equipment, nec",
    "Computers, peripherals, and software",
    "Computer and software stores",
    "Miscellaneous retail stores, nec"
  ],
  "principals": [
    {
      "name": "ANTHONY JAMES KING",
      "position": "Chief Executive Officer"
    },
    {
      "name": "PETER RONALD DENWOOD",
      "position": "Director"
    },
    {
      "name": "ANTHONY JAMES KING",
      "position": "Director"
    },
    {
      "name": "REBECCA-JANE MCCALLUM",
      "position": "Secretary"
    },
    {
      "name": "PAUL ANTHONY WHITTINGHAM",
      "position": "Director and Secretary"
    }
  ],
  "breadcrumb": [
    "HOME",
    "BUSINESS DIRECTORY",
    "WHOLESALE TRADE",
    "MERCHANT WHOLESALERS, DURABLE GOODS",
    "HOUSEHOLD APPLIANCES AND ELECTRICAL AND ELECTRONIC GOODS MERCHANT WHOLESALERS",
    "AUSTRALIA",
    "NEW SOUTH WALES",
    "SYDNEY"
  ],
  "revenue": 8060000000,
  "fiscalYearEnd": "SEP",
  "yearStart": 1982,
  "incorporated": 1982
}

Compute units consumption

Keep in mind that it is much more efficient to run one longer scrape (at least one minute) than more shorter ones because of the startup time.

The average consumption is 1 Compute unit for 1000 actor pages scraped

Using proxies

DNB now aggressively blocks scrapers. Currently, the only reliable solution to this problem is to use residential proxies or provide your own proxy servers. Datacenter proxies will not work.

Apify residential proxies

The Apify platform provides residential proxies if you have a paid subscription. These proxies are only available to be run within actors on the Apify platform, not externally. If you are interested in using residential proxies for this scraper, contact support@apify.com via email or in-app chat to get the proxies enabled.

Epilogue

Thank you for trying my actor. I will be very glad for a feedback that you can send to my email dtrungtin@gmail.com.

Industries

See how DNB Companies Scraper is used in industries around the world