Crunchbase.com Insights Extractor avatar
Crunchbase.com Insights Extractor
Try for free

7 days trial then $40.00/month - No credit card required now

View all Actors
Crunchbase.com Insights Extractor

Crunchbase.com Insights Extractor

epctex/crunchbase-scraper
Try for free

7 days trial then $40.00/month - No credit card required now

Crunchbase Data Extractor empowers you to unlock valuable insights from millions of organizations. Extract acquisitions, people, reports, events, and more with customizable search for targeted results. Supercharge your research and business intelligence with ease.

Crunchbase Scraper

Crunchbase scraper

Since Crunchbase doesn't provide a proper and free API, this actor should help you to retrieve data from it.

The Crunchbase data scraper supports the following features:

  • Scrape organization details - You can scrape attributes like about, number of employees, technology, summary, people working, or investment details of an organization. You can find details below.

  • Scrape person details - You can scrape attributes like title, name, CB Rank, primary organization, jobs, or related hubs of a person. You can find details below.

  • Scrape event details - You can scrape attributes like speakers, name, location, date, venue, and registration links of an event. You can find details below.

  • Scrape hub details - You can scrape attributes like the number of founders, name, founded date, acquired percentage, and so on. You can find details below.

  • Scrape by keyword - You can use location-wise keywords to search specific search lists. Also, you can directly point out rental, for sale, or sold properties on this feature.

Possible Use-cases

  • Competitor analysis: You can use this actor to get detailed information about your competitors

  • Data Analysis: - You can analyze CrunchBase data any way you want from organizations to events.

  • News and Signals: Get news and signals from organizations

  • Due Dillegence

Bugs, fixes, updates, and changelog

This scraper is under active development. If you have any feature requests you can create an issue from here.

Incoming Changes

  • Advanced search
  • Fetch full descriptions alongside short_description

Input Parameters

The input of this scraper should be JSON containing the list of pages on Crunchbase that should be visited. Possible fields are:

  • search: (Optional) (String) Keyword that can be searched in Crunchbase search engine. When it is present, mode must be used as well.

  • mode: (Optional) (String) Mode of the actor. It gets the keyword from search parameter and initiates the search according to the mode. Can be all, organizations, events, hubs, or people. When present, search must be provided as well.

  • startUrls: (Optional) (Array) List of Crunchbase URLs. You should only provide organization detail, person detail, event detail, or hub detail URLs.

  • maxItems: (Optional) (Number) You can limit scraped items. This should be useful when you search through the big lists or search results.

  • proxy: (Required) (Proxy Object) Proxy configuration.

  • extendOutputFunction: (Optional) (String) Function that takes a JQuery handle ($) as an argument and returns an object with data.

  • customMapFunction: (Optional) (String) Function that takes each object's handle as an argument and returns the object with executing the function.

This solution requires the use of Proxy servers, either your own proxy servers or you can use Apify Proxy.

Compute Unit Consumption

The actor is optimized to run blazing fast and scrape many items as possible. Therefore, it forefronts all item detail requests. If the actor doesn't block very often it'll scrape 100 items in 1 minute with ~0.04-0.05 compute units.

Crunchbase Scraper Input example

1{
2    "startUrls": [
3        "https://www.crunchbase.com/organization/warner-law-offices",
4        "https://www.crunchbase.com/person/warner-l-baxter",
5        "https://www.crunchbase.com/hub/warner-bros-alumni-founded-companies",
6        "https://www.crunchbase.com/event/messenger-chatbots-the-secret-to-8x-engagement-20171010"
7    ],
8    "search": "warner",
9    "proxy": {
10        "useApifyProxy": true
11    },
12    "mode": "all",
13    "maxItems": 500
14}

During the Run

During the run, the actor will output messages letting you know what is going on. Each message always contains a short label specifying which page from the provided list is currently specified. When items are loaded from the page, you should see a message about this event with a loaded item count and total item count for each page.

If you provide incorrect input to the actor, it will immediately stop with a failure state and output an explanation of what is wrong.

Crunchbase Export

During the run, the actor stores results into a dataset. Each item is a separate item in the dataset.

You can manage the results in any language (Python, PHP, Node JS/NPM). See the FAQ or our API reference to learn more about getting results from this Crunchbase actor.

Example Crunchbase Items

An example structure of items in Crunchbase looks like this:

1{
2    "properties": {
3        "identifier": {
4            "uuid": "27e940be-6f97-2ee0-b8a9-5896fc45bc2a",
5            "value": "Warner L. Baxter",
6            "image_id": "v1460807093/enb7lpjnglfxlapctkkp.png",
7            "permalink": "warner-l-baxter",
8            "entity_def_id": "person"
9        },
10        "facet_ids": ["rank"],
11        "title": "Warner L. Baxter - Chairman, President and Chief Executive Officer @ Ameren Services",
12        "short_description": "Business Experience:Mr. Baxter, 54, is the Chairman, President and Chief Executive Officer of Ameren Corporation, a regulated electric and g..."
13    },
14    "cards": {
15        "education_summary": {
16            "identifier": {
17                "uuid": "27e940be-6f97-2ee0-b8a9-5896fc45bc2a",
18                "value": "Warner L. Baxter",
19                "image_id": "v1460807093/enb7lpjnglfxlapctkkp.png",
20                "permalink": "warner-l-baxter",
21                "entity_def_id": "person"
22            }
23        },
24        "current_board_and_advisory_roles_image_list": [
25            {
26                "identifier": {
27                    "uuid": "4dbd38e5-3f9b-9d0b-4c8a-e2c71e2f7d41",
28                    "value": "Warner L. Baxter Board of Director @ Ameren Services",
29                    "permalink": "warner-l-baxter-board-member-ameren--4dbd38e5",
30                    "entity_def_id": "job"
31                },
32                "organization_identifier": {
33                    "uuid": "fa3d8616-afc6-abc5-c103-a7a1074eeb79",
34                    "value": "Ameren Services",
35                    "image_id": "v1402650462/arrtxhwudi5syoqleyqe.png",
36                    "permalink": "ameren",
37                    "entity_def_id": "organization"
38                },
39                "title": "Board of Director"
40            },
41            {
42                "identifier": {
43                    "uuid": "ce8f5c6f-6e3c-4d4c-b0a0-c1a5d8ab3e6f",
44                    "value": "Warner L. Baxter Board Of Director @ U.S. Bancorp",
45                    "permalink": "warner-l-baxter-board-member-u-s-bancorp--ce8f5c6f",
46                    "entity_def_id": "job"
47                },
48                "organization_identifier": {
49                    "uuid": "9561f613-9cc0-d418-da8c-2f545b6eb8b6",
50                    "value": "U.S. Bancorp",
51                    "image_id": "v1397179096/f5615a5f64426ad47ec9c4e43409a1bf.png",
52                    "permalink": "u-s-bancorp",
53                    "entity_def_id": "organization"
54                },
55                "title": "Board Of Director"
56            }
57        ],
58        "current_jobs_image_list": [
59            {
60                "identifier": {
61                    "uuid": "7aba5665-f138-df29-c604-b32415e569e1",
62                    "value": "Warner L. Baxter Chairman, President and Chief Executive Officer @ Ameren Services",
63                    "permalink": "warner-l-baxter-employee-ameren--7aba5665",
64                    "entity_def_id": "job"
65                },
66                "organization_identifier": {
67                    "uuid": "fa3d8616-afc6-abc5-c103-a7a1074eeb79",
68                    "value": "Ameren Services",
69                    "image_id": "v1402650462/arrtxhwudi5syoqleyqe.png",
70                    "permalink": "ameren",
71                    "entity_def_id": "organization"
72                },
73                "title": "Chairman, President and Chief Executive Officer"
74            }
75        ],
76        "board_and_advisory_roles_summary": {
77            "identifier": {
78                "uuid": "27e940be-6f97-2ee0-b8a9-5896fc45bc2a",
79                "value": "Warner L. Baxter",
80                "image_id": "v1460807093/enb7lpjnglfxlapctkkp.png",
81                "permalink": "warner-l-baxter",
82                "entity_def_id": "person"
83            },
84            "num_current_advisor_jobs": 2
85        },
86        "overview_fields2": {},
87        "past_jobs_list": [],
88        "overview_fields_v2": {
89            "primary_job_title": "Chairman, President and Chief Executive Officer",
90            "primary_organization": {
91                "uuid": "fa3d8616-afc6-abc5-c103-a7a1074eeb79",
92                "value": "Ameren Services",
93                "image_id": "v1402650462/arrtxhwudi5syoqleyqe.png",
94                "permalink": "ameren",
95                "entity_def_id": "organization"
96            }
97        },
98        "past_board_and_advisory_roles_list": [],
99        "overview_fields": {
100            "gender": "male"
101        },
102        "jobs_summary": {
103            "identifier": {
104                "uuid": "27e940be-6f97-2ee0-b8a9-5896fc45bc2a",
105                "value": "Warner L. Baxter",
106                "image_id": "v1460807093/enb7lpjnglfxlapctkkp.png",
107                "permalink": "warner-l-baxter",
108                "entity_def_id": "person"
109            },
110            "num_current_jobs": 1
111        },
112        "overview_description": {
113            "description": "Business Experience:Mr. Baxter, 54, is the Chairman, President and Chief Executive Officer of Ameren Corporation, a regulated electric and gas utility company serving customers in Missouri and Illinois. He has served in these positions since 2014. Mr. Baxter served as Chairman, President and Chief Executive Officer of Ameren Missouri from 2009 to 2014 and as Executive Vice President and Chief Financial Officer of Ameren Corporation from 2003 to 2009. In addition, he also served as President and Chief Executive Officer of Ameren Services from 2007 to 2009."
114        },
115        "board_and_advisory_roles_headline": {
116            "num_current_advisor_jobs": 2
117        },
118        "overview_headline": {
119            "num_current_advisor_jobs": 2,
120            "rank_person": 104129
121        },
122        "news_headline": {
123            "num_articles": 9
124        },
125        "event_appearances_summary": {
126            "identifier": {
127                "uuid": "27e940be-6f97-2ee0-b8a9-5896fc45bc2a",
128                "value": "Warner L. Baxter",
129                "image_id": "v1460807093/enb7lpjnglfxlapctkkp.png",
130                "permalink": "warner-l-baxter",
131                "entity_def_id": "person"
132            }
133        },
134        "education_image_list": [],
135        "event_appearances_list": [],
136        "hubs_list": [
137            {
138                "identifier": {
139                    "permalink": "electrical-distribution-companies",
140                    "image_id": "qunvrh1ipdxiveejujcq",
141                    "uuid": "3cbc8c65-6619-4954-967c-978ca8cc17d8",
142                    "entity_def_id": "hub",
143                    "value": "Electrical Distribution Companies"
144                },
145                "rank_hub": 2391
146            },
147            {
148                "identifier": {
149                    "permalink": "solar-companies",
150                    "image_id": "qxktzxyjyoo3ojtsv1cj",
151                    "uuid": "b209864f-5f74-447f-b7f5-4d91e1573fcb",
152                    "entity_def_id": "hub",
153                    "value": "Solar Companies"
154                },
155                "rank_hub": 3738
156            },
157            {
158                "identifier": {
159                    "permalink": "energy-public-companies",
160                    "image_id": "v1488039292/yesmtc8tpggbkfvfakjj.png",
161                    "uuid": "d628bfb1-0955-4d52-81a6-0792c5768474",
162                    "entity_def_id": "hub",
163                    "value": "Energy Public Companies"
164                },
165                "rank_hub": 5594
166            },
167            {
168                "identifier": {
169                    "permalink": "electrical-distribution-public-companies",
170                    "image_id": "v1470302873/fgmqdg1rqbrfqningev0.png",
171                    "uuid": "f3a7290e-c6f1-446a-a20a-5f19cae814e5",
172                    "entity_def_id": "hub",
173                    "value": "Electrical Distribution Public Companies"
174                },
175                "rank_hub": 23763
176            },
177            {
178                "identifier": {
179                    "permalink": "oil-and-gas-companies",
180                    "image_id": "v1488039292/yesmtc8tpggbkfvfakjj.png",
181                    "uuid": "abf2ae9a-ee16-4232-957b-9da73fdcc902",
182                    "entity_def_id": "hub",
183                    "value": "Oil and Gas Companies"
184                },
185                "rank_hub": 884
186            },
187            {
188                "identifier": {
189                    "permalink": "oil-and-gas-companies-that-exited",
190                    "image_id": "v1488039292/yesmtc8tpggbkfvfakjj.png",
191                    "uuid": "311c6486-7c4f-4166-99fc-479b88410bd1",
192                    "entity_def_id": "hub",
193                    "value": "Oil and Gas Companies that Exited"
194                },
195                "rank_hub": 3550
196            },
197            {
198                "identifier": {
199                    "permalink": "energy-companies-that-exited",
200                    "image_id": "v1488039292/yesmtc8tpggbkfvfakjj.png",
201                    "uuid": "6edc42a8-c17d-4725-b5ec-1fbc79f80d3c",
202                    "entity_def_id": "hub",
203                    "value": "Energy Companies that Exited"
204                },
205                "rank_hub": 2659
206            },
207            {
208                "identifier": {
209                    "permalink": "electrical-distribution-companies-that-exited",
210                    "image_id": "v1470302873/fgmqdg1rqbrfqningev0.png",
211                    "uuid": "1a73b0ae-6105-4d96-b3bc-253514629c5b",
212                    "entity_def_id": "hub",
213                    "value": "Electrical Distribution Companies that Exited"
214                },
215                "rank_hub": 10515
216            },
217            {
218                "identifier": {
219                    "permalink": "solar-companies-that-exited",
220                    "image_id": "m06tssvwea7mcbya02ik",
221                    "uuid": "05acd309-af54-44bd-9c37-7de789c86271",
222                    "entity_def_id": "hub",
223                    "value": "Solar Companies that Exited"
224                },
225                "rank_hub": 18733
226            },
227            {
228                "identifier": {
229                    "permalink": "consumer-electronics-companies-that-exited",
230                    "image_id": "v1475583702/sjuchl5dc4p56li5qlxj.png",
231                    "uuid": "4813123b-b076-4d83-802b-99224e06de25",
232                    "entity_def_id": "hub",
233                    "value": "Consumer Electronics Companies that Exited"
234                },
235                "rank_hub": 9097
236            }
237        ],
238        "jobs_headline": {
239            "num_current_jobs": 1
240        },
241        "event_appearances_headline": {},
242    },
243    "scrapedType": "person",
244}

Contact

Please visit us through epctex.com to see all the products that are available for you. If you are looking for any custom integration or so, please reach out to us through the chat box in epctex.com. In need of support? devops@epctex.com is at your service.

Developer
Maintained by Community
Actor metrics
  • 32 monthly users
  • 99.6% runs succeeded
  • Created in May 2021
  • Modified about 13 hours ago