This Apify actor crawls the Bizi.si website to extract company details such as title, URL, phone number, and email. It iterates through all browse pages based on specified criteria and collects data from each listed company.

Features

Iterates through all browse pages: The crawler navigates through all pages in the browse section of Bizi.si, capturing links to individual company profiles.
Scrapes company details: For each company, it extracts the title, URL, phone number, and email address.
Handles dynamic content and pagination: The crawler is designed to navigate through multiple pages and handle dynamic content loading.
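The actor's source code is not reproduced here, so the following is only a minimal sketch of the browse-page iteration described above, not the actor's implementation. The `fetch`, `is_browse_page`, and `is_company_page` callables are hypothetical: the caller supplies them, because the real Bizi.si URL patterns and selectors are not stated in this README.

```python
from collections import deque
from html.parser import HTMLParser
from typing import Callable
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag in an HTML document."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def crawl_browse_pages(
    start_url: str,
    fetch: Callable[[str], str],             # hypothetical: returns the HTML for a URL
    is_browse_page: Callable[[str], bool],   # hypothetical: recognises pagination links
    is_company_page: Callable[[str], bool],  # hypothetical: recognises company profiles
) -> list[str]:
    """Visit every browse page reachable from start_url; return company URLs."""
    queue = deque([start_url])
    seen = {start_url}
    companies: list[str] = []
    while queue:
        page_url = queue.popleft()
        parser = LinkCollector()
        parser.feed(fetch(page_url))
        for href in parser.hrefs:
            url = urljoin(page_url, href)  # resolve relative links
            if url in seen:
                continue  # never queue or record the same URL twice
            seen.add(url)
            if is_browse_page(url):
                queue.append(url)
            elif is_company_page(url):
                companies.append(url)
    return companies
```

The `seen` set keeps the loop from revisiting pages when pagination links point backwards as well as forwards. This sketch parses static HTML only; content loaded by JavaScript would need a browser-based fetcher behind `fetch`.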

Input Configuration

The crawler can be configured with the following inputs:

startUrl (String, Required): The starting URL of the browse page you wish to crawl. Example: https://www.bizi.si/TSMEDIA/V/vulkanizerstvo-4940/?f=activity&cls=TSMEDIA&chr=V&actss=4940&actsd=vulkanizerstvo&rw=1
maxConcurrency (Number, Optional): The maximum number of concurrent pages that the crawler will process. Default is 10.
proxyConfiguration (Object, Optional): Configure proxies to avoid being blocked. Default is to use Apify's proxies.
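As an illustration of how these three inputs and their defaults fit together, here is a minimal sketch of input normalization. It is not the actor's own validation code. The function name `normalize_input` is hypothetical, and `{"useApifyProxy": true}` as the default proxy object is an assumption based on the statement that Apify's proxies are used by default.

```python
from typing import Any

DEFAULT_MAX_CONCURRENCY = 10  # default stated in the input description
DEFAULT_PROXY = {"useApifyProxy": True}  # assumed shape of "use Apify's proxies"


def normalize_input(raw: dict[str, Any]) -> dict[str, Any]:
    """Check the required field and fill in defaults for the optional ones."""
    start_url = raw.get("startUrl")
    if not isinstance(start_url, str) or not start_url.startswith(("http://", "https://")):
        raise ValueError("startUrl is required and must be an http(s) URL")

    max_concurrency = raw.get("maxConcurrency", DEFAULT_MAX_CONCURRENCY)
    # bool is a subclass of int in Python, so reject it explicitly
    if (
        isinstance(max_concurrency, bool)
        or not isinstance(max_concurrency, int)
        or max_concurrency < 1
    ):
        raise ValueError("maxConcurrency must be a positive integer")

    return {
        "startUrl": start_url,
        "maxConcurrency": max_concurrency,
        "proxyConfiguration": raw.get("proxyConfiguration", dict(DEFAULT_PROXY)),
    }
```

Calling `normalize_input({"startUrl": "https://www.bizi.si/"})` fills in `maxConcurrency` as 10 and the assumed default proxy object, while an input without `startUrl` raises `ValueError`.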

Example Input

{
  "startUrl": "https://www.bizi.si/TSMEDIA/V/vulkanizerstvo-4940/?f=activity&cls=TSMEDIA&chr=V&actss=4940&actsd=vulkanizerstvo&rw=1",
  "maxConcurrency": 5,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["SHADER"]
  }
}

Output

The crawler will output a dataset with the following structure for each company:

title: The name of the company.
url: The URL of the company's profile page.
info_phone: The phone number listed for the company.
info_email: The email address listed for the company.

Example:

{
  "title": "Vulkanizerstvo ABC",
  "url": "https://www.bizi.si/TSMEDIA/V/vulkanizerstvo-4940/?f=activity&cls=TSMEDIA&chr=V&actss=4940&actsd=vulkanizerstvo&rw=1",
  "info_phone": "01 234 5678",
  "info_email": "info@vulkanizerstvoabc.si"
}
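For consumers of the dataset, the record shape above can be written down as a typed structure. This is a sketch only: `CompanyRecord` and `make_record` are hypothetical names, and representing a missing phone number or email as `None` is an assumption, since the README does not say how the actor reports absent values.

```python
from typing import Optional, TypedDict


class CompanyRecord(TypedDict):
    """One dataset item, using the field names listed in the Output section."""

    title: str
    url: str
    info_phone: Optional[str]  # None when no phone is listed (assumption)
    info_email: Optional[str]  # None when no email is listed (assumption)


def _clean(value: Optional[str]) -> Optional[str]:
    """Collapse runs of whitespace; map empty or missing values to None."""
    if value is None:
        return None
    text = " ".join(value.split())
    return text or None


def make_record(
    title: str,
    url: str,
    phone: Optional[str] = None,
    email: Optional[str] = None,
) -> CompanyRecord:
    """Build a normalized record from raw scraped strings."""
    return {
        "title": _clean(title) or "",
        "url": url.strip(),
        "info_phone": _clean(phone),
        "info_email": _clean(email),
    }
```

Normalizing whitespace at this point keeps values such as phone numbers comparable across records, which helps when deduplicating companies that appear on more than one browse page.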

Installation and Usage

Clone the repository or create a new actor on Apify using the code.
Set up the input: Provide the startUrl in the input configuration to specify the browse page you want to crawl.
Run the actor: You can run the actor on the Apify platform. The actor will start from the specified URL, navigate through all pages, and collect data from each company listed.

Notes

Captcha Handling: If the site presents captchas, you may need to integrate a captcha-solving service or manually intervene.
Rate Limiting: To avoid being blocked, consider adjusting the concurrency settings and implementing random delays between requests.
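The random delays mentioned above could look something like the following sketch. The function names and the default values of 1 and 2 seconds are illustrative assumptions, not settings taken from the actor.

```python
import random
import time
from typing import Callable, Optional


def jittered_delay(
    base_seconds: float = 1.0,
    jitter_seconds: float = 2.0,
    rng: Optional[random.Random] = None,
) -> float:
    """Return a delay between base_seconds and base_seconds + jitter_seconds."""
    if base_seconds < 0 or jitter_seconds < 0:
        raise ValueError("delays must be non-negative")
    source = rng if rng is not None else random
    return base_seconds + source.uniform(0.0, jitter_seconds)


def polite_sleep(
    base_seconds: float = 1.0,
    jitter_seconds: float = 2.0,
    rng: Optional[random.Random] = None,
    sleep: Callable[[float], None] = time.sleep,
) -> float:
    """Sleep for a jittered delay and return how long the pause was."""
    delay = jittered_delay(base_seconds, jitter_seconds, rng)
    sleep(delay)
    return delay
```

Calling `polite_sleep()` before each request spreads requests out irregularly, which is less likely to look like automated traffic than a fixed interval. Passing a seeded `random.Random` and a stub `sleep` makes the behaviour reproducible in tests.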

Contributing

If you'd like to contribute to this project, feel free to submit a pull request. Any improvements, bug fixes, or feature requests are welcome!

License

This project is licensed under the MIT License - see the LICENSE file for details.
