Willhaben Company Info Spider

Pricing

from $9.00 / 1,000 results

Extract detailed company information from Willhaben, including descriptions, industry types, and social media links. Ideal for market research, competitive analysis, and business automation with flexible configuration and efficient performance....

Rating: 0.0 (0)

Developer: GetDataForMe (Maintained by Community)

Actor stats: 0 bookmarked, 2 total users, 1 monthly active user, last modified 2 days ago

Willhaben Company Info Spider

Introduction

The Willhaben Company Info Spider extracts detailed company information from the jobs section of the Willhaben platform. Given one or more company URLs, it returns one structured record per company, which makes it useful for market research and competitive analysis.

Features

  • Comprehensive Data Extraction: Retrieves extensive details including company descriptions, industry types, employee counts, and social media links.
  • High-Quality Data: Ensures reliable and accurate information directly from the source.
  • Flexible Configuration: Allows customization of URLs and item limits for tailored data collection.
  • Efficient Performance: Optimized to handle multiple requests efficiently within set limits.
  • Easy Integration: Seamlessly integrates with Apify's ecosystem for further processing and analysis.

Input Parameters Table

Parameter | Type | Required | Description | Example
Urls | Array | Yes | The URLs for the spider to scrape. Must be valid HTTP/HTTPS links. | ["https://www.willhaben.at/jobs/firma/example"]
item_limit | Integer | No | Maximum items to scrape per actor run. Set to 0 for no limit. | 10

Example Usage

Input JSON

{
  "Urls": [
    "https://www.willhaben.at/jobs/firma/berger-personal-service-gmbh/331257977"
  ],
  "item_limit": 10
}
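
The constraints from the parameter table can be checked before a run is started. The following is a minimal sketch, not part of the Actor itself; `build_input` is a hypothetical helper name, and the checks only mirror what the table states (HTTP/HTTPS URLs, a non-negative integer limit where 0 means no limit).

```python
from urllib.parse import urlparse


def build_input(urls, item_limit=0):
    """Return an input dict shaped like the example above, or raise ValueError.

    Hypothetical helper: it mirrors the documented constraints only and does
    not check that the URLs actually exist on Willhaben.
    """
    if not urls:
        raise ValueError("Urls is required and must contain at least one URL")
    for url in urls:
        parts = urlparse(url)
        # The parameter table asks for valid HTTP/HTTPS links.
        if parts.scheme not in ("http", "https") or not parts.netloc:
            raise ValueError(f"Not a valid HTTP/HTTPS URL: {url!r}")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(item_limit, bool) or not isinstance(item_limit, int) or item_limit < 0:
        raise ValueError("item_limit must be a non-negative integer (0 means no limit)")
    return {"Urls": list(urls), "item_limit": item_limit}
```

Calling `build_input` with the URL from the example and `item_limit=10` yields the same dictionary as the Input JSON above.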

Output JSON

[
  {
    "url": "https://www.willhaben.at/jobs/firma/berger-personal-service-gmbh/331257977",
    "id": 331257977,
    "title": "BERGER Personal-Service GmbH",
    "slug_title": "berger-personal-service-gmbh",
    "description": "<p>Berger Personal-Service ist seit über 20 Jahren der Spezialist für qualifizierte, regionale Fach- und Hilfskräfte in der Metallbranche. In Kombination mit unserem technischen Büro für Schweißtechnik bieten wir höchste Qualitätsstandards bei unserem Personal.</p>",
    "type": "Personaldienstleister",
    "uid_number": "ATU73579923",
    "website_url": "http://www.bergerpersonal.at",
    "logo_url": "https://www.willhaben.at/jobs/api/v1/images/public/482321502?resolution=480",
    "industry": "Personaldienstleistungen",
    "address_street": "Hauptstraße 2",
    "address_zip": "4552",
    "address_city": "Wartberg an der Krems",
    "address_country": "Österreich",
    "founding_year": 1999,
    "employee_count_from": 201,
    "employee_count_to": 500,
    "social_media_facebook": "https://www.facebook.com/bergerpersonal",
    "social_media_linkedin": null,
    "social_media_xing": null,
    "actor_id": "6w3V7oxrGBK0WTsVY",
    "run_id": "0pseBZQRgA5ezePhN"
  }
]
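
As the example shows, `description` arrives as an HTML fragment and the address is split across four fields. A small post-processing sketch is given here, assuming records shaped like the one above; `description_text` and `postal_address` are hypothetical helper names, and the "street, zip city, country" layout is just one reasonable choice.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML fragment and ignores the tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def description_text(record):
    """Return the record's description as plain text with collapsed whitespace."""
    parser = _TextExtractor()
    parser.feed(record.get("description") or "")
    parser.close()  # flush any text still buffered by the parser
    return " ".join(" ".join(parser.chunks).split())


def postal_address(record):
    """Join the address_* fields into one line, skipping missing parts."""
    zip_city = " ".join(
        part for part in (record.get("address_zip"), record.get("address_city")) if part
    )
    parts = [record.get("address_street"), zip_city, record.get("address_country")]
    return ", ".join(part for part in parts if part)
```

For the example record, `postal_address` gives "Hauptstraße 2, 4552 Wartberg an der Krems, Österreich".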

Use Cases

  • Market Research and Analysis: Gain insights into company profiles for strategic planning.
  • Competitive Intelligence: Monitor competitors' activities and market presence.
  • Employer Monitoring: Track changes in company profiles, such as employee-count ranges and industry categories, across the job market.
  • Content Aggregation: Compile comprehensive datasets for analysis or reporting.
  • Academic Research: Support studies on employment trends and industry dynamics.
  • Business Automation: Automate data collection processes for efficiency.

Installation and Usage

  1. Search for "Willhaben Company Info Spider" in the Apify Store.
  2. Click "Try for free" or "Run".
  3. Configure input parameters as needed.
  4. Click "Start" to begin extraction.
  5. Monitor progress in the log.
  6. Export results in your preferred format (JSON, CSV, Excel).
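
The same steps can also be run from code instead of the Apify Console. The sketch below assumes the official `apify-client` Python package, whose `actor(...).call(run_input=...)` and `dataset(...).iterate_items()` methods are used here; the Actor identifier is not stated in this README, so it is passed in as a parameter, and `run_spider` is a hypothetical helper name.

```python
def run_spider(client, actor_id, urls, item_limit=0):
    """Start one Actor run, wait for it to finish, and return its dataset items.

    `client` is expected to behave like apify_client.ApifyClient.
    `actor_id` is an assumption: copy the real value from the Actor's
    Apify Store page, since this README does not state it.
    """
    run_input = {"Urls": list(urls), "item_limit": item_limit}
    # call() blocks until the run finishes and returns the run's metadata.
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("The Actor run did not return any run metadata")
    # Results are stored in the run's default dataset.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage with the real client (requires `pip install apify-client` and an API token):
#   import os
#   from apify_client import ApifyClient
#   client = ApifyClient(os.environ["APIFY_TOKEN"])
#   items = run_spider(client, "<actor-id-from-store-page>", ["https://..."], 10)
```

Passing the client in as an argument keeps the function easy to exercise with a stub, so the input shape can be checked without spending any paid results.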

Output Format

The output is a JSON array containing objects with detailed company information. Key fields include:

  • url: The URL of the company page that was scraped.
  • id: Unique identifier for the company.
  • title: Company name.
  • description: Detailed description of the company.
  • type: Type of service provided by the company.
  • uid_number: The company's VAT identification number (Austrian UID, e.g. ATU73579923).
  • website_url: Official website link.
  • logo_url: URL to the company's logo image.
  • industry: Industry category.
  • address_street, address_zip, address_city, address_country: Physical address details.
  • founding_year: Year of establishment.
  • employee_count_from, employee_count_to: Lower and upper bounds of the employee-count range.
  • social_media_facebook, social_media_linkedin, social_media_xing: Social media links (null when the company lists none).
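
For downstream processing, it can help to load each record into a typed structure. The following is a sketch under the assumption that records carry the fields listed above; the `Company` class and its method names are hypothetical, and fields other than `id` and `title` are treated as optional because the example shows that some values can be null.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Company:
    """A subset of the output fields, with the social links grouped in a dict."""

    id: int
    title: str
    industry: Optional[str] = None
    uid_number: Optional[str] = None
    employee_count_from: Optional[int] = None
    employee_count_to: Optional[int] = None
    social_links: dict = field(default_factory=dict)

    @classmethod
    def from_record(cls, record):
        prefix = "social_media_"
        # Keep only the networks that have a link (null values are dropped).
        links = {
            key[len(prefix):]: value
            for key, value in record.items()
            if key.startswith(prefix) and value
        }
        return cls(
            id=record["id"],
            title=record["title"],
            industry=record.get("industry"),
            uid_number=record.get("uid_number"),
            employee_count_from=record.get("employee_count_from"),
            employee_count_to=record.get("employee_count_to"),
            social_links=links,
        )

    def employee_range(self):
        """Return the employee range as text, e.g. '201-500'."""
        low, high = self.employee_count_from, self.employee_count_to
        if low is None and high is None:
            return "unknown"
        if high is None:
            return f"{low}+"
        if low is None:
            return f"up to {high}"
        return f"{low}-{high}"
```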

Error Handling

The spider is designed to handle common errors such as invalid URLs or network issues gracefully. If an error occurs, it will be logged for review, and the process will continue with other URLs if possible.
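
Because failed URLs are logged and skipped rather than aborting the run, a finished run may contain fewer records than input URLs. A simple client-side check is sketched below. It assumes that the `url` field of each record matches the input URL apart from a trailing slash, as in the example above; if the platform redirects or rewrites URLs, this comparison would need to be adapted. `urls_without_results` is a hypothetical helper name.

```python
def urls_without_results(input_urls, items):
    """Return the input URLs for which no output record was produced."""
    # Assumption: the record's "url" equals the input URL, ignoring a trailing slash.
    scraped = {(item.get("url") or "").rstrip("/") for item in items}
    return [url for url in input_urls if url.rstrip("/") not in scraped]
```

The returned list can be used to look up the matching log entries or to retry those URLs in a later run.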

Rate Limiting and Best Practices

  • Rate Limiting: Comply with Willhaben's terms of service. The input exposes only Urls and item_limit, so keep request volume down by limiting the number of URLs per run and spacing out runs.
  • Best Practices: Use a reasonable item_limit to avoid overloading the server and ensure efficient data collection.

Limitations and Considerations

  • The spider is limited to extracting data from publicly accessible URLs on Willhaben.
  • Ensure that all input URLs are valid and accessible before running the spider.
  • Be mindful of legal and ethical considerations when scraping data.

Support

For custom/simplified outputs or bug reports, please contact:

We're here to help you get the most out of this Actor!