Doctoralia Brazil Scraper
1 day trial then $20.00/month - No credit card required now
This Apify actor collects doctor listings and ratings from the doctoralia.com.br website. Users can search by specialty and city. The actor extracts data such as doctor ID, review count, overall rating, and latest review date, and presents it in a structured format.
Doctoralia Brazil Scraper
This Apify actor is designed to scrape doctor information from the doctoralia.com.br website.
Features
- Search for doctors by specific specialties
- Filter doctors by city
- Collect basic information such as doctor ID, doctor name, specialty, city, region, review count, latest review date, overall rating, telemedicine availability, and access time
- Optionally collect additional information by enabling detailedAddressInfo:
  - Clinic details (name and location)
  - Complete address information (street, postal code, province)
  - Contact information (phone numbers)
  - Geographic coordinates (latitude/longitude)
  - Access directions
Usage
- Run this actor in the Apify console.
- Provide the desired inputs:
  - specialization: Select the specialties to scrape (multiple selections allowed)
  - city: Select the city to scrape
  - detailedAddressInfo: Enable this option to collect additional information such as clinic details, exact address, phone numbers, and geographic coordinates
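The three inputs above can also be assembled in code. The following is a minimal sketch, not the actor's official client: the field names come from this README, but the value formats (`"cardiologista"`, `"sao-paulo"`) and the assumption that `specialization` is sent as a list are guesses, since the console presents them as selections. The `run_actor` helper and its `actor_id` parameter are hypothetical; it relies on the `apify-client` package and needs a real API token and the actor's ID from its store page.

```python
from typing import Any


def build_run_input(specializations: list[str], city: str,
                    detailed_address_info: bool = False) -> dict[str, Any]:
    """Assemble the actor input from the three documented fields."""
    if not specializations:
        raise ValueError("select at least one specialization")
    if not city:
        raise ValueError("a city is required")
    return {
        # Assumption: multiple selections are passed as a list of strings.
        "specialization": list(specializations),
        "city": city,
        "detailedAddressInfo": detailed_address_info,
    }


def run_actor(token: str, actor_id: str,
              run_input: dict[str, Any]) -> list[dict[str, Any]]:
    """Hypothetical helper: start a run and return its dataset items.

    Requires `pip install apify-client`; imported lazily so that
    build_run_input works without the package installed.
    """
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Placeholder values; check the actor's input form for the accepted options.
run_input = build_run_input(["cardiologista"], "sao-paulo",
                            detailed_address_info=True)
print(run_input)
```

Keeping input construction separate from the network call makes it possible to validate the payload before spending a run on it.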
Output
The actor generates a list of JSON objects containing the following information:
- doctor_id: The unique ID of the doctor
- name: The name of the doctor
- city: The city where the doctor is located
- region: The region where the doctor is located
- specialization: The specialization of the doctor
- review_count: The number of reviews the doctor has received
- overall_rating: The overall rating of the doctor
- newest_review_date: The date of the latest review
- telemedicine: Whether the doctor offers telemedicine services
- url: The URL of the doctor's profile page
- fetch_time: The time when the data was fetched
- clinic_name: The name of the clinic where the doctor works
- province: The state/province where the doctor is located
- post_code: The postal code of the doctor's location
- street: The street address of the doctor's location
- phone_numbers: Contact phone numbers
- directions: Access instructions to the location
- latitude: Geographical latitude of the location (first address)
- longitude: Geographical longitude of the location (first address)
- all_addresses: List of all addresses associated with the doctor, each containing complete location details
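When consuming these fields downstream, it can help to convert them into typed values. The sketch below is one possible approach, not part of the actor: the `DoctorSummary` class and `normalize` function are hypothetical names, and the conversions assume that `review_count` and `overall_rating` arrive as numeric strings and `telemedicine` as "Yes"/"No", which is how the sample record in this README shows them.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class DoctorSummary:
    doctor_id: str
    name: str
    city: str
    specialization: str
    review_count: int
    overall_rating: Optional[float]
    telemedicine: bool


def _to_int(value: Any) -> int:
    """Return an int, or 0 when the value is missing or not an integer."""
    try:
        return int(value)
    except (TypeError, ValueError):
        return 0


def _to_float(value: Any) -> Optional[float]:
    """Return a float, or None when the value is missing or not numeric."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def normalize(item: dict[str, Any]) -> DoctorSummary:
    """Convert one raw output object into a typed summary."""
    return DoctorSummary(
        doctor_id=str(item.get("doctor_id", "")),
        name=item.get("name", ""),
        city=item.get("city", ""),
        specialization=item.get("specialization", ""),
        review_count=_to_int(item.get("review_count")),
        overall_rating=_to_float(item.get("overall_rating")),
        # Assumption: "Yes"/"No" strings, as in the sample record.
        telemedicine=str(item.get("telemedicine", "")).strip().lower() == "yes",
    )


summary = normalize({
    "doctor_id": "7891011", "name": "Dr. Ana Souza", "city": "São Paulo",
    "specialization": "Cardiologia", "review_count": "75",
    "overall_rating": "4.9", "telemedicine": "No",
})
print(summary)
```

Missing or malformed values fall back to `0` or `None` rather than raising, so one incomplete profile does not abort processing of a whole run.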
Example Output
{
  "doctor_id": "7891011",
  "name": "Dr. Ana Souza",
  "city": "São Paulo",
  "region": "Vila Mariana",
  "specialization": "Cardiologia",
  "review_count": "75",
  "overall_rating": "4.9",
  "newest_review_date": "2024-02-05",
  "telemedicine": "No",
  "url": "https://www.doctoralia.com.br/ana-souza/cardiologia/sao-paulo",
  "fetch_time": "2024-02-07T14:30:00",
  "clinic_name": "Centro Médico São Paulo",
  "province": "São Paulo SP",
  "post_code": "04001-000",
  "street": "Rua Vergueiro, 1234",
  "phone_numbers": "+55 11 1234-5678, +55 11 8765-4321",
  "directions": "Próximo ao metrô Vila Mariana",
  "latitude": -23.5868,
  "longitude": -46.6394,
  "all_addresses": [
    {
      "post_code": "04001-000",
      "street": "Rua Vergueiro, 1234",
      "phone_numbers": "+55 11 1234-5678, +55 11 8765-4321",
      "directions": "Próximo ao metrô Vila Mariana",
      "clinic_name": "Centro Médico São Paulo",
      "province": "São Paulo SP",
      "latitude": -23.5868,
      "longitude": -46.6394
    }
  ]
}
This example, filled with made-up data, shows the structure of a single doctor record. The actual output is a list of such objects, one for each doctor scraped.
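Because each record nests its locations under all_addresses, a tabular export needs one row per address. The following sketch shows one way to flatten records into CSV with the standard library. The function names `address_rows` and `to_csv` and the chosen column set are illustrative assumptions, and the sample item simply repeats part of the example above.

```python
import csv
import io
from typing import Any

ADDRESS_FIELDS = ["clinic_name", "street", "post_code", "province",
                  "phone_numbers", "latitude", "longitude"]


def address_rows(item: dict[str, Any]) -> list[dict[str, Any]]:
    """Return one flat row per entry in the record's all_addresses list."""
    rows = []
    for address in item.get("all_addresses") or []:
        row = {"doctor_id": item.get("doctor_id"), "name": item.get("name")}
        for field in ADDRESS_FIELDS:
            row[field] = address.get(field)
        rows.append(row)
    return rows


def to_csv(items: list[dict[str, Any]]) -> str:
    """Serialize all address rows of all records as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["doctor_id", "name"] + ADDRESS_FIELDS)
    writer.writeheader()
    for item in items:
        writer.writerows(address_rows(item))
    return buffer.getvalue()


sample = {
    "doctor_id": "7891011",
    "name": "Dr. Ana Souza",
    "all_addresses": [{
        "post_code": "04001-000",
        "street": "Rua Vergueiro, 1234",
        "phone_numbers": "+55 11 1234-5678, +55 11 8765-4321",
        "clinic_name": "Centro Médico São Paulo",
        "province": "São Paulo SP",
        "latitude": -23.5868,
        "longitude": -46.6394,
    }],
}
rows = address_rows(sample)
csv_text = to_csv([sample])
print(csv_text)
```

Records scraped without detailedAddressInfo may lack all_addresses entirely; in this sketch they simply contribute no rows.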
Notes
- This actor uses the Scrapy framework for web scraping.
- The collected data is stored in the run's default Apify dataset.
- Please ensure compliance with doctoralia.com.br’s terms of use and the rules outlined in its robots.txt file.
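A robots.txt check can be automated with Python's standard library. The sketch below parses a robots.txt body and tests a URL against it. The rules in `SAMPLE_ROBOTS_TXT` are invented for illustration and are not doctoralia.com.br's actual rules; fetch the live file before relying on any result. The `is_allowed` helper and the `my-bot` user agent are hypothetical names. Scrapy also offers a `ROBOTSTXT_OBEY` setting, though this README does not say whether the actor enables it.

```python
from urllib.robotparser import RobotFileParser

# Invented rules for illustration only; NOT the site's real robots.txt.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, url: str, user_agent: str = "my-bot") -> bool:
    """Return True if the given robots.txt body permits fetching the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


profile_ok = is_allowed(
    SAMPLE_ROBOTS_TXT,
    "https://www.doctoralia.com.br/ana-souza/cardiologia/sao-paulo",
)
private_ok = is_allowed(
    SAMPLE_ROBOTS_TXT,
    "https://www.doctoralia.com.br/private/page",
)
print(profile_ok, private_ok)
```

To check the live rules instead, `RobotFileParser` can load them over the network with `set_url(...)` followed by `read()`.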
Actor Metrics
6 monthly users
1 star
98% runs succeeded
Created in Sep 2024
Modified 18 days ago