CBKC Brazil Dog Breed Directory Scraper

Pricing

Pay per event

Scrapes the CBKC (Confederação Brasileira de Cinofilia) breed directory — FCI-recognised breeds with Portuguese names, FCI numbers, groups, countries of origin, and official standard PDF download links. The canonical PT-BR source for all FCI breed standards.

Developer

BowTiedRaccoon

Maintained by Community

Scrapes the complete dog breed directory of the CBKC (Confederação Brasileira de Cinofilia), the official FCI member body for Brazil. Returns structured records for all ~255 breeds, including FCI-recognised breeds, Brazilian-developed breeds, and CBKC-recognised non-FCI breeds. Each record includes the Portuguese breed name, FCI number, group, country of origin, breed category, and a direct URL to the CBKC-hosted official PT-BR breed standard PDF.

CBKC Brazil Dog Breed Directory Scraper Features

  • Covers all ~255 breeds across three categories: FCI-recognised (224), Brazilian-developed (11), and CBKC-only non-FCI (20)
  • Returns FCI metadata — FCI nomenclature number and group for every FCI-registered breed
  • Captures CBKC internal IDs — the cbkc_id attribute that links each breed to its CBKC record
  • Provides standard PDF links — direct URL to the CBKC-hosted PT-BR translation of the FCI breed standard
  • Includes file version timestamps — the file_version field tracks PDF update timestamps for change detection
  • Classifies breed categories: FCI, Brasileira, or Não-FCI, per CBKC's own classification scheme
  • No proxy required — site has no anti-bot measures; runs on standard datacenter connection
  • Completes in under 60 seconds — 12 requests to a fast server, no JavaScript rendering needed

What Can You Do With CBKC Breed Data?

  • AI and NLP training datasets — the canonical PT-BR FCI breed name corpus for training multilingual breed-classification models; the only comprehensive PT-BR source for all FCI groups
  • Veterinary and kennel platforms — localise breed metadata for Brazilian or Portuguese-language platforms (breed names, country of origin, FCI group)
  • Breed-traceability and pedigree services — Whistle/Embark equivalents entering the Brazilian market; link breed DNA results to official CBKC standards
  • Companion scraper to AKC/UKC data — pairs with the AKC Dog Breed Directory Scraper for cross-registry breed matching and gap analysis (AKC ≠ FCI ≠ CBKC)
  • Academic and regulatory research — FCI group/section classification for breed-specific legislation analysis in Brazil and other FCI member countries

How It Works

  1. The scraper paginates through the 10-page CBKC breed listing at cbkc.org/racas — all FCI-recognised breeds are grouped by FCI group across these pages.
  2. Two supplemental pages add the remaining breeds: lista/brasileiras for CBKC-only Brazilian breeds (Grupo 11) and lista/n_fci for non-FCI CBKC-recognised breeds.
  3. Each breed card is parsed for name, FCI standard header, FCI group, country, and the PDF download button's data-file attribute. The data-file encodes both the cbkc_id and file_version.
  4. The PDF URL is assembled as https://cbkc.org/application/views/docs/padroes/padrao-raca_{cbkc_id}.pdf?v={file_version}.
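
The URL pattern in step 4 can be sketched in Python as follows. This is a minimal illustration, not the actor's actual code: the function names build_standard_pdf_url and split_standard_pdf_url are hypothetical, and only the URL pattern itself comes from the description above. The raw format of the data-file attribute is not documented here, so the sketch starts from an already-extracted cbkc_id and file_version.

```python
import re
from urllib.parse import parse_qs, urlsplit

# URL pattern as stated in step 4 of "How It Works".
PDF_URL_TEMPLATE = (
    "https://cbkc.org/application/views/docs/padroes/"
    "padrao-raca_{cbkc_id}.pdf?v={file_version}"
)


def build_standard_pdf_url(cbkc_id: int, file_version: str) -> str:
    """Fill the URL pattern with a breed's CBKC ID and file version."""
    return PDF_URL_TEMPLATE.format(cbkc_id=cbkc_id, file_version=file_version)


def split_standard_pdf_url(url: str) -> tuple[int, str]:
    """Recover (cbkc_id, file_version) from a URL that follows the pattern."""
    parts = urlsplit(url)
    match = re.search(r"padrao-raca_(\d+)\.pdf$", parts.path)
    versions = parse_qs(parts.query).get("v", [])
    if match is None or not versions:
        raise ValueError(f"not a CBKC standard PDF URL: {url}")
    return int(match.group(1)), versions[0]


# Values taken from the sample record in the Output section.
url = build_standard_pdf_url(1, "1723752061")
print(url)
print(split_standard_pdf_url(url))
```

Keeping file_version as a string mirrors the sample output record, where it is quoted rather than numeric.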

Input

| Field | Type | Description | Default |
|---|---|---|---|
| maxItems | integer | Maximum number of breed records to return. Leave blank to retrieve all breeds. | |
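
As a rough sketch of how a caller might prepare this input, the helper below builds the JSON payload and omits maxItems when no limit is wanted. The helper name build_run_input is hypothetical, and treating an omitted field as equivalent to "blank" is an assumption; only the maxItems field itself comes from the table above.

```python
import json
from typing import Optional


def build_run_input(max_items: Optional[int] = None) -> dict:
    """Build an input payload; leave maxItems out to request all breeds."""
    if max_items is None:
        # Assumption: omitting the field behaves like leaving it blank.
        return {}
    if isinstance(max_items, bool) or not isinstance(max_items, int) or max_items < 1:
        raise ValueError("maxItems must be a positive integer")
    return {"maxItems": max_items}


print(json.dumps(build_run_input(25)))
print(json.dumps(build_run_input()))
```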

Output

Each dataset item represents one CBKC-registered breed:

{
  "breed_name_pt": "Fila Brasileiro",
  "breed_name_fci": "Fila Brasileiro",
  "fci_number": 225,
  "fci_group": 2,
  "fci_section": null,
  "cbkc_id": 1,
  "country_of_origin": "Brasil",
  "breed_category": "Brasileira",
  "standard_pdf_url": "https://cbkc.org/application/views/docs/padroes/padrao-raca_1.pdf?v=1723752061",
  "standard_pdf_filename": "padrao-raca_1.pdf",
  "file_version": "1723752061",
  "image_url": null,
  "description_pt": null,
  "scraped_at": "2026-06-12T00:18:03.055Z"
}
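
Since file_version is described as a PDF update timestamp for change detection, one way to use it is to compare two runs keyed by cbkc_id. The sketch below is illustrative only: the function names are hypothetical, the second run's version value is made up for the example, and reading file_version as Unix epoch seconds is an assumption based on the sample value rather than something the listing states.

```python
from datetime import datetime, timezone


def index_versions(records):
    """Map cbkc_id -> file_version for records that carry a PDF version."""
    return {r["cbkc_id"]: r["file_version"] for r in records if r.get("file_version")}


def changed_standards(previous, current):
    """Return cbkc_ids whose file_version is new or differs from the previous run."""
    old = index_versions(previous)
    new = index_versions(current)
    return sorted(cid for cid, version in new.items() if old.get(cid) != version)


def version_as_datetime(file_version):
    """Interpret file_version as Unix epoch seconds (an assumption)."""
    return datetime.fromtimestamp(int(file_version), tz=timezone.utc)


# First run: values from the sample record above.
previous_run = [{"cbkc_id": 1, "file_version": "1723752061"}]
# Hypothetical later run: this version value is invented for illustration.
current_run = [{"cbkc_id": 1, "file_version": "1723752062"}]

print(changed_standards(previous_run, current_run))  # [1]
print(version_as_datetime("1723752061").isoformat())
```

Only the standards flagged by changed_standards would need their PDFs downloaded again, which keeps repeat runs cheap.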