Pricing

from $2.00 / 1,000 results

Doing Good Leeds Scraper

Scrape paid jobs, volunteering, events, and training from doinggoodleeds.org.uk via WP-JSON. Pick any subset of 4 entity types. ~178 entities total. Title, employer, location, salary, apply email/URL, full description HTML inline per row. JSON or CSV out, billed per result.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

Muhamed Didovic

Actor stats

Bookmarked

Total users

Monthly active users

5.5 hours

Issues response

4 days ago

Last modified

How it works

How Doing Good Leeds Scraper works

✨ Why use this scraper?

Tracking Leeds' voluntary-sector hiring? Mapping volunteer opportunities for placement programmes? Building a community events calendar? Cataloguing the training courses charities are funding?

🎯 Four entity types in one actor. job-listings (paid jobs, ~31), volunteers (volunteering opportunities, ~23), event (events, ~34), training-course (training courses, ~90) — pick any subset.
⚡ WP-JSON REST API as the data source. Each entity is a WordPress custom post type with its own /wp-json/wp/v2/<cpt> endpoint.
🏷️ Custom location taxonomy. WP Job Manager's _job_location meta is usually empty on Doing Good Leeds — we fall back to their custom location taxonomy (Leeds, regional Yorkshire, etc.).
📧 Apply email / URL captured. _application meta is split into applyEmail vs externalApplyUrl automatically.
🌟 Cloudflare-friendly. Only the passive __cf_bm cookie is enforced — any sane UA passes without proxy.
📤 Clean exports. One row per entity, all fields inline. JSON + CSV exported automatically.

🎯 Use cases

Team	What they build
Voluntary sector recruiters	Daily Leeds nonprofit hiring feeds
Volunteer co-ordinators	Opportunity mapping for student placement programmes
Community events platforms	Pull events into a unified Leeds calendar
Training providers	Track what courses other charities are running / funding
Workforce strategy	Leeds third-sector pay benchmarks
Researchers	Leeds civil-society datasets (jobs + volunteers + events + training)

📥 Supported inputs

URL pattern	Behaviour
`https://doinggoodleeds.org.uk/jobs/` etc.	Listings for any CPT (the actor picks based on `entityTypes`)
`https://doinggoodleeds.org.uk/job/<slug>/`	Single paid job
`https://doinggoodleeds.org.uk/volunteer/<slug>/`	Single volunteer opportunity
`https://doinggoodleeds.org.uk/event/<slug>/`	Single event
`https://doinggoodleeds.org.uk/training-course/<slug>/`	Single training course
`https://doinggoodleeds.org.uk/wp-json/wp/v2/{job-listings\|volunteers\|event\|training_course}`	WP-JSON endpoint

Leave startUrls empty + pick entityTypes to scrape every entity of those types.

Not supported: mixing entity types in a single dataset row (each row is one CPT); hosts outside doinggoodleeds.org.uk.

🔄 How it works

Resolve start URLs — either from explicit startUrls, or from entityTypes (default ["job-listings"]).
Classify + translate each URL into the canonical /wp-json/wp/v2/<cpt> shape — tagging it with which CPT it represents.
Walk pagination via X-WP-TotalPages from the response header.
Parse each WP-JSON item — title, content HTML, WP Job Manager meta (where present), _embed taxonomies (categories, types, location).
Push one normalised row per entity to the dataset, tagged with the source CPT via the cpt field.

⚙️ Input parameters

Parameter	Type	Default	Description
`startUrls`	array	`["https://doinggoodleeds.org.uk/wp-json/wp/v2/job-listings"]`	Browser URLs, single-detail URLs, or WP-JSON endpoints.
`entityTypes`	array	`["job-listings"]`	Used when `startUrls` is empty. Allowed values: `job-listings`, `volunteers`, `event`, `training-course`.
`enrichTaxonomies`	boolean	`true`	When `true`, embeds taxonomy term names + featured image via WP-JSON `_embed`.
`postedWithinHours`	integer	(none)	Only return rows posted in the last N hours (24 = last day, 72 = last 3 days). Empty/0 = all. Ideal for daily monitoring runs that only want fresh postings.
`maxItems`	integer	`1000`	Hard cap on rows pushed (~178 total across all CPTs).
`maxConcurrency` / `minConcurrency`	integer	`5` / `1`	Parallel WP-JSON page-fetch limits.
`maxRequestRetries`	integer	`5`	Retries before a failed request is given up.
`proxy`	object	No proxy	Cloudflare lets us through without a proxy.

📊 Output overview

Each entity is one single dataset row. The type field tells you what entity it is (job, volunteer, event, training), and the cpt field carries the raw CPT slug.

📦 Output sample

{
  "type": "job",
  "cpt": "job-listings",
  "source": "doinggoodleeds.org.uk",
  "jobId": "111130",
  "slug": "young-adults-worker-3",
  "jobUrl": "https://doinggoodleeds.org.uk/job/young-adults-worker-3/",
  "wpJsonUrl": "https://doinggoodleeds.org.uk/wp-json/wp/v2/job-listings/111130",
  "title": "Young Adults Worker",
  "description": "<p>Young Adults Worker role at Waythrough…</p>",
  "descriptionText": "Young Adults Worker role at Waythrough…",
  "companyName": null,
  "companyWebsite": null,
  "companyDomain": null,
  "location": "Leeds",
  "locations": ["Leeds"],
  "remote": false,
  "salary": null,
  "categories": [],
  "employmentTypes": ["Full Time"],
  "contractType": "Full Time",
  "featured": false,
  "filled": false,
  "status": "publish",
  "postedDate": "2026-04-23T10:01:46Z",
  "modifiedDate": "2026-04-23T10:01:46Z",
  "applyType": "email",
  "applyUrl": "https://doinggoodleeds.org.uk/job/young-adults-worker-3/",
  "applyEmail": "olivia.hodgson@waythrough.org.uk",
  "externalApplyUrl": null,
  "featuredImageUrl": null,
  "authorId": 1,
  "authorName": null,
  "scrapedAt": "2026-05-20T00:13:00.000Z"
}

🗂 Key output fields

Group	Fields
Identifiers	`type`, `cpt` (`job-listings` / `volunteers` / `event` / `training-course`), `source`, `jobId`, `slug`, `jobUrl`, `wpJsonUrl`, `scrapedAt`
Content	`title`, `description` (HTML), `descriptionText` (plain)
Dates	`postedDate` (ISO), `modifiedDate` (ISO)
Employer	`companyName` (often null), `companyWebsite`, `companyDomain`, `companyTagline`, `featuredImageUrl`
Location	`location` (primary), `locations[]` (all taxonomy terms), `remote`
Compensation	`salary.{currency, min, max, unit, raw}` (when present in WP Job Manager meta)
Taxonomies	`categories[]`, `employmentTypes[]`, `contractType`
Flags	`featured`, `filled`, `status`
Apply flow	`applyType`, `applyUrl`, `applyEmail`, `externalApplyUrl`

❓ FAQ

Can I scrape all four entity types in one run? Yes. Set entityTypes to ["job-listings", "volunteers", "event", "training-course"]. Each row will have a cpt field indicating which collection it came from.

Why are some salaries empty? WP Job Manager's _job_salary meta isn't always populated for voluntary-sector roles. Look at the description HTML for compensation info when salary is null.

Why is companyName often null? Doing Good Leeds doesn't enforce the _company_name meta field. Org name is usually in the description or apply email domain — pull it from there if needed.

Can I scrape private pages or applicant data? No. Only the public WP-JSON REST API.

How do I limit results? Set maxItems. Run with ["training-course"] alone if you only want the 90 courses.

💬 Support

For issues or feature requests, please use the Issues tab on the actor's Apify Console page.
Author's website: https://muhamed-didovic.github.io/
Email: muhamed.didovic@gmail.com

🛠 Additional services

Custom output shape, additional fields, or one-off datasets: muhamed.didovic@gmail.com
Similar scrapers for other regional CVS / volunteer hubs: drop an email.
For API access (no Apify fee, just usage): muhamed.didovic@gmail.com

🔎 Explore more scrapers

See other scrapers at memo23's Apify profile — covering job boards, real estate, social media, and more.

⚠️ Disclaimer

This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Doing Good Leeds, Voluntary Action Leeds (VAL), or any of their subsidiaries or affiliates. All trademarks mentioned are the property of their respective owners.

The scraper accesses only the publicly available WP-JSON REST endpoints and public detail pages on doinggoodleeds.org.uk — no authenticated endpoints, recruiter-only features, or content behind a login. Users are responsible for ensuring their use complies with doinggoodleeds.org.uk's Terms of Service, applicable data-protection law (GDPR, CCPA, etc.), and any contractual obligations of their own organisation.

SEO Keywords

doing good leeds scraper, scrape doinggoodleeds.org.uk, doing good leeds api, leeds volunteer scraper, leeds charity jobs scraper, leeds nonprofit jobs api, leeds volunteer opportunities api, leeds community events scraper, leeds training courses scraper, voluntary action leeds scraper, Apify doing good leeds, leeds third sector jobs, leeds civil society data, yorkshire volunteer hub api, wp-json scraper, wp job manager scraper, charityjob alternative scraper, vassheffield alternative scraper, barnsleycvs alternative scraper, uk cvs jobs scraper, leeds nonprofit recruitment data

Doing Good Leeds Scraper

parsebird/doinggoodleeds-scraper

Scrape paid jobs, volunteering, events, and training courses from doinggoodleeds.org.uk via WP-JSON. Four entity types, ~178 items total. Title, employer, location, salary, apply email/URL, full description HTML. Export as JSON, CSV, Excel.

ParseBird

Doing Good Leeds Scraper

maximedupre/doing-good-leeds-scraper

Scrape Doing Good Leeds jobs, volunteer opportunities, events, and training courses. Export titles, descriptions, dates, locations, action links, source URLs, and scrape times to Apify datasets, API, schedules, and integrations.

Maxime Dupré

Community First Yorkshire Jobs Scraper

memo23/cfy-scraper

Scrape jobs and other portfolio content from communityfirstyorkshire.org.uk via WP-JSON portfolio CPT. Filter by taxonomy (default jobs ≈ 6 vacancies). Title, full HTML, location, apply email/URL, best-effort closing date + salary regex. JSON or CSV out.

Muhamed Didovic

VAS Sheffield Jobs Scraper

memo23/vassheffield-scraper

Scrape voluntary-sector vacancies from jobs.vas.org.uk (Voluntary Action Sheffield) via WP-JSON. Title, sector taxonomy, posted date, external apply URL extracted from listing body, full description. ~25 live vacancies in one request. JSON or CSV out, billed per result

Muhamed Didovic

Highland Jobs (.scot) Scraper (/w EMAILS)

memo23/highlandjobs-scraper

crape every Scottish Highlands & Islands vacancy from highlandjobs.scot via the public WP-JSON API. Title, employer, location, salary (parsed), categories, posted date, full description HTML, apply email/URL. 204 jobs in ~3 requests. JSON or CSV out.

Muhamed Didovic

s1jobs.com Scraper (/w EMAILS)

memo23/s1jobs-scraper

Scrape Scottish job postings (all sectors) from s1jobs.com — title, salary, employer, location, real lat/lng coordinates, posted/closing dates, full description, structured skills, and the actual recruiter apply URL or apply email. Works with any listing or vacancy URL. JSON or CSV out.

Muhamed Didovic

Goodmoves.org Jobs Scraper

memo23/goodmoves-scraper

Scrape Scottish charity-sector job postings from Goodmoves.org — title, salary, workplace type, employer, posted/closing dates with time, full description, real lat/lng coordinates, OSCR charity number and apply email or apply URL. Works with any listing or vacancy URL. JSON or CSV out.

Muhamed Didovic

Google Jobs Scraper - Listings, Salary, Employer & Apply Links

santhej/google-jobs-scraper

Scrape Google Jobs at scale: job title, employer, location, salary, contract type, posting date & source/apply links. Any role or city, 190+ countries. Clean JSON/CSV for job boards, recruiters & market research. No API keys.

Santhej Kallada

VA Rotherham Jobs Scraper

memo23/varotherham-scraper

Scrape the varotherham.org.uk South Yorkshire voluntary-sector job board (Wix CMS). One HTTP request, every job inline: title, employer, location, closing date. Rotherham / Barnsley / Doncaster / Sheffield charities. JSON or CSV out, billed per result.

Muhamed Didovic

Snicket Jobs Scraper

memo23/snicket-scraper

Scrape snicket.org — Bradford and West Yorkshire community-sector vacancies. RSS + labelled detail-page extraction: title, organisation, salary, hours, closing date, payment schedule, contact name/email/phone, full HTML description. JSON or CSV out, billed per result

Muhamed Didovic