Himalayas Remote Jobs Scraper

Pricing

from $2.10 / 1,000 results

Scrape remote job listings from Himalayas (himalayas.app), a board with 100,000+ remote jobs. Filter by keyword, category, seniority and employment type. Extract title, company, salary range, location restrictions, categories and apply links.

Rating

0.0

(0)

Developer

Logiover

Maintained by Community

Actor stats

0

Bookmarked

225

Total users

53

Monthly active users

2 days ago

Last modified

๐Ÿ”๏ธ Himalayas Remote Jobs Scraper โ€” 100,000+ Remote Tech Jobs to JSON/CSV/Excel

Pull a clean, structured feed of every remote job listed on Himalayas, one of the largest dedicated remote-work boards, with 100,000+ live remote roles across engineering, design, product, marketing, sales, support, data, operations and more. Each record carries title, company, salary range, location restrictions, seniority, employment type, categories and apply link, fetched straight from the official Himalayas API: fully paginated, no login, no API key.

Built for recruiters, HR-tech teams, job aggregators, labor-market researchers, talent-intelligence platforms and personal job-hunt automations. Cost-optimised by default: the heavy description field is opt-in, and a maxPagesToScan ceiling stops runaway scans when filters are narrow.

No login. No API key. No proxy. No headless browser. Pure public HTTPS.


What this scraper does

Himalayas indexes tens of thousands of remote-only jobs from companies that hire worldwide, regionally (US-only, EU-only, LATAM) or within a specific timezone window. This Apify Actor talks to Himalayas's public jobs API directly (the same endpoint that powers the site), walks the entire feed page by page, applies your filters (keyword, category, seniority, employment type) and writes one flat JSON record per matching job to the Apify Dataset.

Pull a quick 100-job sample, a focused 500-job recruiter slice, or the full 100k+ feed for an aggregator backend. Either way you get the same clean schema, ready for JSON, CSV, Excel, Google Sheets, Postgres, BigQuery, Snowflake or your ATS.


Key features

| Feature | What it gives you |
| --- | --- |
| Official Himalayas API | Talks to the same JSON endpoint as the website: fast, stable, fully paginated |
| 100,000+ remote jobs | Engineering, design, marketing, sales, support, data, ops, finance, writing, legal, HR, product |
| Keyword search | searchKeyword matches against title, excerpt and company name |
| Category filter | Multi-value categories array matched against each job's categories and parent categories |
| Seniority filter | Filter by Entry-level, Mid-level, Senior, Lead, Executive, Internship |
| Employment type filter | Narrow to Full Time, Part Time, Contract, Freelance or Internship |
| Salary data | Min, max and currency captured for every job that publishes a range |
| Location & timezone restrictions | Worldwide, US, EU, LATAM, UTC ± 3, etc. |
| Cost-optimised by default | Heavy description field is opt-in; 200-char excerpt always included |
| maxPagesToScan cost ceiling | Hard cap on pages; prevents runaway scans when filters are narrow |
| Export-ready | Flat JSON records, exportable as JSON, CSV, Excel, XML, HTML or RSS via the Apify Dataset |
| Schedule-friendly | Daily/hourly recurring runs to keep your feed fresh |

Built for these use cases

1. Recruiters & talent sourcing

Maintain a continuously fresh feed of worldwide remote roles in your niche. Filter by category and seniority once, schedule the Actor every morning, and wake up to a deduplicated list of new remote engineering, design or PM roles, with apply links and salary bands already pulled.

2. HR analytics & workforce dashboards

Pipe listings into Looker, Metabase, Tableau or Power BI for a live picture of remote-hiring demand by category, seniority and salary band. Track week-over-week movement, spot rising sub-niches, benchmark against your own openings.

3. Job market & salary benchmarking

Structured minSalary, maxSalary and currency on a large share of listings. Aggregate by categories × seniority for one of the cleanest public salary datasets for remote tech work, with no proprietary compensation feed needed.
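The categories × seniority aggregation can be sketched in a few lines of Python. This is a minimal illustration that assumes the records have already been exported from the dataset as a list of dicts using the output field names; the function name `salary_benchmark` and the choice of the range midpoint and median are assumptions for the example, not something the Actor does itself.

```python
from collections import defaultdict
from statistics import median


def salary_benchmark(jobs, currency="USD"):
    """Median salary midpoint per (category, seniority) pair, one currency at a time."""
    buckets = defaultdict(list)
    for job in jobs:
        low, high = job.get("minSalary"), job.get("maxSalary")
        # Skip jobs without a published range or in another currency.
        if low is None or high is None or job.get("currency") != currency:
            continue
        midpoint = (low + high) / 2
        for category in job.get("categories") or []:
            for level in job.get("seniority") or []:
                buckets[(category, level)].append(midpoint)
    return {key: median(values) for key, values in buckets.items()}


# Hypothetical sample rows, shaped like the output records.
sample = [
    {"categories": ["Backend"], "seniority": ["Senior"],
     "minSalary": 130000, "maxSalary": 180000, "currency": "USD"},
    {"categories": ["Backend"], "seniority": ["Senior"],
     "minSalary": 100000, "maxSalary": 140000, "currency": "USD"},
    {"categories": ["Backend"], "seniority": ["Senior"],
     "minSalary": 90000, "maxSalary": 110000, "currency": "EUR"},
]
benchmark = salary_benchmark(sample)
print(benchmark)  # {('Backend', 'Senior'): 137500.0}
```

Keeping currencies separate avoids mixing USD and EUR figures; converting between them would need an exchange-rate source that is outside the scope of this sketch.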

4. Job board & aggregator products

Power a remote-jobs section, a niche category page ("Remote Solidity Jobs"), or a full job-board MVP using Himalayas as your upstream feed. Cross-link the apply URL, store excerpt and tags, refresh nightly.

5. Workforce planning for distributed teams

For remote-first companies, monitor the supply side: how many comparable senior backend roles are live, what salary bands are being offered worldwide vs. US-only, what timezones companies want. Better TA budgeting and offer decisions.

6. Academic labor market research

Track structural shifts in remote work: full-time vs. contract share, regional-restriction trends, category mix. The postedAt timestamps and locationRestrictions arrays are researcher-friendly.

7. Lead generation: companies actively hiring

A company hiring 3+ senior remote engineers right now is a high-signal target for ATS vendors, dev-tools, EOR/PEO providers (Deel, Remote.com), payroll, equipment, learning platforms and benefits brokers. Aggregate by company, sort by openings, hand to outbound.
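As a rough sketch of the "aggregate by company, sort by openings" step, the snippet below counts senior openings per company from exported records. The function name `hiring_leads` and its parameters are hypothetical; only the `company` and `seniority` field names come from the output schema.

```python
from collections import Counter


def hiring_leads(jobs, level="Senior", min_openings=3):
    """Companies with at least `min_openings` live roles at the given seniority."""
    counts = Counter(
        job["company"]
        for job in jobs
        if job.get("company") and level in (job.get("seniority") or [])
    )
    # most_common() returns companies sorted by number of openings, descending.
    return [(company, n) for company, n in counts.most_common() if n >= min_openings]


# Hypothetical sample rows.
sample = (
    [{"company": "Acme Cloud", "seniority": ["Senior"]}] * 3
    + [{"company": "Beta Labs", "seniority": ["Senior"]}]
    + [{"company": "Gamma", "seniority": ["Mid-level"]}] * 4
)
leads = hiring_leads(sample)
print(leads)  # [('Acme Cloud', 3)]
```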

8. Personal job-hunt automation

Run a tightly-filtered nightly scan (React, Senior, Worldwide) and pipe new matches into Slack, Discord, email or Notion. Apply within hours of posting when response rates are highest.


Inputs

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| searchKeyword | string | No | Free-text matched against job title, excerpt and company. Examples: react, data scientist, solidity, fintech. Empty = no keyword filter. |
| categories | string[] | No | Filter by category. Common values: Engineering, Software Development, Design, Marketing, Sales, Product, Customer Success, Operations, Finance, Data, Human Resources, Writing, Legal. Empty = all. |
| seniority | string[] | No | Entry-level, Mid-level, Senior, Lead, Executive, Internship. Empty = all. |
| employmentType | string | No | Full Time, Part Time, Contract, Freelance, Internship. Empty = all. |
| maxJobs | integer | No | Max matching jobs to save. 100 = quick sample, 500 = focused dataset, 2000 = large, 0 = unlimited. Default 200. |
| maxPagesToScan | integer | No | Hard cost ceiling on pages (20 jobs/page). 50 ≈ 1,000 jobs scanned, 200 ≈ 4,000, 500 ≈ 10,000. 0 disables the ceiling (use only when you want the entire 100k+ feed). Default 100. |
| includeDescription | boolean | No | When true, includes the full HTML description (1–7 KB per job). excerpt (200 chars) is always included. Default false. |
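The relationship between maxPagesToScan and the number of jobs scanned is plain arithmetic at 20 jobs per page. The helper names below are hypothetical and only illustrate the numbers quoted in the table.

```python
import math
from typing import Optional

JOBS_PER_PAGE = 20  # page size stated in the inputs table


def jobs_scanned(max_pages_to_scan: int) -> Optional[int]:
    """Upper bound on jobs scanned; None means the ceiling is disabled (0)."""
    if max_pages_to_scan == 0:
        return None
    return max_pages_to_scan * JOBS_PER_PAGE


def min_pages_for(max_jobs: int) -> int:
    """Fewest pages that could yield max_jobs, if every scanned job matched."""
    return math.ceil(max_jobs / JOBS_PER_PAGE)


print(jobs_scanned(50), jobs_scanned(200), jobs_scanned(500))  # 1000 4000 10000
print(min_pages_for(100))  # 5
```

Note that maxJobs counts matching jobs while maxPagesToScan counts scanned pages, so with narrow filters the page ceiling is likely to be reached first.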

Example inputs

Quick 100-job sample across all categories:

{ "maxJobs": 100, "maxPagesToScan": 5 }

Senior React engineers, worldwide remote:

{
  "searchKeyword": "react",
  "categories": ["Engineering", "Software Development"],
  "seniority": ["Senior", "Lead"],
  "employmentType": "Full Time",
  "maxJobs": 500
}

Full board crawl with descriptions (research / backfill):

{ "maxJobs": 0, "maxPagesToScan": 0, "includeDescription": true }

Daily incremental for a design-roles dashboard:

{
  "categories": ["Design"],
  "seniority": ["Mid-level", "Senior"],
  "maxJobs": 200,
  "maxPagesToScan": 50
}

Output

Each item is one Himalayas remote job, flat and ready to export.

Example output record

{
  "title": "Senior Backend Engineer (Remote, Worldwide)",
  "company": "Acme Cloud",
  "companySlug": "acme-cloud",
  "companyLogo": "https://himalayas.app/companies/acme-cloud/logo.png",
  "employmentType": "Full Time",
  "seniority": ["Senior"],
  "categories": ["Software Development", "Backend"],
  "parentCategories": ["Engineering"],
  "minSalary": 130000,
  "maxSalary": 180000,
  "currency": "USD",
  "locationRestrictions": ["Worldwide"],
  "timezoneRestrictions": ["UTC ± 4"],
  "excerpt": "We're hiring a senior backend engineer to scale our distributed event-pipeline platform...",
  "url": "https://himalayas.app/companies/acme-cloud/jobs/senior-backend-engineer",
  "postedAt": "2026-05-14T09:12:00.000Z",
  "expiresAt": "2026-06-13T09:12:00.000Z",
  "guid": "himalayas-987654",
  "scrapedAt": "2026-05-15T08:30:00.000Z"
}

Field reference

| Field | Type | Description |
| --- | --- | --- |
| title | string | Job title |
| company | string | Hiring company name |
| companySlug | string | Himalayas URL slug for the company |
| companyLogo | string | Logo image URL |
| employmentType | string | Full Time, Part Time, Contract, Freelance, Internship |
| seniority | string[] | Entry-level, Mid-level, Senior, Lead, Executive |
| categories | string[] | Specific categories (Backend, Frontend, Mobile, etc.) |
| parentCategories | string[] | Parent groupings (Engineering, Design, etc.) |
| minSalary | number | Lower bound of published salary range |
| maxSalary | number | Upper bound of published salary range |
| currency | string | ISO currency code (USD, EUR, GBP) |
| locationRestrictions | string[] | Worldwide, US, EU, LATAM, Americas |
| timezoneRestrictions | string[] | UTC ± 3, Americas timezones, etc. |
| excerpt | string | 200-char summary, always included |
| description | string | Full HTML body, only when includeDescription = true |
| url | string | Direct apply / listing URL |
| postedAt | string (ISO 8601) | Original posting date |
| expiresAt | string (ISO 8601) | Listing expiry, when published |
| guid | string | Stable Himalayas job identifier (dedupe key) |
| scrapedAt | string (ISO 8601) | When this run captured the job |

โš™๏ธ How it works

  1. Initialise: read input, normalise filters to lowercase, log run config.
  2. Paginate: request https://himalayas.app/jobs/api?limit=20&offset=N page by page; Himalayas hard-caps each page at 20 jobs.
  3. Track totals: capture totalCount from the first response for progress logs.
  4. Parse: map raw records to the canonical schema. Unix-second timestamps → ISO 8601.
  5. Filter client-side: keyword (title + excerpt + company), categories, seniority, employment type. Substring matching, case-insensitive.
  6. Deduplicate: guid or url based; the scraper keeps a Set so overlapping pages never write the same job twice.
  7. Push to dataset: every matching job becomes one row.
  8. Stop conditions: whichever comes first of maxJobs saved, maxPagesToScan reached, end-of-feed (totalCount), 3 consecutive network errors, persistent HTTP 429, or HARD_CEILING (5,500 pages).
  9. Backoff: on HTTP 429, wait 2 → 4 → 8 → 15 s, then give up after 4 retries.
  10. Finalise: emit summary and exit cleanly.
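The paginate, filter, deduplicate and stop steps above can be sketched roughly as follows. This is not the Actor's source code: `fetch_page` is a hypothetical callable standing in for the HTTP request (it receives an offset and returns a list of already-parsed job dicts), and the matching rules are one plausible reading of "substring matching, case-insensitive".

```python
from typing import Callable, List, Sequence

PAGE_SIZE = 20  # Himalayas page size quoted in step 2


def _contains_any(wanted: Sequence[str], values: Sequence[str]) -> bool:
    """True if any wanted term is a case-insensitive substring of any value."""
    lowered = [v.lower() for v in values]
    return any(w.lower() in v for w in wanted for v in lowered)


def matches(job: dict, keyword: str = "", categories: Sequence[str] = (),
            seniority: Sequence[str] = (), employment_type: str = "") -> bool:
    """Step 5: client-side filters; an empty filter matches everything."""
    text = " ".join(str(job.get(k) or "") for k in ("title", "excerpt", "company"))
    if keyword and keyword.lower() not in text.lower():
        return False
    cats = (job.get("categories") or []) + (job.get("parentCategories") or [])
    if categories and not _contains_any(categories, cats):
        return False
    if seniority and not _contains_any(seniority, job.get("seniority") or []):
        return False
    if employment_type and not _contains_any(
            [employment_type], [job.get("employmentType") or ""]):
        return False
    return True


def scan(fetch_page: Callable[[int], List[dict]], max_jobs: int = 200,
         max_pages: int = 100, **filters) -> List[dict]:
    """Walk the feed page by page until a stop condition is hit (0 = no limit)."""
    seen, saved, page = set(), [], 0
    while not (max_pages and page >= max_pages):
        jobs = fetch_page(page * PAGE_SIZE)
        if not jobs:  # end of feed
            break
        page += 1
        for job in jobs:
            key = job.get("guid") or job.get("url")  # step 6: dedupe key
            if (key is not None and key in seen) or not matches(job, **filters):
                continue
            seen.add(key)
            saved.append(job)
            if max_jobs and len(saved) >= max_jobs:
                return saved
    return saved


# Hypothetical two-page feed with one overlapping record ("a").
feed = {
    0: [{"guid": "a", "title": "Senior React Engineer", "company": "Acme", "seniority": ["Senior"]},
        {"guid": "b", "title": "Product Designer", "company": "Acme", "seniority": ["Mid-level"]}],
    20: [{"guid": "a", "title": "Senior React Engineer", "company": "Acme", "seniority": ["Senior"]},
         {"guid": "c", "title": "React Native Developer", "company": "Beta", "seniority": ["Mid-level"]}],
}
result = scan(lambda offset: feed.get(offset, []), keyword="react")
print([job["guid"] for job in result])  # ['a', 'c']
```

Passing the fetcher in as a callable keeps the loop testable without network access; the network-error and HTTP 429 stop conditions from step 8 are left out of this sketch.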

Performance

Pure HTTP, 20 jobs per page, ~50 ms courtesy delay between pages.

| Run size | Pages | Approx. time | Notes |
| --- | --- | --- | --- |
| 100 jobs | ~5 | < 30 s | Quick demo / QA test |
| 500 jobs | ~25 | 1–2 min | Recruiter daily slice |
| 2,000 jobs | ~100 | 4–6 min | Full-category nightly run |
| 10,000 jobs | ~500 | 20–30 min | Weekly aggregator backfill |
| Full feed (~100k) | ~5,000 | 3–4 h | One-off historical pull |

includeDescription = true increases dataset bytes 5–10× but does not change scraping time.


Cost model

Standard Apify Actor pricing: you pay only for the compute and dataset storage consumed. No per-result surcharge, no third-party API key. Compute is tiny for filtered runs and moderate for full-feed sweeps; dataset bytes are small with includeDescription = false and much larger when it is enabled. No proxy spend (pure HTTPS against a public API). Defaults are conservative, so loosen them deliberately when you need a bigger pull.


Schedule for continuous monitoring

Apify Schedules run this Actor on any cron interval. Typical patterns:

  • Daily recruiter feed: narrow filter, maxJobs: 200, morning run, append to Google Sheets.
  • Hourly fast-mover scan: narrow keyword, maxJobs: 50, Slack webhook on new rows.
  • Weekly market snapshot: broad filter, maxJobs: 5000, Monday 03:00 UTC, push to Postgres/BigQuery.
  • Quarterly research dump: unlimited, includeDescription: true, archive to S3.

Combine with Apify Webhooks to notify your ATS, CRM or Slack when a run finishes.


๐Ÿ› ๏ธ FAQ

Is it legal to scrape Himalayas? The Actor reads Himalayas's public jobs API and only collects publicly visible data. Comply with Himalayas's terms and applicable laws (GDPR, CCPA). Don't republish full descriptions verbatim without permission.

Do I need a Himalayas account or API key? No, only an Apify account.

How many jobs can I get in one run? The feed is 100,000+ and the scraper can pull all of them (maxJobs: 0, maxPagesToScan: 0). A full sweep takes ~3–4 hours; most use cases need 200–2,000 jobs.

Why is description off by default? It's the biggest cost driver: 1–7 KB per job vs. ~1 KB for everything else. Disabling it cuts dataset bytes 5–10× without losing title, company, salary, categories, seniority, restrictions, 200-char excerpt or apply URL.

Will it get blocked or rate-limited? Himalayas's public API tolerates the 50 ms-per-page pace. On the rare HTTP 429, the scraper backs off (2 → 4 → 8 → 15 s) and retries up to 4 times before stopping cleanly.
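A minimal sketch of that backoff schedule, assuming a hypothetical `request` callable that returns a `(status, body)` pair. The Actor's real retry code is not shown in this README, so treat the structure as illustrative; only the delays and the retry count come from the answer above.

```python
import time
from typing import Any, Callable, Tuple

BACKOFF_SECONDS = (2, 4, 8, 15)  # delays quoted in the FAQ answer


def get_with_backoff(request: Callable[[], Tuple[int, Any]],
                     sleep: Callable[[float], None] = time.sleep) -> Any:
    """One initial attempt plus up to four retries while the status is 429."""
    status, body = request()
    for delay in BACKOFF_SECONDS:
        if status != 429:
            break
        sleep(delay)
        status, body = request()
    if status == 429:
        raise RuntimeError("still rate-limited after 4 retries")
    return body


# Simulated responses: two 429s, then success. `sleep` is swapped for a recorder.
responses = iter([(429, None), (429, None), (200, "ok")])
waited = []
body = get_with_backoff(lambda: next(responses), sleep=waited.append)
print(body, waited)  # ok [2, 4]
```

Injecting `sleep` makes the schedule easy to verify without actually waiting.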

How do I get only new jobs each day? Schedule the Actor and dedupe downstream using the stable guid field, or sort by postedAt desc and keep rows newer than your last run's max timestamp.
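Both approaches can be sketched downstream in Python. The helper names are hypothetical; `guid` and `postedAt` are the output fields, and how you persist the set of seen guids or the last timestamp between runs is left to your own storage.

```python
from datetime import datetime
from typing import Iterable, List, Set


def parse_iso(ts: str) -> datetime:
    """Parse timestamps like 2026-05-14T09:12:00.000Z into aware datetimes."""
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


def unseen_by_guid(jobs: Iterable[dict], seen: Set[str]) -> List[dict]:
    """Keep jobs whose guid has not been seen yet; updates `seen` in place."""
    fresh = []
    for job in jobs:
        guid = job.get("guid")
        if guid and guid not in seen:
            seen.add(guid)
            fresh.append(job)
    return fresh


def newer_than(jobs: Iterable[dict], cutoff_iso: str) -> List[dict]:
    """Jobs posted after the cutoff, newest first."""
    cutoff = parse_iso(cutoff_iso)
    fresh = [j for j in jobs if j.get("postedAt") and parse_iso(j["postedAt"]) > cutoff]
    return sorted(fresh, key=lambda j: parse_iso(j["postedAt"]), reverse=True)


# Hypothetical rows from two consecutive runs.
rows = [
    {"guid": "himalayas-1", "postedAt": "2026-05-13T09:12:00.000Z"},
    {"guid": "himalayas-2", "postedAt": "2026-05-14T09:12:00.000Z"},
    {"guid": "himalayas-3", "postedAt": "2026-05-15T07:00:00.000Z"},
]
seen_guids = {"himalayas-1"}
by_guid = [j["guid"] for j in unseen_by_guid(rows, seen_guids)]
by_time = [j["guid"] for j in newer_than(rows, "2026-05-14T00:00:00.000Z")]
print(by_guid)  # ['himalayas-2', 'himalayas-3']
print(by_time)  # ['himalayas-3', 'himalayas-2']
```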

Can I filter by salary? Not in input, but minSalary / maxSalary / currency are in output. Filter downstream (WHERE minSalary >= 100000 AND currency = 'USD').

Can I filter by location or timezone? Output has locationRestrictions and timezoneRestrictions arrays. Filter downstream. Himalayas only lists remote roles, so the question is geographic restriction rather than physical office.

What output formats are supported? JSON, JSONL, CSV, XLSX, XML, HTML, RSS. Pull via the Apify API or push to Google Sheets, S3, webhooks.

Can I integrate with an ATS / CRM? Yes. Use Apify Webhooks on ACTOR.RUN.SUCCEEDED to pipe rows into Greenhouse, Lever, Workable, Ashby, HubSpot, Salesforce. Map guid to your unique-key column so re-runs upsert.

What if Himalayas changes its API? The scraper is defensive: missing fields become null and malformed JSON triggers a clean stop. Report breakage via the Actor's Issues tab.

Can I run it from my own script? POST your input JSON to https://api.apify.com/v2/acts/logiover~himalayas-remote-jobs-scraper/runs?token=YOUR_TOKEN, poll the run, fetch dataset items.
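A rough standard-library sketch of that flow is shown below. The run URL is the one quoted above; the polling endpoint (`/actor-runs/{id}`), the dataset items endpoint (`/datasets/{id}/items`) and the `data.id`, `data.status` and `data.defaultDatasetId` response fields are assumptions based on the general Apify API v2 and should be checked against the current Apify documentation. `YOUR_TOKEN` is a placeholder.

```python
import json
import time
import urllib.request

API = "https://api.apify.com/v2"
ACTOR_ID = "logiover~himalayas-remote-jobs-scraper"
# Assumed terminal run statuses; anything else is treated as still in progress.
TERMINAL = {"SUCCEEDED", "FAILED", "TIMED-OUT", "ABORTED"}


def run_url(token: str) -> str:
    """URL for starting a run, as quoted in the FAQ answer."""
    return f"{API}/acts/{ACTOR_ID}/runs?token={token}"


def _call(url: str, payload=None):
    """GET when payload is None, otherwise POST the payload as JSON."""
    data = json.dumps(payload).encode() if payload is not None else None
    req = urllib.request.Request(url, data=data,
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def run_actor(token: str, actor_input: dict, poll_seconds: float = 10.0) -> list:
    """Start a run, poll until it finishes, then return the dataset items."""
    run = _call(run_url(token), actor_input)["data"]
    while run["status"] not in TERMINAL:
        time.sleep(poll_seconds)
        run = _call(f"{API}/actor-runs/{run['id']}?token={token}")["data"]
    if run["status"] != "SUCCEEDED":
        raise RuntimeError(f"run ended with status {run['status']}")
    return _call(f"{API}/datasets/{run['defaultDatasetId']}/items?token={token}&format=json")


if __name__ == "__main__":
    # Performs real network calls; replace the placeholder token before running.
    items = run_actor("YOUR_TOKEN", {"maxJobs": 100, "maxPagesToScan": 5})
    print(len(items), "jobs fetched")
```

The official Apify client libraries wrap the same steps and may be more convenient than raw HTTP for longer-lived integrations.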


Pair Himalayas with the rest of the job-board collection: Remote OK (tech-heavy remote), Remotive (curated remote feed), Jobicy (tech & marketing remote), Welcome to the Jungle (European employer branding), Web3 & Crypto Jobs (web3.career), Arbeitnow (German-speaking + European remote), Hacker News Who Is Hiring (monthly HN thread), Lever Postings (Lever-hosted pages), Greenhouse Job Board (Greenhouse-hosted pages), The Muse (career platform), Built In Tech Jobs (US-city tech ecosystems), USAJobs (US federal), Internshala (Indian internships), Hirist Tech (Indian senior tech), WorkIndia (Indian mass hiring), Naukri (India's largest board), Lagou Tech Jobs (Chinese tech).


Keyword cloud

Core: himalayas scraper, himalayas remote jobs scraper, himalayas.app scraper, remote jobs api, remote jobs scraper, worldwide remote jobs, remote job board scraper, remote tech jobs feed, remote jobs dataset, remote-first hiring data, structured job listings, jobs apify actor.

Niche / role / region: remote react engineer jobs, remote backend engineer jobs, remote python developer, remote senior engineer jobs, remote staff engineer, remote product manager jobs, remote designer jobs, remote UX UI jobs, remote data scientist jobs, remote DevOps SRE jobs, remote QA engineer jobs, remote marketing jobs, remote sales jobs, remote customer success jobs, remote operations jobs, remote finance jobs, remote HR people ops jobs, remote legal counsel jobs, US-only remote jobs, EU remote jobs, LATAM remote jobs, APAC remote jobs, timezone-restricted remote jobs.

Use case: remote hiring trends, remote-work labor market data, remote salary benchmark, remote jobs aggregator backend, recruiter sourcing automation, ATS data pipeline, talent intelligence dashboard, remote-jobs alert bot, job-hunt automation, EOR PEO market intelligence, dev-tools lead generation, remote-work academic research.

Audience: recruiters, talent acquisition, sourcers, HR-tech founders, ATS vendors, dev-tool marketers, EOR PEO sales teams, payroll software, equipment providers, job board operators, career platforms, labor market economists, journalists, indie developers, job seekers, career coaches.


Changelog

  • 2026-06-01: Maintenance and reliability pass. Pulled the latest source and rebuilt the Actor on the current base image; build verified.
  • 2026-05-25: Maintenance and reliability pass. Pulled the latest source and rebuilt the Actor on the current base image; build verified.
  • 2026-05-20: Maintenance pass. Reviewed the input schema and default values for a smooth one-click start, and rebuilt the Actor on the latest base image.

Last reviewed: 2026-06-01.