Scrape entry-level IT jobs in India

Scrape entry-level IT jobs in India from LinkedIn and major ATS boards (Greenhouse, Lever, Workday, SmartRecruiters, etc.). Filter by recency, location, and job type. Outputs clean JSON to dataset and Excel report to key-value store.

Pricing: from $0.0005 / actor start

Rating: 5.0 (1)

Developer: Dhanunjaya Y (Maintained by Community)

Actor stats:

  • Bookmarked: 0
  • Total users: 10
  • Monthly active users: 1
  • Last modified: 5 days ago


Entry-Level IT Jobs Scraper India - Export to Excel

What It Does

This actor scrapes entry-level IT jobs in India from LinkedIn and 12 ATS-style sources (13 boards in total):

  • LinkedIn
  • Greenhouse
  • Lever
  • SmartRecruiters
  • Workday
  • iCIMS
  • Jobvite
  • BambooHR
  • Zoho Recruit
  • Freshteam
  • Keka
  • Darwinbox
  • Recruitee

It normalizes all results into one schema, filters by posting recency, removes duplicates, and exports:

  • JSON records to Apify dataset
  • OUTPUT.xlsx to Apify key-value store
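The normalize-and-dedupe step can be sketched as follows. The field names and the dedup key are illustrative assumptions, not the actor's actual schema:

```python
from typing import Iterable

def normalize(raw: dict, source: str) -> dict:
    """Map a raw board record onto one common schema (fields assumed)."""
    return {
        "title": (raw.get("title") or "").strip(),
        "company": (raw.get("company") or "").strip(),
        "location": (raw.get("location") or "").strip(),
        "url": raw.get("url", ""),
        "source": source,
    }

def dedupe(jobs: Iterable[dict]) -> list[dict]:
    """Keep the first job seen for each (title, company, location) key."""
    seen: set[tuple[str, str, str]] = set()
    unique = []
    for job in jobs:
        key = (job["title"].lower(), job["company"].lower(), job["location"].lower())
        if key not in seen:
            seen.add(key)
            unique.append(job)
    return unique
```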

Input Fields

  • keywords (string): Job search terms. Example: software engineer fresher, python developer, data analyst
  • location (string): Target location. Example: India, Bengaluru, Hyderabad
  • posted_within (enum): Time window filter. Options: 1 hour, 2 hours, 5 hours, 12 hours, today, 2 days, this week
  • sources (array): Which boards to scrape. Default includes all 13 sources.
  • job_type (enum): full-time, internship, or both
  • max_results_per_source (integer): Max jobs fetched per source
  • max_keyword_variants (integer): Number of auto-generated keyword variants tested per source (default: 20)
  • linkedin_li_at_cookie (secret string, optional): LinkedIn li_at session cookie; improves fetch reliability when LinkedIn enforces stricter rate limits
  • execution_mode (enum): ultra, fast, balanced, deep (default: fast)
    • ultra: fastest turnaround, smaller scan depth
    • fast: speed + coverage balance (recommended for frequent runs)
    • balanced: deeper than fast, slower
    • deep: maximum depth, slowest
  • run_timeout_seconds (integer): Global runtime budget before pending sources are cancelled
  • source_timeout_seconds (integer): Per-source timeout budget
  • source_concurrency / variant_concurrency / linkedin_variant_concurrency (integer): Advanced parallelism controls
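A run input combining the fields above might look like this; all values, including the source identifiers, are illustrative:

```python
# Example run input for this actor (values are illustrative assumptions).
run_input = {
    "keywords": "python developer fresher",
    "location": "Bengaluru",
    "posted_within": "today",
    "sources": ["linkedin", "greenhouse", "lever"],  # source IDs assumed
    "job_type": "full-time",
    "max_results_per_source": 50,
    "max_keyword_variants": 20,
    "execution_mode": "fast",
    "run_timeout_seconds": 300,
    "source_timeout_seconds": 60,
}
```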

Smart keyword expansion

  • The actor auto-expands user keywords for better entry-level coverage.
  • Example: QA jobs expands to variants like qa intern, manual testing intern, automation testing trainee, software tester, junior qa engineer, quality analyst, and more.
  • Prefix/suffix generation adds terms such as junior, associate, entry-level, fresher, intern, trainee, plus role endings like engineer, analyst, and tester.
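The expansion can be sketched roughly like this; the actor's real variant list is larger, but the prefixes and suffixes below come straight from the description above:

```python
# Prefixes/suffixes taken from the description above; the real generator
# produces more variants than this sketch.
PREFIXES = ["junior", "associate", "entry-level", "fresher", "intern", "trainee"]
SUFFIXES = ["engineer", "analyst", "tester"]

def expand(keyword: str, limit: int = 20) -> list[str]:
    """Generate entry-level variants of a base keyword, capped at `limit`."""
    variants = [keyword]
    variants += [f"{p} {keyword}" for p in PREFIXES]
    variants += [f"{keyword} {s}" for s in SUFFIXES]
    # Deduplicate while preserving order, then cap at the variant budget.
    seen, out = set(), []
    for v in variants:
        if v not in seen:
            seen.add(v)
            out.append(v)
    return out[:limit]
```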

Time Filter

Every job goes through a common parser that handles:

  • Relative strings: 2 hours ago, Posted 3 days ago, Today
  • ISO strings: 2024-01-15T10:30:00Z
  • Unix timestamps: 1705312200000
  • Calendar strings: Jan 15, 2024, 15/01/2024, 15 Jan 2024
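A minimal parser covering the relative, ISO, and Unix-timestamp cases might look like the sketch below; it is a best-effort illustration, not the actor's actual implementation (calendar-string formats would need an extra branch):

```python
from datetime import datetime, timedelta, timezone
import re

def parse_posted(value, now=None):
    """Best-effort parse of a 'posted' field into an aware UTC datetime."""
    now = now or datetime.now(timezone.utc)
    if isinstance(value, (int, float)):
        # Unix timestamp; values above ~1e12 are milliseconds.
        ts = value / 1000 if value > 1e12 else value
        return datetime.fromtimestamp(ts, tz=timezone.utc)
    text = str(value).strip().lower().removeprefix("posted").strip()
    if text == "today":
        return now.replace(hour=0, minute=0, second=0, microsecond=0)
    m = re.match(r"(\d+)\s+(hour|day|week)s?\s+ago", text)
    if m:
        n, unit = int(m.group(1)), m.group(2)
        return now - {"hour": timedelta(hours=n),
                      "day": timedelta(days=n),
                      "week": timedelta(weeks=n)}[unit]
    # Fall back to ISO 8601 ("Z" suffix mapped for fromisoformat).
    return datetime.fromisoformat(str(value).strip().replace("Z", "+00:00"))
```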

When to use each option

  • 1 hour: Very fresh alerting workflow
  • 2 hours: Slightly wider near-real-time scan
  • 5 hours: Same-shift refresh
  • 12 hours: Half-day batch run
  • today: Daily run
  • 2 days: Catch-up run
  • this week: Weekly sourcing run

Speed Notes

  • Scraping now runs in parallel across sources and keyword variants.
  • Sources that do not support server-side keyword search are auto-run with a single keyword pass to avoid redundant calls.
  • In fast mode, typical runs complete within roughly 1-3 minutes depending on selected sources, keyword breadth, and target result counts.
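The parallel fan-out across sources can be sketched with a thread pool; `scrape_source` here is a hypothetical placeholder for the actor's real per-source fetchers:

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

def scrape_source(source: str, keywords: list[str]) -> list[dict]:
    """Hypothetical per-source fetcher; returns one stub record per keyword."""
    return [{"source": source, "keyword": kw} for kw in keywords]

def scrape_all(sources, keywords, source_concurrency=4):
    """Run all sources in parallel, bounded by source_concurrency."""
    results = []
    with ThreadPoolExecutor(max_workers=source_concurrency) as pool:
        futures = {pool.submit(scrape_source, s, keywords): s for s in sources}
        for fut in as_completed(futures):
            try:
                results.extend(fut.result())
            except Exception:
                # One failing board should not abort the whole run.
                pass
    return results
```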

Output

The Excel file includes 4 sheets:

  • All Jobs
  • By Source Board
  • By City
  • Dashboard

The All Jobs sheet has styled headers, a frozen header row, column filters, alternating row shading, and clickable apply links.
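To reproduce similar styling yourself, a minimal sketch with openpyxl (assuming that library; the actor's actual export code may differ) looks like:

```python
from openpyxl import Workbook
from openpyxl.styles import Font, PatternFill

wb = Workbook()
ws = wb.active
ws.title = "All Jobs"

ws.append(["Title", "Company", "City", "Source", "Apply"])
ws.append(["QA Intern", "Acme", "Pune", "lever", "Apply"])      # sample row
ws.cell(row=2, column=5).hyperlink = "https://example.com/apply"  # clickable link

for cell in ws[1]:                       # bold white-on-blue header row
    cell.font = Font(bold=True, color="FFFFFF")
    cell.fill = PatternFill("solid", fgColor="4472C4")
ws.freeze_panes = "A2"                   # keep headers visible while scrolling
ws.auto_filter.ref = ws.dimensions       # add column filters over all data
```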

Recommended screenshot path for store listing:

  • assets/excel-output-sample.png (add your real run screenshot before publishing)

Recommended actor icon path:

  • assets/actor-icon.svg (upload this in Actor Settings > Profile > Icon)

Pricing

Recommended Apify store pricing setup:

  • Pay-per-use: $0.50 per 100 results
  • Subscription: $19/month unlimited runs
  • Free tier: 50 results for trial users

Local Setup

pip install -r requirements.txt
playwright install chromium
python -m unittest discover -s tests -v
python main.py

Apify Deploy

apify login
apify push

After deployment:

  1. Run actor on Apify.
  2. Verify OUTPUT.xlsx exists in key-value store.
  3. Verify JSON rows exist in dataset.
  4. Review logs for per-source counts and errors.

Data Seed Files

Company/portal seeds are in data/:

  • lever_companies.json
  • greenhouse_companies.json
  • jobvite_companies.json
  • bamboohr_companies.json
  • zoho_companies.json
  • freshteam_companies.json
  • keka_companies.json
  • darwinbox_companies.json
  • recruitee_companies.json
  • icims_clients.json
  • workday_portals.json

Populate these lists with your target companies for higher coverage.
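The seed schema is not documented here; assuming each file holds a JSON list of board identifiers (an assumption, check the repo's actual files), a loader might look like:

```python
import json
from pathlib import Path

def load_seeds(data_dir: str = "data") -> dict[str, list]:
    """Load every seed JSON in data/ into a {source: entries} map.

    Assumes filenames like lever_companies.json, where the prefix
    before the first underscore names the source board.
    """
    seeds = {}
    for path in Path(data_dir).glob("*.json"):
        source = path.stem.split("_")[0]   # "lever_companies" -> "lever"
        seeds[source] = json.loads(path.read_text())
    return seeds
```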