Internshala Internship & Jobs Scraper avatar

Internshala Internship & Jobs Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
Internshala Internship & Jobs Scraper

Internshala Internship & Jobs Scraper

Scrape internship and job listings from Internshala.com — India's #1 career platform with 200K+ active listings. Filter by category, city, work-from-home, stipend and more. Full pagination, detail page data, and structured output.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

Logiover

Logiover

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Extract structured internship and job listing data from Internshala.com — India's #1 career platform trusted by 400K+ companies with 200K+ active listings. Filter by category, city, remote, stipend and more.

Apify Actor Built with Crawlee


Why This Actor

Internshala is the dominant platform for student internships and fresher jobs in India. Existing actors on the Apify Store either don't paginate (returning only the first page of results), lack city/category filtering, or don't extract detail-page data like skills, perks and full descriptions. This actor covers all of these.

Key advantages:

  • Full pagination — scrapes all available pages, not just the first
  • Flexible filtering — category + city + WFH + part-time + stipend combinations
  • Detail pages — skills, perks, full description, applicant count, openings
  • Fast & cheap — uses raw HTTP (CheerioCrawler), no browser overhead
  • No proxy required for most runs — Internshala has minimal bot protection

How It Works

Internshala is server-side rendered, so no JavaScript execution is needed. The actor uses Crawlee's CheerioCrawler to:

  1. Build search URLs from your filter combination (category × city)
  2. Scrape listing cards from each search results page
  3. Paginate automatically through all available pages
  4. Optionally open each listing's detail page for full data

Supported Listing Types

TypeDescription
internshipsStudent internship positions across all fields
jobsFresher / entry-level full-time jobs

Category SlugDescription
web-developmentWeb dev internships/jobs
data-scienceData science, ML, AI
marketingDigital marketing, SEO, social media
financeFinance, accounting, CA
content-writingContent, copywriting, blogging
graphic-designDesign, UI/UX
business-developmentBD, sales
human-resourcesHR, recruitment
engineeringCore engineering
computer-scienceCS, software development
pythonPython development
javaJava development
machine-learningML engineering

Any Internshala category slug works — just use the slug from the URL: internshala.com/internships/{slug}-internship/


Input

{
"listingType": "internships",
"categories": ["web-development", "data-science", "marketing"],
"cities": ["delhi", "mumbai", "bangalore"],
"workFromHome": false,
"partTime": false,
"minStipend": 5000,
"maxListings": 500,
"maxPages": 20,
"scrapeDetails": true,
"requestDelay": 500,
"maxConcurrency": 3
}

Parameters

ParameterTypeDefaultDescription
listingTypestringinternshipsinternships or jobs
categoriesarray[]Category slugs. Empty = all listings
citiesarray[]City slugs. Empty = all India
workFromHomebooleanfalseRemote/WFH listings only
partTimebooleanfalsePart-time only
minStipendinteger0Minimum monthly stipend in ₹
maxListingsinteger200Max total listings (0 = unlimited)
maxPagesinteger10Max pages per category+city (0 = unlimited)
scrapeDetailsbooleantrueOpen detail pages for skills, perks, description
requestDelayinteger (ms)500Delay between requests
maxConcurrencyinteger3Parallel requests

Output

Each listing is saved as one JSON record. Example:

{
"listingId": "1744234567",
"listingType": "internships",
"url": "https://internshala.com/internship/detail/web-development-internship-at-startup-xyz1744234567",
"title": "Web Development",
"company": "Startup XYZ",
"companyUrl": "https://internshala.com/company/startup-xyz",
"location": "Delhi",
"isRemote": false,
"stipend": "₹ 10,000 /month",
"stipendMin": 10000,
"stipendMax": 10000,
"duration": "3 Months",
"startDate": "Immediately",
"applyBy": "30 Apr' 26",
"openings": 2,
"applicants": 547,
"skills": ["HTML/CSS", "JavaScript", "React", "Node.js"],
"perks": ["Certificate", "Letter of recommendation", "Flexible work hours"],
"description": "We are looking for a Web Development intern to join our team...",
"isPartTime": false,
"hasJobOffer": true,
"postedAt": "Today",
"category": "web-development",
"scrapedAt": "2026-04-11T16:00:00.000Z"
}

Output Fields

FieldTypeDescription
listingIdstringUnique Internshala listing ID
listingTypestringinternships or job
urlstringDetail page URL
titlestringRole/position title
companystringHiring company
companyUrlstringCompany profile on Internshala
locationstringCity or Work From Home
isRemotebooleanTrue if WFH/remote
stipendstringStipend as displayed
stipendMinintegerParsed min monthly stipend (₹)
stipendMaxintegerParsed max monthly stipend (₹)
durationstringInternship duration
startDatestringStart date
applyBystringApplication deadline
openingsintegerNumber of positions
applicantsintegerNumber of applicants
skillsarrayRequired skills
perksarrayPerks offered
descriptionstringFull description
isPartTimebooleanPart-time flag
hasJobOfferbooleanWhether PPO is offered
postedAtstringPost date
categorystringCategory used in search
scrapedAtstringScrape timestamp (ISO 8601)

Download Formats

JSON, JSONL, CSV, XLSX, XML — from the Apify Console Dataset tab.


Automation & Integration

n8n / Make / Zapier

Schedule daily runs and route results to Google Sheets, Airtable, CRMs, Slack alerts or email — perfect for job market monitoring or recruitment pipelines.

Apify API

# Start a run
curl -X POST "https://api.apify.com/v2/acts/YOUR_ACTOR_ID/runs?token=YOUR_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"listingType": "internships",
"categories": ["data-science"],
"workFromHome": true,
"maxListings": 1000
}'

AI / LLM Pipelines

Use the description, skills and perks fields as grounding data for LLM-powered job matching, resume gap analysis, or market research agents. All fields consistently typed and null-explicit.


Performance & Cost

Uses CheerioCrawler (raw HTTP, no browser) — significantly faster and cheaper than Playwright-based scrapers.

ListingsConcurrencyEstimated TimeMemory
200 (with details)3~2–4 min256 MB
1,000 (with details)5~8–15 min256 MB
5,000 (list only)5~10–20 min256 MB

Set scrapeDetails: false for fastest bulk collection (list data only, no skills/description).


Common Use Cases

  • Recruitment intelligence — track which companies are hiring for specific skills across Indian cities
  • Market research — analyze stipend trends, popular skills, internship durations by category
  • Job board aggregation — build an India-focused job aggregator or newsletter
  • AI training data — large structured dataset of Indian internship/job descriptions in English
  • Startup tracker — monitor which startups are actively hiring interns (often first signal of growth)
  • Career counseling — understand what skills are most in demand per city/category

Users are responsible for compliance with Internshala Terms of Service and applicable data protection regulations. Scrape responsibly — use reasonable delays and do not republish individual user data.


Changelog

VersionDateNotes
1.0.02025-04-11Initial release — full pagination, category+city+WFH+stipend filters, detail pages