Internshala Internship & Jobs Scraper
Pricing
from $3.00 / 1,000 results
Internshala API alternative. Scrape internships & jobs without login; export stipend, skills & company data to CSV/JSON dataset.
Developer
Logiover
Maintained by Community
Extract structured internship and job listing data from Internshala.com — India's #1 career platform trusted by 400K+ companies with 200K+ active listings. Filter by category, city, remote, stipend and more.
Why This Actor
Internshala is the dominant platform for student internships and fresher jobs in India. Existing actors on the Apify Store either don't paginate (returning only the first page of results), lack city/category filtering, or don't extract detail-page data like skills, perks and full descriptions. This actor covers all of these.
Key advantages:
- Full pagination — scrapes all available pages, not just the first
- Flexible filtering — category + city + WFH + part-time + stipend combinations
- Detail pages — skills, perks, full description, applicant count, openings
- Fast & cheap — uses raw HTTP (CheerioCrawler), no browser overhead
- No proxy required for most runs — Internshala has minimal bot protection
How It Works
Internshala is server-side rendered, so no JavaScript execution is needed. The actor uses Crawlee's CheerioCrawler to:
- Build search URLs from your filter combination (category × city)
- Scrape listing cards from each search results page
- Paginate automatically through all available pages
- Optionally open each listing's detail page for full data
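The URL-building step can be sketched as follows. This is a minimal illustration, not the actor's actual code: only the `/{listingType}/{slug}-internship/` shape comes from this README, while the `-in-{city}` suffix, the `page-{n}/` pagination segment, the `-job` suffix for job listings, and the `build_search_urls` helper itself are assumptions made for the example.

```python
from itertools import product

BASE_URL = "https://internshala.com"


def build_search_urls(listing_type, categories, cities, max_pages=1):
    """Return one search URL per (category, city, page) combination."""
    # Assumed path suffix: "internship" for internships, "job" for jobs.
    suffix = "internship" if listing_type == "internships" else "job"
    urls = []
    # An empty filter list means "no filter", so iterate over a single None.
    for category, city in product(categories or [None], cities or [None]):
        path = f"{BASE_URL}/{listing_type}/"
        if category or city:
            segment = f"{category}-{suffix}" if category else suffix
            if city:
                segment += f"-in-{city}"  # assumed city filter format
            path += f"{segment}/"
        # Page 1 is the bare search URL; later pages use an assumed "page-N/" segment.
        for page in range(1, max_pages + 1):
            urls.append(path if page == 1 else f"{path}page-{page}/")
    return urls


for url in build_search_urls("internships", ["web-development"], ["delhi", "mumbai"], max_pages=2):
    print(url)
```

Two categories and three cities produce six search URLs per page, which is why `maxPages` is applied per category+city combination rather than per run.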
Supported Listing Types
| Type | Description |
|---|---|
| `internships` | Student internship positions across all fields |
| `jobs` | Fresher / entry-level full-time jobs |
Popular Categories
| Category Slug | Description |
|---|---|
| `web-development` | Web dev internships/jobs |
| `data-science` | Data science, ML, AI |
| `marketing` | Digital marketing, SEO, social media |
| `finance` | Finance, accounting, CA |
| `content-writing` | Content, copywriting, blogging |
| `graphic-design` | Design, UI/UX |
| `business-development` | BD, sales |
| `human-resources` | HR, recruitment |
| `engineering` | Core engineering |
| `computer-science` | CS, software development |
| `python` | Python development |
| `java` | Java development |
| `machine-learning` | ML engineering |
Any Internshala category slug works — just use the slug from the URL: internshala.com/internships/{slug}-internship/
Input
```json
{
  "listingType": "internships",
  "categories": ["web-development", "data-science", "marketing"],
  "cities": ["delhi", "mumbai", "bangalore"],
  "workFromHome": false,
  "partTime": false,
  "minStipend": 5000,
  "maxListings": 500,
  "maxPages": 20,
  "scrapeDetails": true,
  "requestDelay": 500,
  "maxConcurrency": 3
}
```
Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| `listingType` | string | `internships` | `internships` or `jobs` |
| `categories` | array | `[]` | Category slugs. Empty = all listings |
| `cities` | array | `[]` | City slugs. Empty = all India |
| `workFromHome` | boolean | `false` | Remote/WFH listings only |
| `partTime` | boolean | `false` | Part-time only |
| `minStipend` | integer | `0` | Minimum monthly stipend in ₹ |
| `maxListings` | integer | `200` | Max total listings (0 = unlimited) |
| `maxPages` | integer | `10` | Max pages per category+city (0 = unlimited) |
| `scrapeDetails` | boolean | `true` | Open detail pages for skills, perks, description |
| `requestDelay` | integer (ms) | `500` | Delay between requests |
| `maxConcurrency` | integer | `3` | Parallel requests |
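When preparing inputs programmatically, it can help to merge a partial input with the documented defaults and catch misspelled keys before a run starts. The sketch below does this client-side; the `resolve_input` helper is hypothetical and only mirrors the defaults listed in the table, not the actor's own validation.

```python
import copy

# Defaults copied from the parameter table above.
DEFAULTS = {
    "listingType": "internships",
    "categories": [],
    "cities": [],
    "workFromHome": False,
    "partTime": False,
    "minStipend": 0,
    "maxListings": 200,
    "maxPages": 10,
    "scrapeDetails": True,
    "requestDelay": 500,
    "maxConcurrency": 3,
}


def resolve_input(user_input):
    """Merge a partial input with the defaults and reject unknown keys."""
    unknown = set(user_input) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"Unknown parameters: {sorted(unknown)}")
    # Deep-copy so callers never mutate the shared default lists.
    resolved = {**copy.deepcopy(DEFAULTS), **user_input}
    if resolved["listingType"] not in ("internships", "jobs"):
        raise ValueError("listingType must be 'internships' or 'jobs'")
    return resolved


print(resolve_input({"categories": ["data-science"], "workFromHome": True}))
```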
Output
Each listing is saved as one JSON record. Example:
```json
{
  "listingId": "1744234567",
  "listingType": "internships",
  "url": "https://internshala.com/internship/detail/web-development-internship-at-startup-xyz1744234567",
  "title": "Web Development",
  "company": "Startup XYZ",
  "companyUrl": "https://internshala.com/company/startup-xyz",
  "location": "Delhi",
  "isRemote": false,
  "stipend": "₹ 10,000 /month",
  "stipendMin": 10000,
  "stipendMax": 10000,
  "duration": "3 Months",
  "startDate": "Immediately",
  "applyBy": "30 Apr' 26",
  "openings": 2,
  "applicants": 547,
  "skills": ["HTML/CSS", "JavaScript", "React", "Node.js"],
  "perks": ["Certificate", "Letter of recommendation", "Flexible work hours"],
  "description": "We are looking for a Web Development intern to join our team...",
  "isPartTime": false,
  "hasJobOffer": true,
  "postedAt": "Today",
  "category": "web-development",
  "scrapedAt": "2026-04-11T16:00:00.000Z"
}
```
Output Fields
| Field | Type | Description |
|---|---|---|
| `listingId` | string | Unique Internshala listing ID |
| `listingType` | string | `internships` or `jobs` |
| `url` | string | Detail page URL |
| `title` | string | Role/position title |
| `company` | string | Hiring company |
| `companyUrl` | string | Company profile on Internshala |
| `location` | string | City or Work From Home |
| `isRemote` | boolean | True if WFH/remote |
| `stipend` | string | Stipend as displayed |
| `stipendMin` | integer | Parsed min monthly stipend (₹) |
| `stipendMax` | integer | Parsed max monthly stipend (₹) |
| `duration` | string | Internship duration |
| `startDate` | string | Start date |
| `applyBy` | string | Application deadline |
| `openings` | integer | Number of positions |
| `applicants` | integer | Number of applicants |
| `skills` | array | Required skills |
| `perks` | array | Perks offered |
| `description` | string | Full description |
| `isPartTime` | boolean | Part-time flag |
| `hasJobOffer` | boolean | Whether a pre-placement offer (PPO) is available |
| `postedAt` | string | Post date |
| `category` | string | Category used in search |
| `scrapedAt` | string | Scrape timestamp (ISO 8601) |
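To illustrate how `stipendMin` and `stipendMax` relate to the displayed `stipend` string, here is one plausible way to derive them. The actor's real parsing logic is not documented here, so treat the `parse_stipend` helper, the range format, and the handling of unpaid listings as assumptions.

```python
import re


def parse_stipend(text):
    """Return (min, max) monthly stipend in rupees, or (None, None) if no amount is shown."""
    # Drop thousands separators, then collect every run of digits.
    numbers = [int(n) for n in re.findall(r"\d+", text.replace(",", ""))]
    if not numbers:
        return None, None
    # A single amount yields min == max; a range yields its two bounds.
    return min(numbers), max(numbers)


print(parse_stipend("₹ 10,000 /month"))          # fixed stipend, as in the example record
print(parse_stipend("₹ 5,000 - 8,000 /month"))   # assumed range format
print(parse_stipend("Unpaid"))                   # assumed label for unpaid listings
```

Keeping the raw `stipend` string alongside the parsed integers means downstream consumers can re-parse unusual formats without re-scraping.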
Download Formats
JSON, JSONL, CSV, XLSX, XML — from the Apify Console Dataset tab.
Automation & Integration
n8n / Make / Zapier
Schedule daily runs and route results to Google Sheets, Airtable, CRMs, Slack alerts or email — perfect for job market monitoring or recruitment pipelines.
Apify API
```bash
# Start a run
curl -X POST "https://api.apify.com/v2/acts/YOUR_ACTOR_ID/runs?token=YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "listingType": "internships",
    "categories": ["data-science"],
    "workFromHome": true,
    "maxListings": 1000
  }'
```
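The same request can be prepared from Python using only the standard library. This sketch builds the request object without sending it; `YOUR_ACTOR_ID` and `YOUR_TOKEN` are placeholders exactly as in the curl call, and `build_run_request` is a hypothetical helper name.

```python
import json
import urllib.parse
import urllib.request


def build_run_request(actor_id, token, run_input):
    """Build (but do not send) a POST request that starts an actor run."""
    query = urllib.parse.urlencode({"token": token})
    url = f"https://api.apify.com/v2/acts/{actor_id}/runs?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_run_request(
    "YOUR_ACTOR_ID",
    "YOUR_TOKEN",
    {
        "listingType": "internships",
        "categories": ["data-science"],
        "workFromHome": True,
        "maxListings": 1000,
    },
)
print(request.get_method(), request.full_url)

# To actually start the run (needs network access and a valid token):
# with urllib.request.urlopen(request) as response:
#     run = json.load(response)
```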
AI / LLM Pipelines
Use the description, skills and perks fields as grounding data for LLM-powered job matching, resume gap analysis, or market research agents. All fields are consistently typed, and missing values are returned as explicit nulls rather than omitted.
Performance & Cost
Uses CheerioCrawler (raw HTTP, no browser) — significantly faster and cheaper than Playwright-based scrapers.
| Listings | Concurrency | Estimated Time | Memory |
|---|---|---|---|
| 200 (with details) | 3 | ~2–4 min | 256 MB |
| 1,000 (with details) | 5 | ~8–15 min | 256 MB |
| 5,000 (list only) | 5 | ~10–20 min | 256 MB |
Set scrapeDetails: false for fastest bulk collection (list data only, no skills/description).
Common Use Cases
- Recruitment intelligence — track which companies are hiring for specific skills across Indian cities
- Market research — analyze stipend trends, popular skills, internship durations by category
- Job board aggregation — build an India-focused job aggregator or newsletter
- AI training data — large structured dataset of Indian internship/job descriptions in English
- Startup tracker — monitor which startups are actively hiring interns (often first signal of growth)
- Career counseling — understand what skills are most in demand per city/category
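As a small example of the market research and career counseling use cases, the sketch below summarizes a list of output records: median `stipendMin` per `category` and the most frequent `skills`. The sample records are made up for illustration, and `summarize` is a hypothetical helper, not part of the actor.

```python
from collections import Counter, defaultdict
from statistics import median


def summarize(records, top_n=3):
    """Return (median stipendMin per category, top_n most common skills)."""
    stipends = defaultdict(list)
    skills = Counter()
    for record in records:
        # Skip listings without a parsed stipend (null in the dataset).
        if record.get("stipendMin") is not None:
            stipends[record["category"]].append(record["stipendMin"])
        # skills may be null when detail pages were not scraped.
        skills.update(record.get("skills") or [])
    median_by_category = {cat: median(vals) for cat, vals in stipends.items()}
    return median_by_category, skills.most_common(top_n)


# Illustrative records using the documented field names.
sample = [
    {"category": "web-development", "stipendMin": 10000, "skills": ["React", "JavaScript"]},
    {"category": "web-development", "stipendMin": 6000, "skills": ["JavaScript"]},
    {"category": "data-science", "stipendMin": 8000, "skills": ["Python"]},
    {"category": "data-science", "stipendMin": None, "skills": None},
]
medians, top_skills = summarize(sample)
print(medians)
print(top_skills)
```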
FAQ
Is this an Internshala API alternative?
Yes. Internshala has no official public API, so this actor works as an unofficial Internshala API — giving you structured internship and job data on demand without building or maintaining a scraper yourself.
Can I scrape Internshala without login?
Yes. Internshala listing and detail pages are server-side rendered and publicly accessible, so the actor extracts data without any login, account, or cookies.
How do I export Internshala internships to CSV or JSON?
Run the actor, then open the Apify Console Dataset tab and download the results as CSV, JSON, JSONL, XLSX or XML. Each Internshala internship dataset record includes stipend, skills, company and full description fields.
Legal & Ethical Use
Users are responsible for compliance with Internshala Terms of Service and applicable data protection regulations. Scrape responsibly — use reasonable delays and do not republish individual user data.
Changelog
2026-06-15
- Reliability pass: re-verified end-to-end on live data with real-world inputs. Routine maintenance build.
2026-06-07
- Docs: added coverage for using the actor as an Internshala unofficial API / API alternative, scraping without login, and exporting internship data to CSV/JSON.
2026-06-05
- 🛡️ Reliability fix: results are no longer dropped by strict output validation — runs now complete cleanly even at high volume (thousands of results).
- ⚡ Stability & performance hardening; fresh rebuild.
2026-06-04
- Verified live & refreshed build; reliability/maintenance pass.
| Version | Date | Notes |
|---|---|---|
| 1.0.0 | 2025-04-11 | Initial release: full pagination, category+city+WFH+stipend filters, detail pages |