Internshala Internship & Jobs Scraper avatar

Internshala Internship & Jobs Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
Internshala Internship & Jobs Scraper

Internshala Internship & Jobs Scraper

Scrape internship and job listings from Internshala.com — India's #1 career platform with 200K+ active listings. Filter by category, city, work-from-home, stipend and more. Full pagination, detail page data, and structured output.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

Logiover

Logiover

Maintained by Community

Actor stats

0

Bookmarked

309

Total users

81

Monthly active users

5 days ago

Last modified

Share

Internshala Internship & Jobs Scraper — Scrape Internshala.com Listings to JSON/CSV

Internshala Internship & Jobs Scraper

Scrape internship and fresher job listings from Internshala.com, India's #1 career platform trusted by 400K+ companies with 200K+ active listings. This Internshala scraper extracts structured data — role, company, stipend, skills, perks, duration, applicant counts and full descriptions — with full pagination, category/city/work-from-home/stipend filters, and detail-page data. It runs on raw HTTP (no browser), needs no login and no API key, and exports clean data to JSON, CSV or Excel.

✨ What this Actor does / Key features

  • 📄 Full pagination — scrapes every available results page, not just the first.
  • 🎯 Flexible filtering — combine listing type (internships or jobs), category slugs, cities, work-from-home, part-time and minimum stipend.
  • 🧩 Detail-page data — optionally opens each listing to extract required skills, perks, full description, openings and applicant counts.
  • 💰 Parsed stipend values — captures stipend as displayed plus parsed numeric min/max in ₹/month.
  • 🏢 Company data — company name and Internshala company profile URL for every listing.
  • Fast & cheap — uses a raw HTTP (Cheerio) engine; no headless browser overhead.
  • 🔁 Multi-search support — each category × city combination runs as a separate search for broad coverage.
  • 🛡️ Optional proxy — works without a proxy for most runs; Apify Proxy supported for high-volume jobs.
  • 📊 Structured, null-explicit output — consistently typed fields ready for spreadsheets, CRMs or AI pipelines.

🔍 Input

FieldTypeDescription
listingTypestringWhether to scrape internships or jobs (fresher / entry-level).
categoriesarrayOne or more Internshala category slugs (e.g., "web-development", "data-science"). Each generates a separate search. Leave empty for all categories.
citiesarrayFilter results by city slug (e.g., "delhi", "mumbai", "bangalore"). Leave empty for all India.
workFromHomebooleanIf enabled, only returns work-from-home / remote listings. Overrides city filters.
partTimebooleanFilter to part-time listings only.
minStipendintegerMinimum monthly stipend filter in ₹. 0 for no filter.
maxListingsintegerMaximum total listings to scrape across all categories and cities. 0 for unlimited.
maxPagesintegerMaximum pages to paginate per category+city combination (~15–20 listings/page). 0 for unlimited.
scrapeDetailsbooleanOpen each listing's detail page to extract full description, skills, perks and metadata.
proxyConfigurationobjectProxy settings. Datacenter proxies usually work; residential recommended for high-volume runs.
requestDelayintegerDelay between requests in milliseconds (300–800 ms is sufficient).
maxConcurrencyintegerNumber of parallel requests (Internshala tolerates 3–5).

🚀 Example input

{
"listingType": "internships",
"categories": ["web-development", "data-science", "marketing"],
"cities": ["delhi", "mumbai", "bangalore"],
"workFromHome": false,
"partTime": false,
"minStipend": 5000,
"maxListings": 500,
"maxPages": 20,
"scrapeDetails": true,
"requestDelay": 500,
"maxConcurrency": 3
}

📦 Output

Each listing is saved as one structured JSON record in the dataset. Export to JSON, JSONL, CSV, XLSX or XML.

FieldDescription
listingIdUnique Internshala listing ID
listingTypeinternships or jobs
urlDetail page URL
titleRole / position title
companyHiring company
companyUrlCompany profile on Internshala
locationCity or "Work From Home"
isRemoteTrue if WFH / remote
stipendStipend as displayed
stipendMinParsed minimum monthly stipend (₹)
stipendMaxParsed maximum monthly stipend (₹)
durationInternship duration
startDateStart date
applyByApplication deadline
openingsNumber of positions
applicantsNumber of applicants
skillsRequired skills (array)
perksPerks offered (array)
descriptionFull description
isPartTimePart-time flag
hasJobOfferWhether a pre-placement offer (PPO) is offered
postedAtPost date
categoryCategory used in search
scrapedAtScrape timestamp (ISO 8601)

💡 Use cases

  • Recruitment intelligence — track which companies hire for specific skills across Indian cities.
  • Market research — analyze stipend trends, popular skills and internship durations by category.
  • Job board aggregation — build an India-focused internship aggregator or newsletter.
  • AI / LLM training data — assemble a large structured dataset of internship and job descriptions.
  • Startup tracking — monitor which startups are actively hiring interns, often an early growth signal.
  • Career counseling — understand which skills are most in demand per city and category.

🛠️ How it works

Internshala is server-side rendered, so no JavaScript execution is needed. The Actor builds search URLs from your filter combinations, scrapes listing cards from each results page, paginates automatically through all pages, and optionally opens each listing's detail page for the full description, skills and perks.

❓ Frequently Asked Questions

Is it legal to scrape Internshala? The Actor only collects publicly visible listing data. You are responsible for complying with Internshala's Terms of Service and applicable data protection laws — scrape responsibly, use reasonable delays, and do not republish individual users' personal data.

Do I need an API key or a login? No. There is no Internshala account, login or API key required. You only need an Apify account to run the Actor.

Do I need a proxy? Not for most runs — Internshala has minimal bot protection and datacenter proxies usually work. For high-volume scraping, enabling Apify Proxy (residential) improves reliability.

How much data can I get? You can scrape thousands of listings per run. Each category × city combination is a separate paginated search, so combining multiple categories and cities gives broad coverage. Use maxListings and maxPages to control volume.

Can I filter by city, stipend or remote work? Yes. You can filter by listing type, category slug, city, minimum monthly stipend, part-time only, and work-from-home only.

How fast is it and how much does it cost? It uses a raw HTTP engine, so it's fast and cheap. As a rough guide, ~200 listings with details take about 2–4 minutes, and 1,000 listings with details about 8–15 minutes. Set scrapeDetails: false for the fastest bulk collection.

What output formats are supported? Results are stored in a structured Apify dataset and can be exported as JSON, JSONL, CSV, XLSX or XML, or pulled via the Apify API.

⏰ Scheduling & integration

Schedule this Actor on Apify to monitor the Indian internship and fresher job market daily or weekly. Export results to JSON, CSV or Excel, and route them to Google Sheets, Airtable, CRMs, Slack or email via the Apify API and n8n / Make / Zapier integrations.


Changelog

  • 2026-05-25 — Maintenance & reliability pass: pulled the latest source and rebuilt the Actor on the current base image; build verified.

  • 2026-05-20 — Maintenance pass: reviewed the input schema and default values for a smooth one-click start, and rebuilt the Actor on the latest base image.

Last reviewed: 2026-05-25.