H-1B LCA Visa Wage & Employer Data Scraper

Pricing

Pay per event

Scrape US DOL H-1B Labor Condition Application records: employer, job title, base salary, prevailing wage, work location, case status, SOC/NAICS codes, and decision dates.

Developer

ParseForge

Maintained by Community

🛂 H-1B LCA Disclosure Scraper

🚀 Pull every public H-1B Labor Condition Application in seconds. Filter by employer, job title, work city, or year. No API key, no manual XLSX wrangling, no PDF parsing.

🕒 Last updated: 2026-05-16 · 📊 60+ fields per case · 8M+ certified cases searchable · 2012 to 2025 coverage · Hourly + annual wages

Every year the US Department of Labor's Office of Foreign Labor Certification (OFLC) publishes hundreds of thousands of Labor Condition Application (LCA) decisions for H-1B, H-1B1, and E-3 visa workers. Buried in those records is the only public wage data the federal government publishes at the individual case level, along with the sponsoring employer, the job title, the SOC occupation code, the worksite address, the prevailing wage determination, and the immigration attorney representing the employer. Until now the only way to query it was to download a 600 MB quarterly XLSX file and write your own pivot tables.

This scraper turns that public disclosure data into a clean JSON or CSV feed. Plug in an employer name, a city, or a year and get back per-case records with the case status, decision date, base salary, wage range, prevailing wage level, employer POC, and law firm filing on the petition. Use it to benchmark visa sponsorship wages, build a sponsor lookup for an immigration practice, score employer H-1B dependency, or feed a salary comparison product. The data is government-sourced, court-admissible, and refreshed quarterly by DOL.

🎯 Built for | 💡 Common use cases
🧑‍⚖️ Immigration attorneys | Case prep, employer due diligence, RFE responses
🏢 Corporate HR + mobility teams | Wage benchmarking, sponsor compliance audits
🧮 Salary research products | Augment Levels.fyi / Glassdoor with visa wages
🎓 Policy researchers + journalists | Track H-1B sponsor concentration, wage trends

📋 What the H-1B LCA Disclosure Scraper does

  • 🔍 Search by employer. Pull every LCA filed by Stripe, Google, Microsoft, Infosys, or any sponsor in the disclosure dataset.
  • 🏙️ Search by work city or year. Build wage tables for a specific metro or a specific fiscal year.
  • 💼 Search by job title. Get every Software Engineer, Data Scientist, or Account Executive LCA across all sponsors.
  • 💰 Capture full wage detail. Base wage, wage range (from + to), wage unit (Year / Hour / Month), prevailing wage, and PW level (I-IV).
  • 📂 Capture full case lifecycle. Submit date, decision date, employment start + end, status (Certified / Denied / Withdrawn).
  • 🏛️ Capture attorney + law firm filings. When an agent represents the employer the dataset includes the lawyer name, email, phone, and law firm.

Each record carries the canonical DOL case number, full employer address with NAICS code, the employer point of contact (name, title, phone, email), the worksite address and county, the H-1B dependency flag, the willful violator flag, and the public disclosure election. Each case has 60+ fields, populated whenever the source LCA includes the value.

💡 Why it matters: H-1B LCA data is the only public case-level wage dataset the US government publishes. It is the source of truth for visa sponsorship benchmarks, immigration case prep, and any salary product that wants to cover the visa-sponsored segment of the US tech labor market.


🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing a sponsor lookup, a wage benchmark query, and a salary-by-city export.


⚙️ Input

Field | Type | Required | What it does
employer | string | no | Filter by sponsor name (case insensitive, partial match). Example: STRIPE, GOOGLE LLC, INFOSYS.
jobTitle | string | no | Filter by job title keyword. Example: SOFTWARE ENGINEER, DATA SCIENTIST, FINANCIAL ANALYST.
city | string | no | Filter by employment city. Example: SAN FRANCISCO, AUSTIN, NEW YORK.
year | enum | no | Single fiscal year (2012 to 2025) or All Years. Default 2024. Combine with at least one of employer / job / city for best results.
includeDetails | boolean | no | Default true. Fetch the per-case detail page to enrich each record with status, decision date, SOC code, NAICS, employer POC, attorney filing, prevailing wage, and worksite address. Set to false for a faster listing-only scrape.
startUrl | string | no | Paste a search URL from the source site to bypass the filter fields. Useful when copying a saved query.
maxItems | integer | no | Free plan: capped at 10 (preview). Paid plan: up to 1,000,000.

Example: every LCA Stripe filed in 2024, full detail.

{
  "employer": "STRIPE",
  "year": "2024",
  "includeDetails": true,
  "maxItems": 500
}

Example: every Austin LCA filed in 2024, listing only (fast).

{
  "city": "AUSTIN",
  "year": "2024",
  "includeDetails": false,
  "maxItems": 5000
}

⚠️ Good to Know: Leaving employer, jobTitle, and city empty while year is set to All Years returns no rows, because the source site refuses unbounded queries. Set at least one of employer, jobTitle, or city, or pick a specific year, to get results.
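A minimal Python sketch of how a caller might assemble the input and catch the unbounded-query case before starting a run. The helper name `build_run_input` is hypothetical, and the literal enum value `"All Years"` is an assumption based on the wording of the input table; check the Actor's input schema for the exact value.

```python
from typing import Any, Optional

ALL_YEARS = "All Years"  # assumed enum label, taken from the input table wording


def build_run_input(
    employer: Optional[str] = None,
    job_title: Optional[str] = None,
    city: Optional[str] = None,
    year: str = "2024",
    include_details: bool = True,
    max_items: int = 10,
) -> dict[str, Any]:
    """Assemble an Actor input dict, rejecting unbounded queries up front."""
    filters = {"employer": employer, "jobTitle": job_title, "city": city}
    # Drop filters that are unset or whitespace-only.
    run_input: dict[str, Any] = {
        key: value.strip() for key, value in filters.items() if value and value.strip()
    }
    if not run_input and year == ALL_YEARS:
        raise ValueError("Set employer, jobTitle, city, or a specific year.")
    if max_items < 1:
        raise ValueError("maxItems must be a positive integer.")
    run_input.update(year=year, includeDetails=include_details, maxItems=max_items)
    return run_input


# Mirrors the Stripe example above.
stripe_2024 = build_run_input(employer="STRIPE", max_items=500)
print(stripe_2024)
```

Validating locally avoids paying for a run that the source site would answer with zero rows.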


📊 Output

Every record is a single LCA case with the employer, job, salary, work location, dates, and the full DOL filing detail.

🧾 Schema

Field | Type | Example
🆔 caseNumber | string | I-200-24138-006531
status | string | Certified
🔗 caseUrl | string | https://h1bdata.info/details.php?id=I-200-24138-006531
🛂 visaClass | string | H-1B
🏢 employerLegalName | string | Stripe, Inc.
💼 jobTitle | string | APPLICATION SECURITY ENGINEER
📑 socCode | string | 15-1212.00
📑 socTitle | string | Information Security Analysts
💰 baseSalary | number | 169395
💵 wageRateFrom | number | 169395
💵 wageRateTo | number | 250000
📐 wageRateUnit | string | Year
📊 prevailingWage | number | 169395
🪜 prevailingWageLevel | string | IV
🗓️ prevailingWageOesYear | string | 7/1/2023 - 6/30/2024
🏙️ workCity | string | SOUTH SAN FRANCISCO
🗺️ workState | string | CA
📅 submitDate | string | 2024-05-16
📅 decisionDate | string | 2024-05-23
📅 employmentStartDate | string | 2024-10-25
📅 employmentEndDate | string | 2027-10-24
👥 totalWorkerPositions | integer | 1
🏢 naicsCode | string | 522320
📞 employerPhone | string | 14157379490
✉️ employerPocEmail | string | drivera@stripe.com
⚖️ lawFirmName | string | FRAGOMEN, DEL REY, BERNSEN & LOEWY, LLP
🚩 h1bDependent | boolean | false
🚨 willfulViolator | boolean | false
🕒 scrapedAt | string | 2026-05-16T04:30:07.640Z

(60+ fields total. Worksite address, POC contact info, attorney filing, and case workforce breakdown are all included when the source LCA carries them.)
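Because wages arrive in different units (Year / Hour / Month), downstream comparisons usually need a normalization step. The sketch below is one way to do that in Python, not part of the Actor itself. The function names are hypothetical, the 2,080-hour work year is a common convention rather than something this README specifies, and the premium calculation assumes baseSalary and prevailingWage are expressed in the same unit.

```python
from typing import Any, Mapping, Optional

# Assumed conversion factors: a 2,080-hour work year (40 h x 52 weeks) and 12 months.
# Units not listed in this README return None instead of a guess.
PERIODS_PER_YEAR = {"Year": 1, "Month": 12, "Hour": 2080}


def annualized(amount: Optional[float], unit: Optional[str]) -> Optional[float]:
    """Convert a wage in the given unit to an annual figure, or None if unknown."""
    if amount is None or unit not in PERIODS_PER_YEAR:
        return None
    return float(amount) * PERIODS_PER_YEAR[unit]


def wage_premium(record: Mapping[str, Any]) -> Optional[float]:
    """Fraction by which baseSalary exceeds prevailingWage (0.10 means 10% above)."""
    base = record.get("baseSalary")
    prevailing = record.get("prevailingWage")
    if base is None or not prevailing:
        return None
    return base / prevailing - 1.0


# Values copied from the Example column of the schema table.
sample = {"baseSalary": 169395, "prevailingWage": 169395, "wageRateUnit": "Year"}
print(annualized(sample["baseSalary"], sample["wageRateUnit"]))
print(wage_premium(sample))
```

Returning None for unrecognized units keeps bad conversions out of a benchmark instead of silently treating every wage as annual.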

✨ Why choose this Actor

Capability
🏛️ Government-sourced data. Records originate from DOL OFLC public disclosure files, the only court-admissible H-1B wage source.
💰 Full wage detail. Base wage, wage range, hourly + annual, prevailing wage, and PW level on every record.
🔁 Quarterly refresh. Source data is updated every quarter by DOL, so your benchmarks track the live H-1B market.
🎯 Search the way attorneys think. Filter by sponsor name, work city, job title, or fiscal year. Combine any of them.
⚖️ Attorney + law firm detail. When outside counsel files the petition, the lawyer name, email, phone, and firm are all captured.
🧾 60+ fields per case. Worksite address, NAICS, employer POC, H-1B dependency, willful violator flag, public disclosure election.
🚀 No registration anywhere. No DOL account, no captcha, no manual XLSX wrangling. Hit run, get JSON or CSV.

📊 Over 8 million H-1B / H-1B1 / E-3 LCA decisions are searchable across fiscal years 2012 to 2025.


📈 How it compares to alternatives

Approach | Cost | Coverage | Refresh | Filters | Setup
⭐ H-1B LCA Disclosure Scraper (this Actor) | Pay per case | 2012 to 2025, all visa classes | Quarterly with DOL releases | Employer + job + city + year | Zero config
Official quarterly downloads | Free | Latest 4 quarters per file | Quarterly | XLSX, no UI | Spreadsheet wrangling per file
Paid live APIs | Subscription | Often US-only, gated | Varies | Limited | Account + key + quotas
Legacy community dumps | Free | Often years out of date | Rarely | None | DIY parsing
Manual case status lookup | Free | One case at a time | Live | Case number required | Captcha per query

Most teams that try the official XLSX route give up at the second quarter. The scraper turns the same data into a JSON feed you can query like any other API.


🚀 How to use

  1. 🆕 Create a free Apify account. No credit card needed for the preview tier.
  2. 🔎 Open the H-1B LCA Disclosure Scraper page in the Apify Console and click "Try for free".
  3. ✏️ Fill in your filters. Pick a sponsor, a city, a job title, a year, or any combination.
  4. ▶️ Click Start. The Actor pulls the matching cases and writes one record per LCA.
  5. 📥 Export the dataset. Download as JSON, CSV, Excel, or HTML, or pipe it into your data warehouse via the Apify API.

⏱️ Total time: under 60 seconds from sign-up to first dataset export.


💼 Business use cases

🧑‍⚖️ Immigration attorneys

  • Sponsor due diligence before taking on a new corporate client
  • Wage evidence for RFE responses (other certified cases at the same SOC + level)
  • Track competitor law firm filings against shared sponsors
  • Build a private case database keyed by employer or job title
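
The "same SOC + level" comparison in the list above can be sketched in Python as a grouped median. This is an illustration under assumptions: the function name is hypothetical, only certified cases with an annual wage unit are counted, and the three input rows are made-up values for demonstration, not real LCA data.

```python
from collections import defaultdict
from statistics import median
from typing import Any, Iterable, Mapping


def median_wage_by_soc_level(
    records: Iterable[Mapping[str, Any]],
) -> dict[tuple[str, str], float]:
    """Median baseSalary per (socCode, prevailingWageLevel) pair."""
    groups: dict[tuple[str, str], list[float]] = defaultdict(list)
    for rec in records:
        # Keep certified, annual-wage cases that carry all three grouping inputs.
        if rec.get("status") != "Certified" or rec.get("wageRateUnit") != "Year":
            continue
        soc = rec.get("socCode")
        level = rec.get("prevailingWageLevel")
        wage = rec.get("baseSalary")
        if soc and level and wage is not None:
            groups[(soc, level)].append(float(wage))
    return {key: median(wages) for key, wages in groups.items()}


# Made-up rows for illustration only.
rows = [
    {"status": "Certified", "wageRateUnit": "Year", "socCode": "15-1212.00",
     "prevailingWageLevel": "IV", "baseSalary": 100000},
    {"status": "Certified", "wageRateUnit": "Year", "socCode": "15-1212.00",
     "prevailingWageLevel": "IV", "baseSalary": 140000},
    {"status": "Denied", "wageRateUnit": "Year", "socCode": "15-1212.00",
     "prevailingWageLevel": "IV", "baseSalary": 500000},
]
benchmarks = median_wage_by_soc_level(rows)
print(benchmarks)
```

A median is used here rather than a mean so that a single outlier filing does not skew the benchmark.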

🏢 Corporate HR + global mobility

  • Benchmark visa sponsorship wages against your offer letters
  • Detect H-1B dependent flags before extending an offer
  • Audit your own filings for SOC + level consistency across worksites
  • Pre-fill prevailing wage tables for new requisitions

🧮 Salary research products

  • Augment Levels.fyi / Glassdoor / Payscale with visa wage data
  • Show users the H-1B sponsor pool for a given role + city
  • Build comp dashboards filtered by NAICS or SOC
  • Surface employer wage outliers for editorial coverage

📊 Recruiting + sales intelligence

  • Lead lists of H-1B-heavy sponsors by industry
  • Score companies by visa filing volume for sponsorship pitches
  • Track new market entrants (first H-1B filings in a metro)
  • Identify staffing firms and consultancies by NAICS + filing pattern

🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

🎨 Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

🤝 Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

🧪 Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

🔌 Automating H-1B LCA Disclosure Scraper

Drive the scraper from your own code via the Apify API. The Actor returns a dataset URL with the full JSON output, ready to push into a warehouse or a BI tool.

Schedule recurring runs to keep your sponsor lookup table fresh after every DOL quarterly release. The Apify scheduler can fire the Actor weekly, push the results into your storage, and notify your team via webhook when a new batch lands.
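
As a rough sketch of driving the Actor from code, the Python below prepares a call to Apify's synchronous run endpoint (`run-sync-get-dataset-items`) using only the standard library. The Actor ID and token are placeholders to be replaced with real values from the Apify Console, and the helper names are hypothetical. Synchronous runs are time-limited on the Apify side, so large pulls may be better served by an asynchronous run plus a dataset download.

```python
import json
import urllib.request
from typing import Any

API_BASE = "https://api.apify.com/v2"
ACTOR_ID = "your-username~your-actor-name"  # placeholder: copy the real ID from the Actor's API tab


def build_run_request(
    actor_id: str, token: str, run_input: dict[str, Any]
) -> urllib.request.Request:
    """Prepare (but do not send) a synchronous run that responds with dataset items."""
    return urllib.request.Request(
        url=f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items",
        data=json.dumps(run_input).encode("utf-8"),
        headers={"Content-Type": "application/json", "Authorization": f"Bearer {token}"},
        method="POST",
    )


def fetch_records(
    request: urllib.request.Request, timeout: float = 300.0
) -> list[dict[str, Any]]:
    """Send the prepared request and decode the JSON array of LCA records."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


request = build_run_request(
    ACTOR_ID, "YOUR_APIFY_TOKEN", {"employer": "STRIPE", "year": "2024", "maxItems": 10}
)
print(request.get_method(), request.full_url)
# records = fetch_records(request)  # uncomment once ACTOR_ID and the token are real
```

Splitting request construction from sending makes the call easy to inspect or unit-test without network access or spending credits.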


🔌 Integrate with any app

  • Zapier - trigger flows on new LCA records
  • Make - low-code automation across SaaS apps
  • n8n - self-hosted automation with HTTP + database nodes
  • Google Sheets - push results straight into a tab
  • Slack - notify a channel when a sponsor crosses a wage threshold
  • Airbyte - sync the dataset into Snowflake, BigQuery, or Postgres
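
For the Slack idea above, a small Python filter could turn scraped records into a message body. This is a sketch under assumptions: the function name is hypothetical, the `{"text": ...}` shape is what Slack incoming webhooks accept, and how the body is delivered (Apify integration, Zapier, or a direct POST) is left to the reader.

```python
import json
from typing import Any, Iterable, Mapping, Optional


def wage_alert_payload(
    records: Iterable[Mapping[str, Any]], sponsor: str, threshold: float
) -> Optional[str]:
    """Return a Slack-style JSON body for annual wages at or above threshold, else None."""
    hits = [
        rec for rec in records
        if sponsor.lower() in str(rec.get("employerLegalName", "")).lower()
        and rec.get("wageRateUnit") == "Year"
        and (rec.get("baseSalary") or 0) >= threshold
    ]
    if not hits:
        return None
    lines = [
        f"{rec.get('caseNumber')}: {rec.get('jobTitle')} at ${rec['baseSalary']:,.0f}"
        for rec in hits
    ]
    header = f"{len(hits)} {sponsor} LCA(s) at or above ${threshold:,.0f}"
    return json.dumps({"text": "\n".join([header, *lines])})


# Values copied from the Example column of the schema table.
sample = {
    "caseNumber": "I-200-24138-006531",
    "employerLegalName": "Stripe, Inc.",
    "jobTitle": "APPLICATION SECURITY ENGINEER",
    "baseSalary": 169395,
    "wageRateUnit": "Year",
}
payload = wage_alert_payload([sample], sponsor="Stripe", threshold=150000)
print(payload)
```

Restricting the check to records with a Year wage unit avoids comparing hourly rates against an annual threshold.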

💡 Pro Tip: browse the complete ParseForge collection for more government-data and labor-market scrapers.


🆘 Need Help? Open our contact form and we will get back to you within one business day.


⚖️ Disclaimer: This scraper accesses public US Department of Labor disclosure data. Data is provided as-is for research, commercial, and personal use. ParseForge is not affiliated with the US Department of Labor or USCIS. Always verify case status and wage data against the official DOL source of record before relying on it in a legal proceeding.