GST Scraper avatar

GST Scraper

Pricing

from $9.99 / 1,000 results

Go to Apify Store
GST Scraper

GST Scraper

Scrape taxpayer details, filing & HSN from GST portal

Pricing

from $9.99 / 1,000 results

Rating

0.0

(0)

Developer

W3Crawler

W3Crawler

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

12 hours ago

Last modified

Share

GSTIN Scraper

Scrape taxpayer details, filing & HSN from GST portal without CAPTCHA

Features

  • Structured data extraction — outputs clean JSON fields ready for analysis
  • Cloud-ready — runs on Apify with scheduling, proxy rotation, and webhook support

Use Cases

  • Data collection and research automation at scale
  • Building structured datasets from public web content
  • Monitoring for new listings, updates, or changes over time
  • Competitive intelligence and market benchmarking

Input

FieldTypeDefaultRequiredDescription
gstinsarrayGSTIN numbers to scrape from GST portal
extractHsnCodesbooleanTrueExtract goods and services information including HSN/SAC codes
extractFilingDetailsbooleanTrueExtract filing status and financial year details
headlessbooleanTrueRun browser in headless mode (no visible UI). Set to false for debugging.
maxItemsinteger20Maximum number of items to scrape
proxyConfigurationobjectApify ResidentialProxy settings. Residential proxy is strongly recommended

Output

Each scraped item includes:

FieldDescription
goodserviceGoodservice
finanacialYearsFinanacial years
filingStatusFiling status
filingFrequencyFiling frequency
natureOfTaxpayerNature of taxpayer
isAadhaarVerifiedIs aadhaar verified
legalNameLegal name
stateJurisdictionState jurisdiction
stateState or province name
divisionDivision
zoneZone
circleCircle
taxpayerTypeTaxpayer type
cancellationDateCancellation date
gstinNumberGstin number
businessActivitiesBusiness activities
isEKYCVerifiedIs e k y c verified
compositionSchemeComposition scheme
registrationDateRegistration date
companyTypeCompany type

Sample Output

{
"goodservice": "goodservice_value",
"finanacialYears": "finanacialYears_value",
"filingStatus": "filingStatus_value",
"filingFrequency": "filingFrequency_value",
"natureOfTaxpayer": "natureOfTaxpayer_value",
"isAadhaarVerified": "isAadhaarVerified_value",
"legalName": "legalName_value",
"stateJurisdiction": "stateJurisdiction_value"
}

Notes

  • Proxy: Residential proxy is highly recommended — set proxyConfiguration: { useApifyProxy: true, apifyProxyGroups: ["RESIDENTIAL"] }
  • Rate limits: Keep maxItems reasonable per run to avoid IP bans or rate throttling
  • Scheduling: Use Apify Scheduler to run this actor on a recurring basis for a consistently fresh dataset