Webpage Tables Extractor avatar

Webpage Tables Extractor

Pricing

$20.00 / 1,000 tables extracteds

Go to Apify Store
Webpage Tables Extractor

Webpage Tables Extractor

Extract every HTML <table> from a page into clean JSON arrays (headers + rows) — feed spreadsheets straight to an agent or pipeline.

Pricing

$20.00 / 1,000 tables extracteds

Rating

0.0

(0)

Developer

Anthony Snider

Anthony Snider

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Turn any webpage's HTML <table>s into clean, structured JSON — headers and rows ready for a spreadsheet, an LLM, or a data pipeline.

Live on the Apify Store — run it instantly or call it as an agent tool via Apify MCP.

What you get

  • Every real data <table> on the page, parsed to JSON.
  • Each table: index, headers, rowCount, and rows (objects keyed by header, falling back to column index).
  • Layout/spacer tables (single column or fewer than 2 rows) are automatically skipped.
  • Loose colspan handling so cells stay aligned with headers.
  • Single URL or bulk URLs in one run.

Input

{
"url": "https://en.wikipedia.org/wiki/List_of_largest_companies_by_revenue",
"maxUrls": 25
}

Or bulk:

{
"urls": [
"https://example.com/report-a",
"https://example.com/report-b"
]
}

Output

One dataset item per page:

{
"url": "https://en.wikipedia.org/wiki/List_of_largest_companies_by_revenue",
"tableCount": 1,
"tables": [
{
"index": 0,
"headers": ["Rank", "Name", "Industry", "Revenue (USD millions)"],
"rowCount": 50,
"rows": [
{
"Rank": "1",
"Name": "Walmart",
"Industry": "Retail",
"Revenue (USD millions)": "648,125"
}
]
}
]
}

Pricing: pay-per-event — charged once per page processed.