Stack Overflow / Stack Exchange Scraper avatar

Stack Overflow / Stack Exchange Scraper

Pricing

from $2.00 / 1,000 question returneds

Go to Apify Store
Stack Overflow / Stack Exchange Scraper

Stack Overflow / Stack Exchange Scraper

Searches Stack Overflow and Stack Exchange sites (Server Fault, Super User, Ask Ubuntu, MathOverflow, Software Engineering) by keyword and/or tags and returns structured questions: title, score, answer count, views, tags, author, date, and the questi

Pricing

from $2.00 / 1,000 question returneds

Rating

0.0

(0)

Developer

Dami's Studio

Dami's Studio

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

3 hours ago

Last modified

Share

Search Stack Overflow — or any of the big Stack Exchange sites (Server Fault, Super User, Ask Ubuntu, MathOverflow, Software Engineering) — by keyword and/or tags, and get back clean, structured questions. No API key, no login, no anti-bot.

It talks to the public api.stackexchange.com API directly, so results are fast and reliable. It paginates for you, dedupes by question id, and returns the question body as plain text (HTML stripped).

What you get per question

questionId, title, url, score, answerCount, viewCount, isAnswered, tags, ownerName, ownerReputation, createdAt (ISO), and body (the question text, HTML stripped).

Input

FieldNotes
queryKeywords (title + body). Optional if you give tags.
sitestackoverflow, serverfault, superuser, askubuntu, mathoverflow, softwareengineering.
tagsComma-separated, e.g. javascript,promise. A question must carry all of them.
sortByvotes (default), relevance, creation, activity.
maxItemsHow many questions to collect (paginates the API in pages of 100).

You must provide a query or at least one tag.

Output

One dataset row per question. Empty searches return a single diagnostic row and are not charged.

Example

{ "query": "async await", "site": "stackoverflow", "tags": "javascript", "sortBy": "votes", "maxItems": 50 }

Notes

The unauthenticated Stack Exchange API allows ~300 requests/day per IP. The actor respects the API's backoff directive (sleeps when asked) and stops cleanly when the daily quota is exhausted, returning a clear diagnostic instead of failing silently. For very large jobs, split by tag, keyword, or site.