Wantstobehired Parser Spider avatar

Wantstobehired Parser Spider

Pricing

from $9.00 / 1,000 results

Go to Apify Store
Wantstobehired Parser Spider

Wantstobehired Parser Spider

This Apify Actor scrapes and parses candidate profiles from Hacker News' "Who Wants to Be Hired?" thread, extracting structured data on skills, locations, and contacts....

Pricing

from $9.00 / 1,000 results

Rating

0.0

(0)

Developer

GetDataForMe

GetDataForMe

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Share

Description

This Apify Actor scrapes and parses candidate profiles from Hacker News' "Who Wants to Be Hired?" thread, extracting structured data on skills, locations, and contacts....


Wantstobehired Parser Spider

The Wantstobehired Parser Spider is a powerful Apify Actor designed to scrape and parse candidate profiles from the "Who Wants to Be Hired?" thread on Hacker News. It extracts structured data from Supabase API endpoints, transforming raw text into clean, actionable insights for recruiters, HR professionals, and developers. This tool streamlines talent discovery by automating the collection of candidate details like skills, locations, and contact information, saving hours of manual effort.

Features

  • Automated Data Extraction: Seamlessly pulls candidate profiles from specified URLs, handling pagination and API responses.
  • Structured Output: Converts unstructured text into JSON with fields like technologies, location, email, and resume links.
  • Customizable Limits: Set item limits to control the volume of data processed per run.
  • High Reliability: Built with robust error handling to ensure consistent performance even with varying data formats.
  • Fast Processing: Optimized for speed, processing multiple profiles quickly without compromising accuracy.
  • Flexible Input: Accepts multiple URLs for batch processing, ideal for large-scale talent searches.
  • Apify Integration: Fully compatible with Apify's platform for easy scheduling, monitoring, and data export.

Input Parameters

ParameterTypeRequiredDescriptionExample
UrlsarrayYesA list of URLs to scrape candidate data from. Each URL should be a valid Supabase API endpoint.["https://doeyotjfaoilomjzisbu.supabase.co/rest/v1/candidates?select=*&offset=50&limit=25&order=created_at.desc"]
itemLimitintegerNoThe maximum number of candidate items to process and return.20

Example Usage

To run the Actor, provide input in JSON format:

{
"Urls": [
"https://doeyotjfaoilomjzisbu.supabase.co/rest/v1/candidates?select=*&offset=50&limit=25&order=created_at.desc"
],
"itemLimit": 20
}

Example output:

[
{
"id": 47272788,
"created_at": "2026-03-06 09:20:53+00:00",
"story_id": 47219667,
"author": "Savir_Khanna",
"raw_text": "Location: Boston &#x2F; By Area<p>Remote: Ok<p>Willing to relocate: Yes<p>Technologies: Python, SQL, TypeScript, AWS, Docker, FastAPI, Flask, Pandas, Plotly, React, D3, Node, MySQL, SQLite, Cursor<p>Resume: <a href=\"https:&#x2F;&#x2F;drive.google.com&#x2F;file&#x2F;d&#x2F;1QdYsbz0L1zdZUo-XEb1qBtJJj8l1lBUM&#x2F;view?usp=sharing\" rel=\"nofollow\">https:&#x2F;&#x2F;drive.google.com&#x2F;file&#x2F;d&#x2F;1QdYsbz0L1zdZUo-XEb1qBtJJj8l1lBUM&#x2F;view?usp=sharing</a><p>Email: hellosavir@gmail.com<p>Seeking backend&#x2F;fullstack internship opportunities Fall 2026 (July - December). Open to starting soon&#x2F;immediately. I prioritize making scalable APIs, maintaining system design, and have a hunger to learn, build, and ship.",
"location": "Boston",
"email": "hellosavir@gmail.com",
"remote": true,
"willing_to_relocate": true,
"resume": "https://drive.google.com/file/d/1QdYsbz0L1zdZUo-XEb1qBtJJj8l1lBUM/view?usp=sharing",
"technologies": "Python, SQL, TypeScript, AWS, Docker, FastAPI, Flask, Pandas, Plotly, React, D3, Node, MySQL, SQLite, Cursor",
"summary": "Seeking backend/fullstack internship opportunities Fall 2026 (July - December). Open to starting soon/immediately. I prioritize making scalable APIs, maintaining system design, and have a hunger to learn, build, and ship.",
"in_the_united_states": false,
"technologies_embeddings": "Python, SQL, TypeScript, AWS, Docker, FastAPI, Flask, Pandas, Plotly, React, D3, Node, MySQL, SQLite, Cursor",
"actor_id": "weYSda3fRow90cybP",
"run_id": "oXG2byIxDTi3pqe08"
},
{
"id": 47272350,
"created_at": "2026-03-06 08:15:07+00:00",
"story_id": 47219667,
"author": "anshima__saini",
"raw_text": "SEEKING REMOTE UI&#x2F;UX DESIGN OPPORTUNITIES Location: India Remote: Yes Willing to relocate: Open to discuss Technologies: Figma, Canva, Figjam, Notion, Miro, Visily, Maze, Jitter, Asana, AI, Framer, Wireframing, Prototyping, Mockups, Microinteractions, User Research, Competitive Analysis, User Flows, UI Design, UX Research, Product Management, Jira, Lottie animations.\nPortfolio: <a href=\"https:&#x2F;&#x2F;anshimasainiproduct.framer.website&#x2F;\" rel=\"nofollow\">https:&#x2F;&#x2F;anshimasainiproduct.framer.website&#x2F;</a><p>Email: sainianshima@gmail.com<p>I&#x27;m a Product designer and manager looking for remote opportunities anywhere. Feel free to check my portfolio website.",
"location": "India",
"email": "sainianshima@gmail.com",
"remote": true,
"willing_to_relocate": false,
"resume": "https://anshimasainiproduct.framer.website/",
"technologies": "Figma, Canva, Figjam, Notion, Miro, Visily, Maze, Jitter, Asana, AI, Framer, Wireframing, Prototyping, Mockups, Microinteractions, User Research, Competitive Analysis, User Flows, UI Design, UX Research, Product Management, Jira, Lottie animations",
"summary": "I'm a Product designer and manager looking for remote opportunities anywhere. Feel free to check my portfolio website.",
"in_the_united_states": false,
"technologies_embeddings": "Figma, Canva, Figjam, Notion, Miro, Visily, Maze, Jitter, Asana, AI, Framer, Wireframing, Prototyping, Mockups, Microinteractions, User Research, Competitive Analysis, User Flows, UI Design, UX Research, Product Management, Jira, Lottie animations",
"actor_id": "weYSda3fRow90cybP",
"run_id": "oXG2byIxDTi3pqe08"
},
{
"id": 47272277,
"created_at": "2026-03-06 08:03:23+00:00",
"story_id": 47219667,
"author": "Tarang21",
"raw_text": "Location: Gujarat, India\nRemote: Yes\nWilling to relocate: No\nTechnologies: Typescript, React.JS, Next.JS, Node.JS, PostgreSQL.\nResume: <a href=\"https:&#x2F;&#x2F;drive.google.com&#x2F;file&#x2F;d&#x2F;1jxobcDgmI3cCmF8iFM9npiVx64E6i1Wm&#x2F;view?usp=sharing\" rel=\"nofollow\">https:&#x2F;&#x2F;drive.google.com&#x2F;file&#x2F;d&#x2F;1jxobcDgmI3cCmF8iFM9npiVx64E6i1Wm&#x2F;view?usp=sharing</a>\nEmail: rajparatarang66@gmail.com",
"location": "Gujarat, India",
"email": "rajparatarang66@gmail.com",
"remote": true,
"willing_to_relocate": true,
"resume": "https://drive.google.com/file/d/1jxobcDgmI3cCmF8iFM9npiVx64E6i1Wm/view?usp=sharing",
"technologies": "Typescript, React.JS, Next.JS, Node.JS, PostgreSQL",
"summary": null,
"in_the_united_states": false,
"technologies_embeddings": "Typescript, React.JS, Next.JS, Node.JS, PostgreSQL",
"actor_id": "weYSda3fRow90cybP",
"run_id": "oXG2byIxDTi3pqe08"
}
]

Use Cases

  • Talent Acquisition: Recruiters can quickly identify candidates with specific skills like Python or React for job openings.
  • Market Research: Analyze trends in technologies and locations from Hacker News profiles to inform hiring strategies.
  • Competitive Intelligence: Monitor competitors' talent pools by scraping public candidate data.
  • Academic Research: Researchers can study job market dynamics, such as remote work preferences or skill distributions.
  • Business Automation: Automate data collection for HR dashboards or CRM integrations.
  • Networking: Developers and designers can find potential collaborators or mentors based on shared technologies.

Installation and Usage

  1. Search for "Wantstobehired Parser Spider" in the Apify Store.
  2. Click "Try for free" or "Run".
  3. Configure input parameters (URLs and item limit).
  4. Click "Start" to begin extraction.
  5. Monitor progress in the log.
  6. Export results in your preferred format (JSON, CSV, Excel).

Output Format

The Actor outputs an array of JSON objects, each representing a candidate profile. Key fields include:

  • id: Unique identifier for the profile.
  • created_at: Timestamp of profile creation.
  • author: Username of the candidate.
  • location: Candidate's location.
  • email: Contact email.
  • remote: Boolean indicating remote work preference.
  • willing_to_relocate: Boolean for relocation willingness.
  • resume: Link to resume or portfolio.
  • technologies: Comma-separated list of skills.
  • summary: Brief description of the candidate's goals.

Data is cleaned and structured for easy integration into databases or analysis tools.

Support

For custom/simplified outputs or bug reports, please contact:

We're here to help you get the most out of this Actor!