SlashDot Crawler

Pricing

$10.00 / 1,000 results

Extract comprehensive data from SlashDot.org, the premier technology news aggregator. This actor scrapes detailed article content, author information, publication dates, comment counts, popularity indicators, source links, and department tags from SlashDot's main sections.


Rating

5.0

(5)

Developer

Crawler Bros

Maintained by Community

Last modified

24 days ago

SlashDot Technology News Scraper

This Apify actor scrapes technology news articles from SlashDot.org, extracting comprehensive information about articles, their content, engagement metrics, and community discussions.

Features

Comprehensive Article Data: Scrapes detailed information about technology news articles
Content Analysis: Extracts full article content, summaries, and metadata
Engagement Metrics: Collects comment counts, scores, views, and ratings
Community Features: Gathers comments, discussions, and user interactions
Categorization: Extracts sections, tags, and topic classifications
Related Content: Finds related articles and cross-references
Filtering Options: Supports filtering by sections and sorting methods
HTML Debugging: Saves HTML content for selector analysis during development

Input Parameters

Parameter	Type	Default	Description
`maxArticles`	Integer	100	Maximum number of articles to scrape
`scrapeDetails`	Boolean	true	Whether to scrape detailed article pages
`sections`	Array	[]	List of sections to filter by
`sortBy`	String	"latest"	Sort method (latest, popular, most_commented)
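
The parameter table can be mirrored in a small helper that fills in the documented defaults and rejects an unknown sort method before a run is started. This is a minimal sketch, not part of the actor: `build_input` and `SORT_METHODS` are hypothetical names, and the lower bound on `maxArticles` is an assumption, since the table does not state one.

```python
from typing import Any, Dict, List, Optional

# Allowed values for sortBy, as listed in the parameter table.
SORT_METHODS = ("latest", "popular", "most_commented")


def build_input(
    max_articles: int = 100,
    scrape_details: bool = True,
    sections: Optional[List[str]] = None,
    sort_by: str = "latest",
) -> Dict[str, Any]:
    """Return an input dict using the documented keys and defaults."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_articles, bool) or not isinstance(max_articles, int):
        raise TypeError("maxArticles must be an integer")
    if max_articles < 1:
        # Assumption: the README gives no minimum; 1 is a guess.
        raise ValueError("maxArticles must be at least 1")
    if sort_by not in SORT_METHODS:
        raise ValueError(f"sortBy must be one of {SORT_METHODS}")
    return {
        "maxArticles": max_articles,
        "scrapeDetails": bool(scrape_details),
        "sections": list(sections or []),
        "sortBy": sort_by,
    }
```

Calling `build_input()` with no arguments yields the defaults from the table; passing `sort_by="oldest"` raises `ValueError` instead of sending an unsupported value to the actor.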

Output Data

Each article record includes:

Basic Information

article_id: Unique article identifier
title: Article title
summary: Article summary/teaser
url: URL to the full article
image_url: Article thumbnail/preview image URL

Author and Publication

author: Article author name
published_date: When the article was published
section: Article section/category

Categorization

tags: Array of tags and labels

Engagement Metrics

comment_count: Number of comments
score: Article score/rating
views: Number of views

Timestamps

scraped_at: When the data was scraped

Detailed Information (if scrapeDetails=true)

full_content: Complete article content
paragraphs: Array of article paragraphs
related_articles: Array of related articles with title and URL
comments: Array of comments with text, author, date, and score
media_files: Array of media files with URL, type, and alt text
source_links: Array of external source links
metadata: Article metadata from meta tags

Metadata

source: Source website (slashdot.org)
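
To show how these fields might be consumed, the sketch below types a few of the basic fields and ranks records by `comment_count`. The field types are assumptions (the README lists names only, not types), and `ArticleRecord` and `top_commented` are hypothetical helpers rather than anything the actor ships.

```python
from typing import Any, Dict, Iterable, List, TypedDict


class ArticleRecord(TypedDict, total=False):
    # Types below are guesses; the README documents field names only.
    article_id: str
    title: str
    url: str
    author: str
    published_date: str
    tags: List[str]
    comment_count: int
    scraped_at: str
    source: str


def top_commented(records: Iterable[Dict[str, Any]], n: int = 5) -> List[Dict[str, Any]]:
    """Return the n records with the highest comment_count."""

    def count(record: Dict[str, Any]) -> int:
        # Assumption: a missing or non-integer comment_count is treated as 0.
        value = record.get("comment_count")
        return value if isinstance(value, int) else 0

    return sorted(records, key=count, reverse=True)[:n]
```

Because the detail fields only appear when `scrapeDetails` is true, reading optional keys with `.get()` as above avoids `KeyError` on records from a quick run.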

Usage Examples

Basic Usage

{
  "maxArticles": 50,
  "scrapeDetails": true
}

Filtered by Section

{
  "maxArticles": 200,
  "scrapeDetails": true,
  "sections": ["technology", "science"],
  "sortBy": "popular"
}

Most Commented Articles

{
  "maxArticles": 100,
  "scrapeDetails": true,
  "sortBy": "most_commented"
}

Quick Scraping (No Details)

{
  "maxArticles": 500,
  "scrapeDetails": false,
  "sortBy": "latest"
}
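
Any of the inputs above can also be submitted from Python. The following is a sketch under stated assumptions: it uses the `apify-client` package's `actor(...).call(run_input=...)` and `dataset(...).iterate_items()` calls, assumes the client returns the run as a dict with a `defaultDatasetId` key, and leaves the actor ID as a placeholder because this README does not state it. `run_and_collect` and `main` are hypothetical names.

```python
import os
from typing import Any, Dict, List

# Placeholder: replace with the actor ID shown on its Apify Store page.
ACTOR_ID = "<username>/<actor-name>"


def run_and_collect(client: Any, actor_id: str, run_input: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Start the actor, wait for it to finish, and return its dataset items."""
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("the actor run returned no run details")
    # Results are stored in the run's default dataset.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


def main() -> None:
    # Imported here so the helper above works without the package installed.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    items = run_and_collect(client, ACTOR_ID, {"maxArticles": 50, "scrapeDetails": True})
    print(f"fetched {len(items)} articles")
```

Call `main()` with `APIFY_TOKEN` set in the environment. Passing the client in as an argument keeps `run_and_collect` easy to exercise with a stub during development.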

Development Features

HTML Debugging

During development, the scraper saves HTML content to the key-value store for selector analysis:

debug_slashdot_html: Contains the HTML content of the main page
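
One way to use the saved HTML for selector analysis is to tally which tag and class combinations occur most often, since repeated combinations usually mark the article containers. The sketch below uses only the standard library and expects the HTML as a string that has already been downloaded from the key-value store; `ClassCounter` and `common_selectors` are hypothetical names, and this is not necessarily how the actor's own selectors were chosen.

```python
from collections import Counter
from html.parser import HTMLParser
from typing import List, Optional, Tuple


class ClassCounter(HTMLParser):
    """Count how often each tag.class pair appears in a document."""

    def __init__(self) -> None:
        super().__init__()
        self.counts: Counter = Counter()

    def handle_starttag(self, tag: str, attrs: List[Tuple[str, Optional[str]]]) -> None:
        for name, value in attrs:
            if name == "class" and value:
                # A class attribute may hold several space-separated names.
                for css_class in value.split():
                    self.counts[f"{tag}.{css_class}"] += 1


def common_selectors(html: str, limit: int = 10) -> List[Tuple[str, int]]:
    """Return the most frequent tag.class pairs with their counts."""
    parser = ClassCounter()
    parser.feed(html)
    parser.close()
    return parser.counts.most_common(limit)
```

Running `common_selectors` on the contents of `debug_slashdot_html` gives a ranked list of candidate selectors to inspect by hand.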

Error Handling

Comprehensive error handling with detailed logging
Graceful handling of missing elements
Retry logic for failed requests
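
The README does not describe the retry policy, so the following is only an illustration of one common approach, exponential backoff, and not the actor's actual implementation. `with_retries` is a hypothetical helper; the attempt count and base delay are assumed values.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def with_retries(
    operation: Callable[[], T],
    attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call operation, retrying with exponential backoff on any exception."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return operation()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            # The wait doubles after each failure: base, 2*base, 4*base, ...
            sleep(base_delay * 2 ** attempt)
    raise AssertionError("unreachable")
```

The `sleep` parameter exists so the delay can be replaced in tests; in normal use the default `time.sleep` applies.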

Browser Automation

Uses Playwright for reliable browser automation
Handles dynamic content loading
Implements proper delays and waits

Installation

Install dependencies:

pip install -r requirements.txt

Install Playwright browsers:

playwright install chromium

Run the scraper:

python -m src

Docker Usage

docker build -t slashdot-scraper .
docker run -e APIFY_TOKEN=your_token slashdot-scraper

Notes

The scraper respects rate limits and implements delays between requests
HTML content is saved for debugging purposes during development
The scraper handles various article listing layouts and structures
All URLs are properly resolved and normalized
Comment extraction includes author information and engagement metrics
The scraper can handle both article listings and detailed article pages
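
As an illustration of the URL note above, relative links found on a listing page can be resolved against the site root with `urllib.parse`. This is a sketch of one reasonable normalization (absolute URL, lower-cased host, no fragment); the exact rules the actor applies are not documented here, and `normalize_url` is a hypothetical name.

```python
from urllib.parse import urljoin, urlsplit, urlunsplit

BASE_URL = "https://slashdot.org/"


def normalize_url(href: str, base: str = BASE_URL) -> str:
    """Resolve href against base, lower-case the host, and drop the fragment."""
    absolute = urljoin(base, href.strip())
    parts = urlsplit(absolute)
    # Give host-only URLs an explicit root path so equal pages compare equal.
    path = parts.path or ("/" if parts.netloc else "")
    return urlunsplit((parts.scheme, parts.netloc.lower(), path, parts.query, ""))
```

Dropping the fragment means that a link to a page's comment anchor and a link to the page itself map to the same URL, which helps when deduplicating articles.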
