Pricing

$10.00/month + usage

Go to Store

Lobsters Scraper

Try for free

Developed by

epctex

Scrape Lobste.rs posts and users based on any search criteria. Retrieve all the comments, domains, tags, titles, number of upvotes, and published dates. Use this extremely fast actor to retrieve all the information right away. Easy use and no limits!

0.0 (0)

Pricing

$10.00/month + usage

Total users

Monthly users

Runs succeeded

>99%

Last modified

8 hours ago

News

Actor - Lobsters Scraper

Lobsters scraper

Since Lobste.rs doesn't provide a good and free API, this actor should help you to retrieve data from it.

The Lobsters data scraper supports the following features:

Search any keyword - You can search any keyword you would like to have and get the results
Scrape domains - Get all the posts from each of the domains that are represented in lobste.rs.
Get posts by tags - Scraping the results by a certain tag is doable!
Retrieve user detail - If you are looking for specific user details, you are in the right place.
Fetch comments of any post - All the comments that have been shared under a post are also included inside the search results.
Get active and recent posts - Don't get outdated! Active and recent posts can be harvested right away from the Lobsters.

Bugs, fixes, updates, and changelog

This scraper is under active development. If you have any feature requests you can create an issue from here.

Upcoming Features

Integrate hierarchical comment tree structure.

Input Parameters

The input of this scraper should be JSON containing the list of pages on Lobsters that should be visited. Possible fields are:

search: (Optional) (String) Keyword that you want to search on Lobsters.
startUrls: (Optional) (Array) List of Lobsters URLs. You should only provide domains, tags, user detail, post detail, active posts, recent posts, or search URLs.
endPage: (Optional) (Number) Final number of page that you want to scrape. The default is Infinite. This applies to all search requests and startUrls individually.
maxItems: (Optional) (Number) You can limit scraped items. This should be useful when you search through the big lists or search results.
proxy: (Required) (Proxy Object) Proxy configuration.
extendOutputFunction: (Optional) (String) Function that takes a JQuery handle ($) as an argument and returns an object with data.
customMapFunction: (Optional) (String) Function that takes each object's handle as an argument and returns the object with executing the function.

This solution requires the use of Proxy servers, either your own proxy servers or you can use Apify Proxy.

Tip

When you want to scrape over a specific list URL, just copy and paste the link as one of the startUrl.

If you would like to scrape only the first page of a list then put the link for the page and have the endPage as 1.

With the last approach that is explained above you can also fetch any interval of pages. If you provide the 5th page of a list and define the endPage parameter as 6 then you'll have the 5th and 6th pages only.

Compute Unit Consumption

The actor is optimized to run blazing fast and scrape as many items as possible. Therefore, it forefronts all the detailed requests. If the actor doesn't block very often it'll scrape 100 listings in 30 seconds with ~0.01-0.02 compute units.

Lobsters Scraper Input example

{
 "startUrls": [
  "https://lobste.rs/domains/google.com",
  "https://lobste.rs/t/devops",
  "https://lobste.rs/u/lambda",
  "https://lobste.rs/active",
  "https://lobste.rs/recent",
  "https://lobste.rs/search?q=google&what=stories&order=newest"
 ],
 "maxItems":10,
 "endPage":2,
  "proxy":{
    "useApifyProxy":true
  }
}

During the Run

During the run, the actor will output messages letting you know what is going on. Each message always contains a short label specifying which page from the provided list is currently specified. When items are loaded from the page, you should see a message about this event with a loaded item count and total item count for each page.

If you provide incorrect input to the actor, it will immediately stop with a failure state and output an explanation of what is wrong.

Lobsters Export

During the run, the actor stores results into a dataset. Each item is a separate item in the dataset.

You can manage the results in any language (Python, PHP, Node JS/NPM). See the FAQ or our API reference to learn more about getting results from this Lobsters actor.

Scraped Lobsters Properties

The structure of each item in Lobsters looks like this:

User Detail

{
	"type": "user",
	"name": "lambda",
	"url": "https://lobste.rs/u/lambda",
	"avatar": "https://lobste.rs/avatars/lambda-100.png",
	"status": "Active user",
	"homepage": "https://maxcountryman.com",
	"github": "https://github.com/maxcountryman",
	"about": "Indie hacker and people-first leader. Building https://remotejobs.com in public on Twitter.",
	"joinedAt": "2013-12-30 09:46:46 -0600",
	"karma": "392",
	"numberOfComments": "10",
	"numberOfStories": "28"
}

Post Detail

{
    "type": "post",
    "id": "kour63",
    "url": "https://lobste.rs/s/kour63/help_test_cargo_s_new_index_protocol",
    "title": "Help test Cargo's new index protocol",
    "link": "https://blog.rust-lang.org/inside-rust/2023/01/30/cargo-sparse-protocol.html",
    "numberOfUpvotes": 13,
    "userName": "icefox",
    "userLink": "https://lobste.rs/u/icefox",
    "domain": "blog.rust-lang.org",
    "date": "2023-03-09 12:24:32 -0600",
    "tags": [
        "devops",
        "rust"
    ],
    "comments": [
        {
            "id": "dudcdn",
            "body": "Rust 1.68.0 has been released so this is now usable in stable Rust too. Still opt-in though. https://blog.rust-lang.org/2023/03/09/Rust-1.68.0.html",
            "numberOfUpvotes": 3,
            "date": "2023-03-09 16:48:32 -0600",
            "userLink": "https://lobste.rs/u/wezm",
            "userName": "wezmlink"
        }
    ]
}

Contact

Please visit us through epctex.com to see all the products that are available for you. If you are looking for any custom integration or so, please reach out to us through the chat box in epctex.com. In need of support? business@epctex.com is at your service.

Instructables Scraper

epctex/instructables-scraper

Retrieve all the information right away from Instructables. Get all the project and user detailed information without any limits or restrictions. Titles, descriptions, images, comments, steps, and detailed instructions about the projects are ready in a structural way. Easy usage, super fast!

epctex

Newsweek Scraper

epctex/newsweek-scraper

Scrape the latest articles right away from Newsweek. Retrieve your news directly with the information of title, body, published time, updated time, keywords, images, videos, and many more! Get your data with JSON, XML, Excel, CSV, or many other options. Extremely fast, optimized, and with no limits!

epctex

Imgur Scraper

epctex/imgur-scraper

Extract millions of posts and memes from Imgur. Crawl and scrape descriptions, number of views, favorites, upvotes, downvotes, score, comment details, post creator, and all other deep-level details. You can specify search terms, filters, tags, list pages, and much more! Extremely fast, no limits!

epctex

Google Jobs Scraper

epctex/google-jobs-scraper

The most comprehensive Google Jobs Scraper ever! Extremely configurable, highly customizable, and blazing fast. Retrieve salary, social links, apply links, and all detailed sections. Both for data retrieval and API Integration. Easy use without any limits!

epctex

549

PaperbackSwap Scraper

epctex/paperbackswap-scraper

Get information on books right away from PaperBack Swap! Get title, prices, rating, ISBN13, ISBN10, publisher, and a lot more are waiting for you. Use any filter, search anything, and retrieve your results without any limits. Get it with JSON, XML, Excel, CSV, or many other options. Super fast!

epctex

💬 Instagram Comments Scraper (No Login)

louisdeconinck/instagram-comments-scraper

Scrape all comments from any Instagram post in seconds. No login required. Get text, likes, timestamps, and usernames — even from large posts. Fast, affordable, and easy to use. Ideal for lead gen, research, and trend spotting. Just paste the URL and go.

Louis Deconinck

SoundCloud Scraper

epctex/soundcloud-scraper

Retrieve information from SoundCloud without any restrictions or rate limits. Extremely fast data harvesting about tracks, comments, user profiles, albums, playlists, and more. Download URLs of tracks, monetization methods, number of likes, number of shares, and many more are ready for you.

epctex

178

Twitter Scraper (Search)

desearch/ai-twitter-search

The Basic X Search API enables users to retrieve relevant links and tweets based on specified search queries without utilizing AI-driven models. It analyzes links from X posts that align with the provided search criteria

Desearch

SoundCloud Artists Scraper

epctex/soundcloud-artists-scraper

Retrieve Artist from SoundCloud without any restrictions or rate limits. Extremely fast data harvesting about users, ids, names, playlist counts, like counts, and all the other related deep-level information. Perfect for Lead Generation!

epctex

Facebook Likes Scraper (Fast & Cheap) 👍 🌟

scrapestorm/facebook-likes-scraper-fast-cheap

Collect Facebook likes data 📊 from posts. Retrieve the URL, name 📝, ID 🆔, and profile photo 📸 of users who liked 👍, with the option to set a maximum number of results to scrape 🔢. Download data in JSON, CSV, or Excel formats 📥 for use in apps, reports, spreadsheets, or analytics 📈.