Lemmy Scraper - Federated Reddit Alternative
Pricing
from $0.30 / 1,000 results
Scrape posts and comments from any Lemmy instance (the open, federated Reddit alternative). Filter by community, search keyword, or pull instance-wide feeds. No login required. Built for AI training datasets, fediverse research, and community monitoring.
Developer
NIJ KANANI
Maintained by Community
Last modified
12 days ago
Lemmy Scraper
Scrape posts and comments from any Lemmy instance, the federated, open-source Reddit alternative. No login. No rate-limit nightmares. Works with lemmy.world, lemmy.ml, beehaw.org, sh.itjust.works, and any other instance.
Built for AI/LLM training datasets, fediverse research, brand monitoring on emerging platforms, and analysis of Reddit alternatives.
What you can do
- Community posts: pull all posts from one or many communities
- Search: keyword search across an instance
- Instance feed: top/hot/new across the whole instance
- Optional comment trees: flattened, with paths for tree reconstruction
- Sort options: Hot, Active, New, Top (multiple ranges), MostComments
- Cross-instance federation aware (`asklemmy@lemmy.ml`)
Quick start

    {
      "instance": "lemmy.world",
      "mode": "community",
      "communities": ["technology@lemmy.world", "asklemmy@lemmy.ml"],
      "sort": "Top",
      "topRange": "TopWeek",
      "maxItems": 200
    }

Input
| Field | Description |
|---|---|
| `instance` | Hostname (e.g. `lemmy.world`) |
| `mode` | `community` / `search` / `instance` |
| `communities` | Names like `tech` or `tech@lemmy.world` |
| `searchQueries` | Keywords |
| `sort` | Hot, Active, New, Top, MostComments, NewComments |
| `topRange` | When `sort` = Top: TopHour … TopAll |
| `maxItems` | Cap per target |
| `includeComments` | Fetch comment trees |
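
The two accepted forms of a `communities` entry could be normalized along these lines. This is a minimal sketch, not the actor's actual code: `split_community` is a hypothetical helper, and it assumes a bare name such as `tech` resolves against the `instance` field.

```python
def split_community(name: str, default_instance: str) -> tuple[str, str]:
    """Split 'tech' or 'tech@lemmy.world' into (community, host).

    Assumption: a bare name belongs to the instance given in the input.
    """
    community, sep, host = name.strip().partition("@")
    if not community:
        raise ValueError(f"empty community name in {name!r}")
    # Fall back to the input instance when no '@host' suffix is present
    return community, (host if sep and host else default_instance)
```

For example, `split_community("asklemmy@lemmy.ml", "lemmy.world")` yields the community name and the remote host separately, which is convenient when grouping results by home instance.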
Output (per post)

    {
      "instance": "lemmy.world",
      "community": "technology",
      "title": "Some headline",
      "body": "Body text or empty",
      "creator": "username",
      "creatorActor": "https://lemmy.world/u/username",
      "score": 123,
      "upvotes": 130,
      "downvotes": 7,
      "comments": 42,
      "publishedAt": "2026-04-15T...",
      "url": "https://example.com/article",
      "thumbnailUrl": "https://...",
      "nsfw": false,
      "apId": "https://lemmy.world/post/123456",
      "postUrl": "https://lemmy.world/post/123456",
      "commentsList": [
        {
          "id": 9999,
          "creator": "commenter",
          "content": "Reply text",
          "score": 12,
          "publishedAt": "...",
          "path": "0.123.456"
        }
      ]
    }

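
Because `commentsList` is flat, a consumer has to rebuild the nesting from each comment's `path`. The sketch below shows one way to do that. It assumes a comment's parent has the same path minus its last dot-separated segment, and it treats a comment whose parent is missing from the list (for example, because of the per-post cap) as top-level. `build_comment_tree` and the `replies` key are hypothetical names, not part of the actor's output.

```python
from typing import Any


def build_comment_tree(comments: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Nest flattened comments using their dot-separated `path` values.

    Assumption: the parent's path is the child's path without its last segment.
    """
    # Copy each comment and give it an empty list for its children
    nodes = {c["path"]: {**c, "replies": []} for c in comments}
    roots = []
    # Visit shallower paths first; the sort is stable, so input order
    # is preserved among comments at the same depth
    for path in sorted(nodes, key=lambda p: p.count(".")):
        parent = nodes.get(path.rpartition(".")[0])
        if parent is not None:
            parent["replies"].append(nodes[path])
        else:
            roots.append(nodes[path])  # no known parent: treat as top-level
    return roots
```

Keying on `path` rather than `id` keeps the sketch independent of how comment IDs relate to path segments.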
Use cases
| Who | Why |
|---|---|
| AI/LLM teams | Reddit-style training data without Reddit's API gate |
| Researchers | Federation studies, online community migration patterns |
| Marketers | Track brand mentions on emerging platforms |
| Journalists | Source mining on Reddit-alternative communities |
Tech notes
- Uses Lemmy's official `/api/v3` REST endpoints: fully open, no key required
- Federation-aware: `community@instance` syntax works for any cross-instance pull
- Pagination via the `page` parameter; auto-stops when no new posts are returned
- Comment trees are fetched separately and capped per post for performance
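
The pagination rule in the notes above (advance `page` until a page contributes nothing new) can be sketched as follows. This is an illustration under stated assumptions, not the actor's implementation: `post_list_url` and `iter_posts` are hypothetical helpers, the `/api/v3/post/list` path and its `community_name`, `sort`, and `page` query parameters are assumed from Lemmy's v3 API, and posts are deduplicated by the `apId` field from the output schema. The page fetcher is passed in as a callable so the loop can be exercised without network access.

```python
from typing import Callable, Iterator
from urllib.parse import urlencode


def post_list_url(instance: str, community: str, sort: str, page: int) -> str:
    """Build a post-list URL (endpoint and parameter names are assumptions)."""
    query = urlencode({"community_name": community, "sort": sort, "page": page})
    return f"https://{instance}/api/v3/post/list?{query}"


def iter_posts(
    fetch_page: Callable[[int], list[dict]], max_items: int
) -> Iterator[dict]:
    """Yield unique posts until a page adds nothing new or the cap is reached."""
    seen: set[str] = set()
    page = 1
    while len(seen) < max_items:
        added = 0
        for post in fetch_page(page):
            if post["apId"] in seen:
                continue  # already returned on an earlier page
            seen.add(post["apId"])
            added += 1
            yield post
            if len(seen) >= max_items:
                return
        if added == 0:
            return  # the page held no new posts: stop paginating
        page += 1
```

Stopping on "no new posts" rather than on an empty page guards against feeds that keep repeating the last page of results.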
