Medium Articles Scraper
Pricing
from $4.99 / 1,000 results
Medium Articles Scraper
Scrape Medium articles by search query or topic/tag. Extracts title, author, publication, claps, responses, read time, tags, and more.
Pricing
from $4.99 / 1,000 results
Rating
0.0
(0)
Developer
codingfrontend
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
3 days ago
Last modified
Categories
Share
Scrape articles from Medium by tag/topic or keyword. Uses Medium's public RSS feeds for reliable data extraction, bypassing Cloudflare bot protection.
Features
- Tag/Topic mode — browse latest articles for any Medium tag
- Search mode — converts keyword to multiple related tag feeds for broader coverage
- Rich article data — title, author, publication, tags, thumbnail, published date
- Pagination — automatically fetches multiple feeds to reach
maxItems - Deduplication — skips duplicate articles across feeds
- No browser required — uses RSS feeds (HTTP-based, fast and reliable)
Note: Medium's website is behind Cloudflare Managed Challenge which cannot be bypassed with standard tools. This scraper uses Medium's public RSS feeds instead, which are fully accessible. Fields that require JS rendering (claps, responses, read time) are not available from RSS feeds.
Input Parameters
| Field | Type | Default | Description |
|---|---|---|---|
query | string | (required) | Search keyword or Medium tag slug |
mode | string | "search" | search (multi-tag) or tag (single tag) |
maxItems | integer | 50 | Maximum articles to scrape |
proxyConfiguration | object | — | Optional proxy settings |
Example INPUT.json
{"query": "artificial intelligence","mode": "search","maxItems": 50}
Output Fields
| Field | Type | Description |
|---|---|---|
title | string | Article title |
subtitle | string | Article snippet/subtitle |
url | string | Article URL (clean, no tracking) |
articleId | string | Medium article ID |
author | string | Author name |
authorUrl | string | Author profile URL |
publication | string | Publication name (if applicable) |
publicationUrl | string | Publication URL |
publishedAt | string | ISO 8601 publication date |
updatedAt | string | ISO 8601 update date |
thumbnail | string | Article cover image URL |
tags | array | Article category tags |
claps | null | Not available via RSS (requires browser) |
responses | null | Not available via RSS (requires browser) |
readTime | null | Not available via RSS (requires browser) |
searchQuery | string | Query used for this article |
scrapedAt | string | ISO 8601 scrape timestamp |
Known Limitations
- Claps, responses, read time: Not available via RSS feeds. Medium's website search and tag pages are behind Cloudflare Managed Challenge. A future version with CAPTCHA solving service integration could unlock these fields.
- RSS pagination: Each RSS feed returns ~10 latest articles. For search mode, multiple related tag feeds are queried.
- Sorting: RSS feeds return latest articles only (no "trending" or "top" sorting).