Tumblr Scraper

Pricing

from $10.00 / 1,000 results

Developer

Jamshaid Arif

Maintained by Community

Last modified: 15 days ago

Tumblr Scraper — Blog Posts, Images & Tags

Scrape public Tumblr blog posts via RSS. No login, no API key, no browser — fast and lightweight.

Features

  • Multiple blogs — scrape any number of public Tumblr blogs in a single run
  • Custom domains — supports both blogname.tumblr.com and custom domains
  • Image extraction — all image URLs with alt text and dimensions
  • Video detection — finds video sources and poster images
  • Tag extraction — post tags/categories from RSS
  • Post type detection — auto-classifies: text, photo, photoset, video, audio, quote, link, embed
  • Reblog detection — identifies reblogs and their source blogs
  • Outbound links — extracts all external links from posts
  • No browser — pure HTTP requests, runs 10x faster than browser-based scrapers
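The RSS-based approach behind the features above can be illustrated with a short sketch. This is not the actor's actual code. It assumes a blog's feed follows the usual RSS 2.0 layout (item, title, link, pubDate, category, description), and the names parse_feed, IMG_SRC and SAMPLE are hypothetical. The classification rule is deliberately crude and covers only two of the post types listed. The function works on feed text that has already been downloaded, so it runs offline.

```python
import re
import xml.etree.ElementTree as ET

# Matches the src attribute of <img> tags inside the item description HTML.
IMG_SRC = re.compile(r'<img[^>]+src="([^"]+)"', re.IGNORECASE)


def parse_feed(xml_text):
    """Turn an RSS 2.0 document into a list of simple post dicts."""
    root = ET.fromstring(xml_text)
    posts = []
    for item in root.iter("item"):
        description = item.findtext("description", default="")
        images = IMG_SRC.findall(description)
        posts.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "date": item.findtext("pubDate", default=""),
            "tags": [c.text for c in item.findall("category") if c.text],
            "images": [{"url": url} for url in images],
            # Assumption: any image makes it a photo post, otherwise text.
            "post_type": "photo" if images else "text",
        })
    return posts


# Hypothetical feed content used only to exercise the parser.
SAMPLE = """<rss version="2.0"><channel><title>Example</title>
<item>
  <title>Hello</title>
  <link>https://example.tumblr.com/post/1</link>
  <pubDate>Mon, 30 Mar 2026 16:45:43 -0400</pubDate>
  <category>updates</category>
  <description>&lt;p&gt;Hi&lt;/p&gt;&lt;img src="https://example.com/a.png"&gt;</description>
</item>
</channel></rss>"""

posts = parse_feed(SAMPLE)
```

In a real run the feed text would come from a plain HTTP GET of the blog's RSS URL, which is what makes a browser unnecessary.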

Input

| Field | Type | Default | Description |
|---|---|---|---|
| blogNames | string[] | ["staff"] | Blog names (without .tumblr.com) |
| maxPostsPerBlog | integer | 20 | Max posts per blog (RSS max ~20) |
| extractImages | boolean | true | Include image/video URLs |
| extractFullHtml | boolean | false | Include raw HTML content |
| extractTags | boolean | true | Include post tags |
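It can help to tidy the input before starting a run. The sketch below applies the defaults from the table, strips a trailing .tumblr.com from blog names, removes duplicates, and caps maxPostsPerBlog at the approximate RSS limit. The helper normalize_input and the constant RSS_POST_LIMIT are hypothetical; whether the actor performs any of this itself is not documented here.

```python
RSS_POST_LIMIT = 20  # approximate cap taken from the input table


def normalize_input(raw):
    """Apply the documented defaults and tidy up blog names."""
    names = []
    for name in raw.get("blogNames", ["staff"]):
        name = name.strip().lower()
        # The field expects bare names, so drop a trailing .tumblr.com.
        if name.endswith(".tumblr.com"):
            name = name[: -len(".tumblr.com")]
        if name and name not in names:
            names.append(name)
    max_posts = int(raw.get("maxPostsPerBlog", 20))
    return {
        "blogNames": names,
        # Keep the value between 1 and the assumed RSS limit.
        "maxPostsPerBlog": max(1, min(max_posts, RSS_POST_LIMIT)),
        "extractImages": bool(raw.get("extractImages", True)),
        "extractFullHtml": bool(raw.get("extractFullHtml", False)),
        "extractTags": bool(raw.get("extractTags", True)),
    }


config = normalize_input({
    "blogNames": ["Staff.tumblr.com", "staff", "engineering"],
    "maxPostsPerBlog": 50,
})
```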

Example Input

{
    "blogNames": ["staff", "engineering", "photojojo"],
    "maxPostsPerBlog": 15,
    "extractImages": true,
    "extractTags": true
}

Output

{
    "blog": "staff",
    "blog_url": "https://staff.tumblr.com",
    "blog_title": "Tumblr Staff",
    "post_id": "812545258409934848",
    "title": "We meant it. Let's work together",
    "text": "This place matters, and our goal is to keep Tumblr thriving...",
    "link": "https://staff.tumblr.com/post/812545258409934848",
    "date": "Mon, 30 Mar 2026 16:45:43 -0400",
    "post_type": "text",
    "images": [
        { "url": "https://64.media.tumblr.com/...", "alt": "" }
    ],
    "image_count": 3,
    "tags": ["tumblr", "updates"],
    "is_reblog": false,
    "outbound_links": [
        { "url": "https://example.com", "text": "read more" }
    ]
}
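The date field in the sample is a string in the email-style format used by RSS, so it usually needs parsing before posts can be sorted or compared. The sketch below drops reblogs via is_reblog and attaches a timezone-aware datetime using the standard library. The helper original_posts, the added published key, and the sample records are hypothetical illustrations, not part of the actor's output.

```python
from email.utils import parsedate_to_datetime


def original_posts(items):
    """Keep non-reblog items and attach a parsed, timezone-aware datetime."""
    kept = []
    for item in items:
        if item.get("is_reblog"):
            continue
        # The date string carries its own UTC offset, e.g. -0400.
        parsed = parsedate_to_datetime(item["date"])
        kept.append({**item, "published": parsed})
    # Newest posts first.
    return sorted(kept, key=lambda it: it["published"], reverse=True)


# Hypothetical records shaped like the output sample.
sample = [
    {"post_id": "1", "date": "Mon, 30 Mar 2026 16:45:43 -0400", "is_reblog": False},
    {"post_id": "2", "date": "Tue, 31 Mar 2026 09:00:00 -0400", "is_reblog": True},
    {"post_id": "3", "date": "Wed, 01 Apr 2026 08:30:00 -0400", "is_reblog": False},
]
result = original_posts(sample)
```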

Run via API

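from apify_client import ApifyClient

# Authenticate with your Apify API token.
client = ApifyClient("YOUR_API_TOKEN")

# Start the actor and wait for the run to finish.
run = client.actor("your-username/tumblr-scraper").call(run_input={
    "blogNames": ["staff", "engineering"],
    "maxPostsPerBlog": 20,
})

# Fetch the scraped posts from the run's default dataset.
items = client.dataset(run["defaultDatasetId"]).list_items().items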
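Once the items list is available, one possible next step is flattening it for a spreadsheet. The sketch below is an assumption about how that might be done, using only the standard library. The helper items_to_csv and its column selection are hypothetical, and list fields are joined with semicolons so that each post stays on one row.

```python
import csv
import io


def items_to_csv(items):
    """Flatten scraped items into CSV text, one row per post."""
    columns = ["blog", "post_id", "post_type", "link", "tags", "image_urls"]
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, lineterminator="\n")
    writer.writeheader()
    for item in items:
        writer.writerow({
            "blog": item.get("blog", ""),
            "post_id": item.get("post_id", ""),
            "post_type": item.get("post_type", ""),
            "link": item.get("link", ""),
            # Lists are joined with ";" so each post stays on one row.
            "tags": ";".join(item.get("tags", [])),
            "image_urls": ";".join(img["url"] for img in item.get("images", [])),
        })
    return buffer.getvalue()


# Hypothetical item shaped like the output sample.
csv_text = items_to_csv([{
    "blog": "staff",
    "post_id": "812545258409934848",
    "post_type": "text",
    "link": "https://staff.tumblr.com/post/812545258409934848",
    "tags": ["tumblr", "updates"],
    "images": [],
}])
```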