Tumblr Media Scraper

Pricing

$9.00/month + usage

Extract complete Tumblr posts with all media types—images, videos, audio, GIFs—plus rich metadata like tags, uploader info, and engagement stats. Get direct CDN download links and comprehensive post data in clean JSON format for archiving, analysis, or content repurposing.

Rating: 0.0 (0)

Developer: CodeNest (Maintained by Community)

Actor stats: 0 bookmarked, 2 total users, 1 monthly active user, last modified 3 days ago

Tumblr Media Scraper - Ultimate Tumblr Content Extraction Tool

Download media from public Tumblr posts with the Tumblr Media Scraper! This Apify actor batch-extracts images, videos, audio, and GIFs from Tumblr posts while preserving media quality, comprehensive metadata, and direct CDN links.


📋 Overview

Need to archive Tumblr content or repurpose creative assets? Our Tumblr Media Scraper delivers everything you need:

  • 🖼️ Multi-format extraction: Images, videos, audio, and GIFs in original quality
  • 📊 Rich metadata: Titles, descriptions, uploaders, timestamps, tags, and engagement metrics
  • 🔗 Direct download links: CDN URLs for reliable media access
  • 📦 Batch processing: Handle multiple Tumblr URLs in a single run
  • 🏷️ Tag extraction: Capture all post tags for content categorization

Perfect for digital archivists 📚, content researchers 🔍, social media managers 📈, and creative professionals 🎨!


✨ Key Features

🎯 Media Extraction

  • Universal Support: Images (JPG, PNG, GIF), videos (MP4), and audio (MP3)
  • Original Quality: Access media in highest available resolution
  • GIF Preservation: Animated GIFs maintain their format
  • Multiple Assets: Extract all media from posts with multiple images/videos

📊 Metadata Mastery

  • Creator Profiles: Uploader names and unique IDs
  • Timestamps: Upload dates in YYYYMMDD format, plus a Unix timestamp
  • Engagement Data: Like counts, note counts, and interaction metrics
  • Content Tags: Complete tag lists for content categorization
  • Open Graph Data: OG type and site information

🔧 Advanced Features

  • Batch Processing: Process 100 or more Tumblr URLs per run
  • Proxy Support: Bypass regional restrictions
  • Apify Storage: Optional cloud storage for media links
  • Multi-format Output: Structured JSON with comprehensive data

⚙️ Input Configuration

Enter your Tumblr post URLs in the Input section, click the "Start" button, and wait for the run to finish. The Tumblr Media Scraper accepts input in this format:

{
  "urls": [
    {
      "url": "https://www.tumblr.com/audiojunkyard/809712185709346816/mitski-in-a-lake?source=share"
    },
    {
      "url": "https://www.tumblr.com/haigiaa/809465352370192384/i-really-thought-about-putting-narinder-as-the?source=share"
    },
    {
      "url": "https://www.tumblr.com/ketchp-rat/809737514346774529/a-dtiys-that-i-did-a-while-back-but-im-still-semi?source=share"
    },
    {
      "url": "https://www.tumblr.com/pururin/808556146977816576?source=share"
    }
  ]
}

📝 Input Specifications

Parameter | Type   | Required | Description
urls      | Array  | Yes      | Array of Tumblr post URLs to process
url       | String | Yes      | Valid Tumblr post URL (any format)
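
As a rough sketch, the input object described above could be assembled from a plain list of links as follows. The helper name `build_input` and its host check are illustrative assumptions, not part of the actor; only the `urls`/`url` shape comes from the specification above.

```python
import json
from urllib.parse import urlsplit


def build_input(links):
    """Wrap Tumblr post links in the {"urls": [{"url": ...}]} shape.

    Hypothetical helper: the host check below is an assumption about what
    counts as a Tumblr URL, not a rule taken from the actor itself.
    """
    seen = set()
    entries = []
    for link in links:
        link = link.strip()
        host = urlsplit(link).hostname or ""
        if host != "tumblr.com" and not host.endswith(".tumblr.com"):
            raise ValueError(f"Not a Tumblr URL: {link!r}")
        if link not in seen:  # drop exact duplicates, keep first-seen order
            seen.add(link)
            entries.append({"url": link})
    return {"urls": entries}


if __name__ == "__main__":
    payload = build_input([
        "https://www.tumblr.com/pururin/808556146977816576?source=share",
    ])
    print(json.dumps(payload, indent=2))
```

The printed JSON can be pasted into the actor's input editor as-is.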

📤 Output Structure

The Tumblr Media Scraper produces comprehensive JSON output like this:

[
  {
    "platform": "tumblr",
    "input_url": "https://www.tumblr.com/audiojunkyard/809712185709346816/mitski-in-a-lake?source=share",
    "title": "Mitski - In a Lake",
    "description": "💬 0  🔁 57  ❤️ 44 · Mitski - In a Lake",
    "uploader": "Mitski",
    "uploader_id": "mitski",
    "upload_date": "20260116",
    "timestamp": 1768568481,
    "page_url": "https://www.tumblr.com/audiojunkyard/809712185709346816/mitski-in-a-lake",
    "thumbnail": "https://f4.bcbits.com/img/a2846043416_5.jpg",
    "duration": 184.56,
    "tags": [
      "alternative",
      "New York"
    ],
    "og_type": "music",
    "site_name": "Tumblr",
    "download_links": [
      {
        "type": "image",
        "url": "https://64.media.tumblr.com/bfd23551e8bd824c18c87a7aab2cc26c/99c6d943386b5079-e5/s2048x3072/a87a7eae65ac710a1ceb7ef6d0b961f3c37845ec.jpg"
      },
      {
        "type": "audio",
        "url": "https://bandcamp.com/stream_redirect?enc=mp3-128&track_id=329737679&ts=1772185851&t=d1c13bcf2a19c3c85adf8e55bcb990c02ad7f895",
        "ext": "mp3",
        "format_id": "mp3-128",
        "resolution": "audio only",
        "acodec": "mp3"
      }
    ]
  }
]
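
One way to consume this output is to group the download links by media type. The sketch below is a minimal illustration: `links_by_type` is a hypothetical helper, and the embedded sample is a trimmed copy of the output above with shortened placeholder URLs.

```python
import json
from collections import defaultdict


def links_by_type(items):
    """Map each media type to the list of URLs found across all items."""
    grouped = defaultdict(list)
    for item in items:
        for link in item.get("download_links", []):
            # Fall back to "unknown" if a link has no "type" field.
            grouped[link.get("type", "unknown")].append(link["url"])
    return dict(grouped)


# Trimmed sample; these URLs are placeholders, not real media links.
sample = json.loads("""
[{"platform": "tumblr",
  "download_links": [
    {"type": "image", "url": "https://64.media.tumblr.com/example.jpg"},
    {"type": "audio", "url": "https://bandcamp.com/stream_redirect?enc=mp3-128", "ext": "mp3"}
  ]}]
""")

if __name__ == "__main__":
    for media_type, urls in links_by_type(sample).items():
        print(media_type, len(urls))
```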

📊 Output Field Documentation

📌 Post Metadata Section

Field       | Description
platform    | Always "tumblr" for easy filtering
input_url   | Original URL you provided
title       | Post title or generated title
description | Post description with engagement stats
uploader    | Content creator's display name
uploader_id | Creator's unique Tumblr handle
upload_date | Publication date in YYYYMMDD format
timestamp   | Unix timestamp of upload
page_url    | Clean Tumblr post URL
thumbnail   | Post thumbnail/preview image URL
duration    | Media duration (for audio/video)
tags        | Complete array of post tags
like_count  | Number of likes (when available)
og_type     | Open Graph content type
site_name   | Always "Tumblr"
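
The date and engagement fields can be turned into native values with a few lines of Python. This is a sketch under assumptions: the function names are hypothetical, and reading the three counters in `description` as comments, reblogs, and likes is a guess based on the emoji in the sample output, not documented behavior.

```python
import re
from datetime import date, datetime, timezone

# Assumed layout of the counters in `description`: comment, reblog, like.
# The variation selector after the heart (U+FE0F) is treated as optional.
_STATS = re.compile(
    "\U0001F4AC\\s*(\\d+)\\s+\U0001F501\\s*(\\d+)\\s+\u2764\ufe0f?\\s*(\\d+)"
)


def parse_upload_date(value):
    """Convert a YYYYMMDD string such as "20260116" to a date."""
    return datetime.strptime(value, "%Y%m%d").date()


def parse_timestamp(value):
    """Convert a Unix timestamp (seconds) to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(value, tz=timezone.utc)


def parse_engagement(description):
    """Return the three counters as a dict, or None if the pattern is absent."""
    match = _STATS.search(description)
    if match is None:
        return None
    comments, reblogs, likes = (int(n) for n in match.groups())
    return {"comments": comments, "reblogs": reblogs, "likes": likes}


if __name__ == "__main__":
    print(parse_upload_date("20260116"))
    print(parse_timestamp(1768568481).isoformat())
```

Returning `None` when the counters are missing keeps the helper usable on posts whose description has a different layout.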

🎬 Media Assets Section

Field          | Description
download_links | Array of all media URLs in post
type           | Media type: image, video, or audio
url            | Direct CDN download URL
ext            | File extension (jpg, png, gif, mp4, mp3)
format_id      | Format identifier for audio/video
resolution     | Video resolution when applicable
vcodec         | Video codec specification
acodec         | Audio codec specification
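
When saving media locally, the `url` and `ext` fields are enough to pick a file name. The helper below is a hypothetical sketch, not something the actor provides; it prefers the `ext` field and falls back to the extension in the URL path.

```python
import posixpath
from urllib.parse import urlsplit


def suggest_filename(link, index=0):
    """Suggest a local file name for one entry of `download_links`."""
    path = urlsplit(link["url"]).path
    # Use the last path segment, or a numbered placeholder if it is empty.
    name = posixpath.basename(path) or f"media_{index}"
    stem, dot_ext = posixpath.splitext(name)
    ext = link.get("ext") or dot_ext.lstrip(".")
    return f"{stem}.{ext}" if ext else stem


if __name__ == "__main__":
    audio = {
        "type": "audio",
        "url": "https://bandcamp.com/stream_redirect?enc=mp3-128",
        "ext": "mp3",
    }
    print(suggest_filename(audio))
```

Query strings are ignored on purpose, so redirect-style links such as the Bandcamp stream still get a usable name.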

🎨 Media Types Supported

The Tumblr Media Scraper automatically detects and extracts all media types:

  1. 🖼️ Images - JPG, PNG, GIF (including animated GIFs)
  2. 🎥 Videos - MP4 in various resolutions
  3. 🎵 Audio - MP3 streams with format details
  4. 📁 Multi-asset posts - Complete extraction of all media in a single post

🔧 Technical Features

📡 Advanced Extraction

  • Direct CDN Access: Bypass Tumblr interface for reliable downloads
  • Multi-format Detection: Automatic identification of all media types
  • Bandcamp Integration: Extract audio from Bandcamp links within Tumblr
  • Resolution Options: Access to multiple quality tiers when available

📊 Comprehensive Data Collection

  • Engagement Metrics: Like counts, reblog counts, and note statistics
  • Tag Taxonomy: Complete tag lists for content organization
  • Duration Precision: Fractional-second media durations (for example, 184.56 seconds)
  • Creator Attribution: Full uploader details and IDs

🛡️ Reliability Features

  • URL Normalization: Automatic cleaning and validation of Tumblr URLs
  • Error Handling: Graceful fallbacks for unavailable content
  • Rate Limit Management: Smart throttling to avoid blocks
  • Proxy Compatibility: Works with residential proxy rotation
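
The sample output suggests what URL normalization looks like in practice: `input_url` carries a `?source=share` query string, while `page_url` does not. A minimal sketch of that kind of cleanup follows; `clean_post_url` is a hypothetical name, and the actor's real normalization rules are not documented here and may do more.

```python
from urllib.parse import urlsplit, urlunsplit


def clean_post_url(url):
    """Drop the query string, fragment, and trailing slash from a post URL."""
    parts = urlsplit(url.strip())
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path.rstrip("/"), "", "")
    )


if __name__ == "__main__":
    print(clean_post_url(
        "https://www.tumblr.com/pururin/808556146977816576?source=share"
    ))
```

Cleaning links this way before a run also makes it easier to spot duplicates that differ only in tracking parameters.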

💡 Use Cases

  • 📚 Digital Archivists – Preserve Tumblr content before it's deleted
  • 🔍 Content Researchers – Study viral patterns and tag ecosystems
  • 📱 App Developers – Build Tumblr-powered applications
  • 🎨 Creative Professionals – Source inspiration and assets
  • 📊 Social Media Managers – Archive client content and track engagement
  • 🏷️ Tag Analysts – Study content categorization and trends

✅ Why Choose Our Tumblr Media Scraper?

  • 🎯 Purpose-built: Specifically optimized for Tumblr's unique structure
  • ⚡ Fast Performance: Parallel processing for multiple URLs
  • 🔄 Regular Updates: Maintained to ensure compatibility with Tumblr changes
  • 📦 Complete Data: Get all metadata and media in one structured output
  • 🆓 Easy to Use: Simple input format, comprehensive output
  • 🔧 Developer Friendly: Clean JSON structure for easy integration

⚠️ Limitations

  • Only works with public Tumblr posts (no private blogs)
  • Some media may have regional restrictions
  • Bandcamp audio requires Bandcamp availability
  • The Tumblr Media Scraper may hit rate limits with excessive requests

📧 Need Customization?

Want higher-resolution extraction, batch processing enhancements, or custom metadata fields for your Tumblr Media Scraper?

✉ Email codenest2.0@gmail.com for tailored solutions!