Wordpress Content Extractor

Pricing

from $3.00 / 1,000 results

๐Ÿ“ Extract complete content from WordPress sites including posts, categories, and metadata. Perfect for content migration, blog aggregation, and CMS integration.

Developer

SimplifySME Toolbox

Maintained by Community

๐Ÿ“ Extract complete content from WordPress sites including posts, categories, and metadata. Perfect for content migration, blog aggregation, and CMS integration.


📺 What It Extracts

  • Site Metadata: Site name, description, and URL
  • Posts: All posts with full content, metadata, and media
  • Categories: All post categories with metadata
  • Statistics: Total post and category counts

🚀 Key Features

| Feature | Description |
|---|---|
| 📝 Complete Content | Extracts all posts with full content and metadata |
| 🏷️ Category Support | Extracts all categories and their relationships |
| 🖼️ Media Extraction | Includes featured images and media URLs |
| 📊 Structured Output | Clean JSON format with nested post and category data |
| ⚡ Fast Performance | Direct API access for quick data retrieval |
| 🔄 Pagination Support | Handles large sites with configurable post limits |

📥 Input

Required

  • siteUrl (string): The WordPress site URL
    • Example: "https://example.com"
    • Works with any WordPress site that has the REST API enabled

Optional

  • maxPosts (integer, default: 100): Maximum number of posts to extract
    • Example: 200
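
As a rough illustration of the two input fields, the sketch below builds and sanity-checks an input object before a run. The helper name `build_input` is hypothetical, and the checks are assumptions about what counts as valid input (an absolute http(s) URL, a positive integer limit, trailing slash stripped). The actor's own validation may differ.

```python
from urllib.parse import urlparse


def build_input(site_url: str, max_posts: int = 100) -> dict:
    """Return an input dict using the documented siteUrl / maxPosts fields.

    The validation rules here are assumptions, not the actor's documented rules.
    """
    cleaned = site_url.strip()
    parsed = urlparse(cleaned)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"siteUrl must be an absolute http(s) URL, got {site_url!r}")
    # bool is a subclass of int, so reject it explicitly.
    if isinstance(max_posts, bool) or not isinstance(max_posts, int) or max_posts < 1:
        raise ValueError("maxPosts must be a positive integer")
    # Dropping the trailing slash is a cosmetic choice for this sketch.
    return {"siteUrl": cleaned.rstrip("/"), "maxPosts": max_posts}
```

Calling `build_input("https://example.com", 200)` yields the same object shown under "With Post Limit" in Example Usage.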

📤 Output

Returns comprehensive WordPress content data:

Site Metadata

{
  "site": {
    "name": "Site Name",
    "description": "Site description...",
    "url": "https://example.com"
  }
}

Posts Array

{
  "posts": [
    {
      "id": 123,
      "title": "Post Title",
      "content": "Post content...",
      "excerpt": "Post excerpt...",
      "author": "Author Name",
      "date": "2024-01-01T00:00:00Z",
      "modified": "2024-01-15T00:00:00Z",
      "slug": "post-slug",
      "status": "publish",
      "link": "https://example.com/post-slug",
      "categories": [1, 2],
      "tags": [3, 4],
      "featuredMedia": 456,
      "featuredImageUrl": "https://example.com/image.jpg"
    }
  ],
  "totalPosts": 100
}

Categories Array

{
  "categories": [
    {
      "id": 1,
      "name": "Category Name",
      "slug": "category-slug",
      "description": "Category description...",
      "count": 25
    }
  ],
  "totalCategories": 10
}
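
Posts reference categories by numeric ID, so a consumer usually needs to join the two arrays. The sketch below shows one way to do that. It assumes the `posts` and `categories` fragments shown above arrive together in a single result object, and the `categoryNames` field it adds is hypothetical, not part of the actor's output.

```python
def attach_category_names(result: dict) -> list[dict]:
    """Return copies of the posts with a resolved list of category names.

    Assumes `result` holds both the "posts" and "categories" arrays.
    """
    # Build an id -> name lookup once, then resolve each post's IDs.
    names_by_id = {cat["id"]: cat["name"] for cat in result.get("categories", [])}
    enriched = []
    for post in result.get("posts", []):
        post_copy = dict(post)
        post_copy["categoryNames"] = [
            names_by_id[cat_id]
            for cat_id in post.get("categories", [])
            if cat_id in names_by_id  # skip IDs with no matching category
        ]
        enriched.append(post_copy)
    return enriched
```

Unknown category IDs are skipped rather than raising, which keeps the join tolerant of partial extractions.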

💡 Use Cases

  • ✅ Blog Aggregation - Collect content from multiple WordPress sites
  • ✅ Content Migration - Extract content for platform migration
  • ✅ Content Research - Analyze blog content and topics
  • ✅ CMS Integration - Import WordPress content into other systems
  • ✅ Content Analysis - Study content patterns and categories
  • ✅ Backup & Archive - Create backups of WordPress content

โš™๏ธ Technical Details

  • Extraction Method: Direct API access to WordPress REST API endpoints
  • REST API: Uses WordPress REST API (usually available at /wp-json/wp/v2/)
  • Pagination: Handles pagination for large sites with configurable limits
  • Error Handling: Validates responses and handles missing data gracefully
  • Performance: Fast API-based extraction without browser overhead
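
To make the pagination point concrete, here is a minimal sketch of paging through the WordPress REST API posts endpoint until a post limit is reached. It is not the actor's implementation. It relies on standard WordPress REST API behavior (the `page` and `per_page` query parameters, a `per_page` cap of 100, and the `X-WP-TotalPages` response header). The function names and the injectable `fetch` parameter are assumptions made for illustration and offline testing.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def fetch_page(api_base: str, page: int, per_page: int):
    """Fetch one page of posts; returns (posts, total_pages)."""
    query = urlencode({"page": page, "per_page": per_page})
    with urlopen(f"{api_base}/posts?{query}", timeout=30) as resp:
        # WordPress reports the page count in a response header.
        total_pages = int(resp.headers.get("X-WP-TotalPages", "1"))
        return json.load(resp), total_pages


def collect_posts(api_base: str, max_posts: int = 100, fetch=fetch_page) -> list:
    """Collect up to max_posts posts, one page at a time."""
    posts, page = [], 1
    per_page = min(100, max_posts)  # WordPress caps per_page at 100
    while len(posts) < max_posts:
        batch, total_pages = fetch(api_base, page, per_page)
        posts.extend(batch)
        if not batch or page >= total_pages:
            break  # no more pages to read
        page += 1
    return posts[:max_posts]
```

A call such as `collect_posts("https://example.com/wp-json/wp/v2", max_posts=200)` would issue two requests of 100 posts each against a site with at least 200 published posts.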

๐Ÿ“ Example Usage

Basic Extraction

{
  "siteUrl": "https://example.com"
}

With Post Limit

{
  "siteUrl": "https://example.com",
  "maxPosts": 200
}
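
The input objects above can also be sent programmatically. The sketch below builds a request for the Apify API v2 `run-sync-get-dataset-items` endpoint using only the standard library. The `ACTOR_ID` value is a hypothetical placeholder (this page does not state the actor's ID), and the token must be your own Apify API token.

```python
import json
from urllib.parse import quote
from urllib.request import Request, urlopen

# Hypothetical placeholder: replace with the actor's real ID from the Apify Store.
ACTOR_ID = "username~wordpress-content-extractor"


def build_run_request(actor_id: str, token: str, run_input: dict) -> Request:
    """Build a POST request that runs an actor and returns its dataset items."""
    url = (
        f"https://api.apify.com/v2/acts/{quote(actor_id, safe='~')}"
        "/run-sync-get-dataset-items"
    )
    return Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(actor_id: str, token: str, run_input: dict):
    """Send the request and parse the JSON response (needs network access)."""
    with urlopen(build_run_request(actor_id, token, run_input), timeout=300) as resp:
        return json.load(resp)
```

For example, `run_actor(ACTOR_ID, token, {"siteUrl": "https://example.com", "maxPosts": 200})` would submit the "With Post Limit" input. The synchronous endpoint suits short runs; long extractions may be better served by starting a run and polling for the result.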

โš ๏ธ Important Notes

  • This actor uses WordPress REST API endpoints
  • The REST API must be enabled on the WordPress site (usually enabled by default)
  • Some WordPress sites may have REST API disabled or restricted
  • Featured images are extracted when available
  • Categories and tags are included with their IDs and metadata
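
Before starting a run, it can help to confirm that a site exposes the REST API. The sketch below fetches the API index at `/wp-json/` and checks whether the `wp/v2` namespace is listed, which is how WordPress advertises its core endpoints. The function names are hypothetical. Sites that use a non-default REST prefix or plain permalinks may need a different index URL, which this sketch does not handle.

```python
import json
from urllib.request import urlopen


def supports_wp_v2(index) -> bool:
    """True if a parsed REST API index lists the wp/v2 namespace."""
    return isinstance(index, dict) and "wp/v2" in index.get("namespaces", [])


def check_site(site_url: str) -> bool:
    """Fetch the REST index for site_url and report whether wp/v2 is exposed."""
    try:
        with urlopen(site_url.rstrip("/") + "/wp-json/", timeout=15) as resp:
            return supports_wp_v2(json.load(resp))
    except (OSError, ValueError):
        # Network and HTTP errors are OSError subclasses; invalid JSON raises ValueError.
        return False
```

A `False` result only means the default index was unreachable or lacked `wp/v2`; it does not prove the API is disabled.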