Search Threads By Keywords avatar
Search Threads By Keywords

Pricing

$8.00 / 1,000 results

Go to Apify Store
Search Threads By Keywords

Search Threads By Keywords

Developed by

Giichi Arai

Giichi Arai

Maintained by Community

Fast and reliable scraper for searching posts on Threads by Meta. Perfect for social media monitoring, research, and business intelligence.

0.0 (0)

Pricing

$8.00 / 1,000 results

0

1

1

Last modified

8 minutes ago

Fast and reliable scraper for searching posts on Threads by Meta. Supports multiple output formats (JSON, CSV, Excel), advanced filtering options, and real-time data extraction. Perfect for social media monitoring, research, and business intelligence.

Features

  • ๐Ÿ” Keyword-based Search: Search for posts using multiple keywords, hashtags, usernames, or phrases
  • ๐Ÿ“Š Multiple Output Formats: Export data in JSON, CSV, or Excel format
  • ๐Ÿ”„ Real-time Data: Get the latest posts from Threads
  • ๐Ÿ“ˆ Engagement Metrics: Collect likes, replies, reposts, quotes, and view counts
  • ๐ŸŽฏ Advanced Filtering: Sort by recent or relevant posts
  • ๐Ÿš€ High Performance: Fast API-powered scraping with pagination support
  • ๐Ÿ”„ Deduplication: Automatically removes duplicate posts across keywords

Input Parameters

Required Fields

  • Search Keywords (keywords): Array of keywords to search for
    • Examples: ["#AI", "Python", "Meta", "@username"]
    • Each keyword is searched separately via individual API calls

Optional Fields

  • Maximum Items Per Keyword (maxItemsPerKeyword): Maximum number of posts per keyword (1-2000, default: 50)
  • Sort By Recent (sortByRecent): Sort by recent posts (true) or relevant posts (false, default: true)
  • Output Format (outputFormat): Choose output format - JSON, CSV, or Excel (default: JSON)
  • Proxy Configuration (proxyConfiguration): Proxy settings for requests

Output Data

Each post includes the following fields:

Basic Information

  • id: Unique post identifier
  • text: Post content/caption
  • author: Username of the post author
  • author_name: Display name of the author
  • author_id: Unique author identifier
  • created_at: Post creation timestamp

Engagement Metrics

  • like_count: Number of likes
  • reply_count: Number of replies
  • repost_count: Number of reposts
  • quote_count: Number of quotes
  • view_count: Number of views

Content Analysis

  • hashtags: List of hashtags used
  • mentions: List of mentioned users
  • urls: List of URLs in the post
  • media: List of media attachments
  • lang: Language of the post

Metadata

  • is_reply: Whether the post is a reply
  • is_repost: Whether the post is a repost
  • url: Direct link to the post
  • verified: Whether the author is verified
  • follower_count: Author's follower count
  • following_count: Author's following count

Usage Examples

{
"keywords": ["#AI", "Python", "Meta"],
"maxItemsPerKeyword": 100,
"sortByRecent": true,
"outputFormat": "json"
}

Advanced Configuration

{
"keywords": ["#MachineLearning", "@meta", "artificial intelligence"],
"maxItemsPerKeyword": 200,
"sortByRecent": false,
"outputFormat": "excel",
"proxyConfiguration": {
"useApifyProxy": true
}
}

Multiple Keywords Example

{
"keywords": [
"#TechNews",
"#Innovation",
"#Startup",
"@elonmusk",
"blockchain"
],
"maxItemsPerKeyword": 50,
"sortByRecent": true,
"outputFormat": "csv"
}

API Endpoint

This actor uses the /threads/search/posts endpoint with the following parameters:

  • query: Search query string
  • cursor: Pagination cursor for next page (optional)
  • recent: Sort by recent (1) or relevant (0) posts

Performance Notes

  • The scraper uses pagination to collect large datasets efficiently
  • Automatic retry logic with exponential backoff for reliability
  • Rate limiting between requests to be respectful to the API
  • Deduplication removes duplicate posts when using multiple keywords
  • Maximum 20 items per API call for optimal performance

Output Formats

JSON (Default)

  • Complete data including all fields and raw API response
  • Best for programmatic processing and analysis

CSV

  • Clean tabular format without raw data
  • Perfect for spreadsheet analysis and data visualization

Excel (XLSX)

  • Professional format with proper data types
  • Ideal for business reports and presentations

Error Handling

The actor includes comprehensive error handling:

  • Automatic retry on network failures
  • Graceful handling of API rate limits
  • Detailed logging for troubleshooting
  • Validation of input parameters

Support

For questions, issues, or feature requests, please contact our support team or create an issue in the repository.

License

This project is licensed under the Apache-2.0 License.