EDX  Discovery Scraper avatar

EDX Discovery Scraper

Pricing

from $9.00 / 1,000 results

Go to Apify Store
EDX  Discovery Scraper

EDX Discovery Scraper

The EDX Discovery Scraper extracts detailed course data from EDX, including descriptions, pricing, and organization info, aiding market research and competitive analysis....

Pricing

from $9.00 / 1,000 results

Rating

0.0

(0)

Developer

GetDataForMe

GetDataForMe

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

EDX Discovery Scraper

Introduction

The EDX Discovery Scraper is a powerful tool designed to extract detailed course information from the EDX platform. It enables users to gather comprehensive data on courses, including descriptions, pricing, and organizational details, facilitating market research, competitive analysis, and content aggregation.

Features

  • Comprehensive Data Extraction: Scrapes detailed course information including descriptions, pricing, and organizational details.
  • High Data Quality: Ensures accurate and up-to-date information directly from EDX.
  • Customizable Searches: Allows users to specify search queries for targeted data extraction.
  • Efficient Performance: Capable of handling large volumes of data with a high degree of reliability.
  • Proxy Support: Utilizes Apify's residential proxy network to ensure anonymity and avoid IP bans.
  • Scalable: Easily handles varying data loads, from small to large-scale scraping tasks.

Input Parameters

ParameterTypeRequiredDescriptionExample
searchQueryStringYesEnter the Search Keywords"data"
maxItemsIntegerNoMaximum number of items to scrape.100
proxyConfigurationObjectNoSpecifies proxy servers to be used by the scraper to hide its origin.See below

Proxy Configuration Example

{
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"apifyProxyCountry": "US"
}

Example Usage

Example Input

{
"searchQuery": "data science",
"maxItems": 50,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"apifyProxyCountry": "US"
}
}

Example Output

[
{
"id": "91f52ef3-fa3f-4934-9d19-8d5a32635cd4",
"name": "Data Science: R Basics",
"source": "edx",
"type": "course",
"subType": "Course",
"slug": "learn/r-programming/harvard-university-data-science-r-basics",
"description": "<p>The first in our Professional Certificate Program in Data Science...</p>",
"shortDescription": "<p>Build a foundation in R and learn how to wrangle, analyze, and visualize data.</p>",
"imageUrl": "https://prod-discovery.edx-cdn.org/cdn-cgi/image/width=auto,height=auto,quality=75,format=webp/media/course/image/91f52ef3-fa3f-4934-9d19-8d5a32635cd4-d99e27f09d19.jpg",
"destinationUrl": "https://www.edx.org/learn/r-programming/harvard-university-data-science-r-basics",
"organization": {
"name": "Harvard University",
"logoUrl": "https://prod-discovery.edx-cdn.org/organization/logos/44022f13-20df-4666-9111-cede3e5dc5b6-2cc39992c67a.png"
},
"meta": {
"productType": "Course",
"completionTimeText": "8 weeks",
"flexibilityLabel": "Self-paced",
"highestLevel": "Introductory",
"learningOutcome": "1 certificate",
"listPriceFormatted": "$219",
"strikethroughPriceFormatted": ""
},
"subjects": [
"Data Analysis & Statistics",
"Computer Science"
],
"skills": [
"Data Science",
"Git (Version Control System)",
"Data Wrangling",
"RStudio",
"Document Preparations",
"Sorting",
"Data Visualization",
"Machine Learning",
"R (Programming Language)",
"Data Analysis",
"Ggplot2",
"Unix",
"Dplyr",
"Version Control",
"Github",
"Probability",
"Linux",
"File Organization"
],
"level": "Introductory",
"flexibility": "self_paced",
"weeksToComplete": 8,
"minHoursEffortPerWeek": 1,
"courseCount": 0,
"language": [
"English"
],
"listPrice": 219,
"strikethroughPrice": 0,
"currencyCode": "USD",
"availability": [
"Current"
],
"partner": [
"Harvard University"
],
"sortPosition": 0,
"scrapedAt": "2026-05-29T09:42:21.183Z"
}
]

Use Cases

  • Market Research and Analysis: Gather insights on educational trends and course offerings.
  • Competitive Intelligence: Monitor competitor course offerings and pricing strategies.
  • Price Monitoring: Track changes in course pricing and availability.
  • Content Aggregation: Compile comprehensive course data for educational platforms.
  • Academic Research: Analyze educational content and trends for research purposes.
  • Business Automation: Automate data collection for educational service providers.

Installation and Usage

  1. Search for "EDX Discovery Scraper" in the Apify Store.
  2. Click "Try for free" or "Run".
  3. Configure input parameters.
  4. Click "Start" to begin extraction.
  5. Monitor progress in the log.
  6. Export results in your preferred format (JSON, CSV, Excel).

Output Format

The output is a JSON array where each object contains detailed information about a course, including fields like id, name, description, organization, subjects, skills, and more. This structured format allows for easy integration into various applications and analysis tools.

Support

For custom/simplified outputs or bug reports, please contact:

We're here to help you get the most out of this Actor!