No credit card required

edX Online Course Data Extractor

epctex/edx-scraper

No credit card required

Effortlessly scrape thousands of online courses from edX. Extract titles, images, details, owners, and all other course details. Customize your search with filters like language and more for precise results.

Actor - edx Scraper

edx scraper

Since edx doesn't provide a proper free API, this actor should help you to retrieve data from it.

The edx data scraper supports the following features:

Scrape any courses you would like to get - You can search for a specific keyword and scrape the results accordingly.
Apply any of the filters - You can apply any filter provided by the website.
Scrape by language - You can filter by language from the actor default.
Limit the results by page or amount of property. - If you don't want to get all the results but a specific amount you can limit it.

Bugs, fixes, updates, and changelog

This scraper is under active development. If you have any feature requests you can create an issue from here.

Input Parameters

The input of this scraper should be JSON containing the list of pages on edx that should be visited. Possible fields are:

search: (Optional) (String) Search keyword that you would like to search the courses in.
language: (Optional) (String) Scrape the results by the course language.
maxItems: (Optional) (Number) You can limit scraped items. This should be useful when you search through the big lists or search results.
proxy: (Required) (Proxy Object) Proxy configuration.

This solution requires the use of Proxy servers, either your own proxy servers or you can use Apify Proxy.

Compute Unit Consumption

The actor is optimized to run blazing fast and scrape many listings as possible. Therefore, it forefronts all listing detail requests. If the actor doesn't block very often it'll scrape 100 listings in 1 minute with ~0.03-0.04 compute units.

edx Scraper Input example

1{
2  "search":"span",
3  "language":"English",
4  "maxItems":10,
5  "proxy":{
6    "useApifyProxy":true
7  }
8}

During the Run

During the run, the actor will output messages letting you know what is going on. Each message always contains a short label specifying which page from the provided list is currently specified. When items are loaded from the page, you should see a message about this event with a loaded item count and total item count for each page.

If you provide incorrect input to the actor, it will immediately stop with a failure state and output an explanation of what is wrong.

edx Export

During the run, the actor stores results into a dataset. Each item is a separate item in the dataset.

You can manage the results in any language (Python, PHP, Node JS/NPM). See the FAQ or our API reference to learn more about getting results from this edx actor.

Scraped edx Properties

The structure of each item in edx listings looks like this:

###Course Output

1{
2    "uuid": "09222e6c-a1bc-4307-ba80-294ea4281117",
3    "title": "Shaping Work of the Future",
4    "inProspectus": true,
5    "prospectusPath": "/course/shaping-work-of-the-future",
6    "organizationShortCodeOverride": "",
7    "organizationLogoOverrideUrl": null,
8    "courseType": "verified-audit",
9    "inYearValue": null,
10    "activeCourseRun": {
11        "key": "course-v1:MITx+15.662x+1T2020",
12        "type": "verified",
13        "marketingUrl": "https://www.edx.org/course/shaping-work-of-the-future-3",
14        "minEffort": 4,
15        "maxEffort": 5,
16        "weeksToComplete": 8
17    },
18    "image": {
19        "src": "https://prod-discovery.edx-cdn.org/media/course/image/09222e6c-a1bc-4307-ba80-294ea4281117-e28e19d05647.small.jpg"
20    },
21    "locationRestriction": null,
22    "owners": [
23        {
24            "key": "MITx",
25            "logoImageUrl": "https://prod-discovery.edx-cdn.org/organization/logos/2a73d2ce-c34a-4e08-8223-83bca9d2f01d-2cc8854c6fee.png",
26            "name": "Massachusetts Institute of Technology"
27        }
28    ],
29    "recentEnrollmentCount": 1149,
30    "topics": [],
31    "additionalMetadata": null,
32    "objectID": "course-09222e6c-a1bc-4307-ba80-294ea4281117",
33    "cardType": "course",
34    "cardIndex": 3,
35    "url": "https://learning.edx.org/course/course-v1:MITx+15.662x+1T2020/home."
36}

Contact

Please visit us through epctex.com to see all the products that are available for you. If you are looking for any custom integration or so, please reach out to us through the chat box in epctex.com. In need of support? devops@epctex.com is at your service.

Developer

epctex

Actor metrics

2 monthly users
100.0% runs succeeded
0.0 days response time
Created in Feb 2020
Modified about 2 hours ago

Categories

Business

Jobs

Videos

Google Maps Scraper

compass/crawler-google-places

Extract data from hundreds of Google Maps locations and businesses. Get Google Maps data including reviews, images, contact info, opening hours, location, popular times, prices & more. Export scraped data, run the scraper via API, schedule and monitor runs, or integrate with other tools.

Compass

60.5k

Website Content Crawler

apify/website-content-crawler

Automatically crawl and extract text content from websites with documentation, knowledge bases, help centers, or blogs. This Actor is designed to provide data to feed, fine-tune, or train large language models such as ChatGPT or LLaMA.

Apify

12.2k

TikTok Data Extractor

clockworks/free-tiktok-scraper

Extract data about videos, users, and channels based on hashtags or scrape full user profiles including posts, total likes, name, nickname, numbers of comments, shares, followers, following, and more.

Clockworks

10.8k

GPT Scraper

drobnikj/gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Jakub Drobník

4.1k

AI Product Matcher

equidem/ai-product-matcher

Match products across multiple e-commerce websites. Use this AI product matching Actor whenever you need to find matching pairs of products from different online shops for dynamic pricing, competitor analysis or market research.

Matěj Sochor

281

Youtube Scraper

streamers/youtube-scraper

YouTube crawler and video scraper. Alternative YouTube API with no limits or quotas. Extract and download channel name, likes, number of views, and number of subscribers.

Streamers

3.2k

Facebook Ads Scraper

apify/facebook-ads-scraper

Extract advertising data from one or multiple Facebook Pages. Get page details, reach estimates, publisher platforms, report count, number of impressions, ad IDs, timestamps, and more. Download Facebook ads data in JSON, CSV, and Excel and use it in apps, spreadsheets, and reports.

Apify

3.8k

Indeed Scraper

misceres/indeed-scraper

Scrape jobs posted on Indeed. Get detailed information from this job portal about saved and sponsored jobs. Specify the search based on location with the output attributes position, location, and description.

Misceres

2.2k

GIF Scroll Animation

glenn/gif-scroll-animation

Free tool to automatically create an animated GIF of any scrolling web page. Useful for testing UX, showcasing your work, and capturing any website as a GIF, including clickable elements and animations. Includes settings to adjust speed, wait before scrolling, slow down on-page animations, and more.

Glenn Goossens

4.2k

TikTok Scraper

clockworks/tiktok-scraper

Extract data from TikTok videos, hashtags, and users. Use URLs or search queries to scrape TikTok profiles, hashtags, posts, URLs, shares, followers, hearts, names, video, and music-related data. Export scraped data, run the scraper via API, schedule and monitor runs or integrate with other tools.

Clockworks

10.6k

How to scrape Google Images

Top 5 web scraping tools to help you gather retail analytics

How to scrape data from Walmart

Build new tools

Are you a developer? Build your own Actors and run them on Apify.

Learn more

Get a custom solution

Get a custom web scraping or RPA solution.

Book a demo

edX Online Course Data Extractor

Actor - edx Scraper

edx scraper

Bugs, fixes, updates, and changelog