Udemy IT & Software Development Course Metrics
Education & Learning Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
A focused collection of over 22,000 educational courses available in the IT and Software category on the Udemy platform. This data compilation is centered primarily on the development niche, covering details for approximately 10,000 specific courses within that domain, though the overall file size reflects 22,000+ entries. The dataset offers granular insights into course popularity, pricing strategies, rating performance, and structural characteristics, making it valuable for market analysis of the online learning industry.
Columns
The dataset contains 20 distinct data fields:
- id: The unique identifier assigned to each course.
- title: The specific, unique name of the course.
- url: The web address linking directly to the course page.
- is_paid: A Boolean value indicating whether the course requires payment or is free (True/False).
- num_subscribers: The total number of people who have subscribed to or enrolled in the course. The median subscriber count is around 2,483.
- avg_rating: The calculated average rating received by the course.
- avg_rating_recent: Reflects changes in the course’s average rating over a recent period.
- rating: Shows the overall rating attributed to the course.
- num_reviews: Gives an idea of the total number of reviews or ratings received.
- is_wishlisted: Indicates whether the course has been added to a user’s wishlist.
- num_published_lectures: The count of lectures offered within the course.
- num_published_practice_tests: The number of practice tests available for the course.
- created: The date and time when the course was initially created.
- published_time: The date and time when the course was officially published.
- discount_price__amount: The monetary value of the course after any discount is applied.
- discount_price__currency: The currency type corresponding to the discounted price (e.g., INR).
- discount_price__price_string: The discounted price represented as a string format (e.g., '₹455').
- price_detail__amount: The original full price of the course.
- price_detail__currency: The currency type corresponding to the original price for uniformity.
- price_detail__price_string: The original price presented in a string format.
Distribution
The data is provided in a standard structured file format, likely CSV, and is approximately 5.23 MB in file size. It encompasses 20 columns and features 22,900 valid records. The data is generally very clean, with 100% validity across core identifiers and ratings. However, specific discounted price fields show approximately 8% missing values, typically correlating with courses that are either free or not currently on sale.
Usage
This product is highly useful for benchmarking and analysis within the educational technology (EdTech) sector. Ideal applications include:
- Analysing market trends in IT and Software course demand based on subscriber numbers.
- Evaluating the impact of course structure (lectures, tests) on user ratings and engagement.
- Developing dynamic pricing models by studying the difference between original and discounted prices.
- Identifying top-performing or rapidly rising courses and instructors.
- Academic research into mass online education patterns.
Coverage
The data scope covers IT and Software courses, heavily weighted toward the development subcategory, offered globally via Udemy. The courses included span a creation and publishing period starting in April 2010 and extending through to September 2020. Currency fields are predominantly in INR, providing a specific regional perspective on pricing structures.
License
CC0: Public Domain
Who Can Use It
- Data Engineers and Scientists: Utilizing the data to build machine learning models to predict course success or forecast subscription rates.
- Online Education Platforms: Benchmarking their course catalogue against successful competitors and refining content strategy.
- Financial Analysts: Studying pricing elasticity and revenue streams across diverse educational offerings.
- Independent Course Creators: Gaining strategic insight into optimal course length, structure, and marketing timing.
Dataset Name Suggestions
- Udemy IT & Software Development Course Metrics
- 22k+ Udemy E-Learning Analysis Data
- Global Online Tech Education Market Data
- Udemy Course Performance Indicators (2010-2020)
Attributes
Original Data Source:Udemy IT & Software Development Course Metrics
Loading...
