GPA vs. Study Hours Survey Data
Education & Learning Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset captures grade point averages (GPA) and weekly study hours for a sample of 193 undergraduate university students. It was collected in 2012 from students enrolled in an introductory statistics course at a private US university. The dataset is particularly useful for illustrating challenges often encountered with real-world survey data, as both variables are self-reported and include a notable data error where one student's GPA exceeds the typical 4-point scale.
Columns
- gpa: Represents the Grade Point Average of the student. GPA typically ranges from 0 to 4 points; however, one observation in this dataset shows a GPA greater than 4, which has been intentionally left in to demonstrate real survey data issues. The mean GPA is approximately 3.59 with a standard deviation of 0.28.
- study_hours: Denotes the number of hours students reported studying per week. The mean study hours are around 17.5, with a standard deviation of 11.4.
Distribution
The dataset is structured as a data frame comprising 193 distinct observations across 2 variables. It is provided in a CSV format, specifically as
gpa_study_hours.csv
, with a file size of 1.42 kB.Usage
This dataset is ideally suited for various analytical and educational applications. It can be used for:
- Conducting statistical analyses to explore the relationship between study hours and academic performance.
- Developing and testing statistical models.
- Teaching students about data cleaning, error handling, and the nuances of working with self-reported data.
- Illustrating typical issues found in real survey data, such as outliers and potential inaccuracies.
Coverage
The data was collected in 2012 from undergraduate students attending a private university located in the United States. It specifically targets students enrolled in an introductory statistics course. It is important to note that both GPA and study hours are self-reported, which may impact their accuracy. Additionally, one student's reported GPA is an outlier, exceeding the typical maximum of 4 points, and this observation has been retained to highlight real-world data challenges.
License
CC BY-SA 3.0
Who Can Use It
This dataset is valuable for:
- Students and Educators: For learning and teaching concepts in statistics, data analysis, and survey methodology.
- Researchers: Studying educational outcomes, student behaviour, or the impact of study habits.
- Data Analysts: Practising data cleaning, exploratory data analysis, and handling real-world data imperfections.
Dataset Name Suggestions
- Student GPA and Study Hours
- Undergraduate Study Habits Data
- GPA vs. Study Hours Survey Data
- University Student Performance Dataset
Attributes
Original Data Source: GPA vs. Study Hours Survey Data