Simple Linear Regression Salary Data
Data Science and Analytics
Free
About
This dataset is designed for salary prediction using simple linear regression, a fundamental machine learning technique. It comprises two key columns, enabling the development and evaluation of models that forecast an individual's salary based on their years of experience. The dataset has been utilised in the Machine Learning A-Z course, making it an excellent resource for educational purposes and introductory machine learning projects.
Columns
- Years of Experience: Represents the professional experience of an individual, measured in years. This column serves as the independent variable for prediction models.
  - Minimum value: 1.1 years
  - Maximum value: 10.5 years
  - Mean experience: 5.31 years
  - Standard deviation: 2.79 years
  - This column contains 30 valid entries with no missing or mismatched data.
- Salary: Indicates the salary of the person, typically the dependent variable to be predicted.
  - Minimum salary: £37,700 (approx)
  - Maximum salary: £122,000 (approx)
  - Mean salary: £76,000 (approx)
  - Standard deviation: £27,000 (approx)
  - This column also has 30 valid entries, with no missing or mismatched data.
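The per-column figures above (minimum, maximum, mean, standard deviation) can be recomputed with a few lines of standard-library Python. The sketch below is illustrative only: the `describe` helper and the sample values are hypothetical and are not rows from the dataset. The listing does not say whether its standard deviation divides by n or by n - 1, so both conventions are returned.

```python
import statistics


def describe(values):
    """Return basic summary statistics for a list of numbers."""
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": statistics.fmean(values),
        # Population convention: divides by n.
        "std_population": statistics.pstdev(values),
        # Sample convention: divides by n - 1.
        "std_sample": statistics.stdev(values),
    }


# Hypothetical values for illustration only; not taken from the dataset.
years = [1.0, 2.0, 3.0, 4.0, 5.0]
summary = describe(years)
print(summary)
```

With only 30 rows the two conventions differ by a couple of percent, which is worth keeping in mind when comparing recomputed values against the figures listed here.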
Distribution
The dataset is provided as a CSV file named 'Salary Data.csv' and has a size of 454 B. It consists of 30 records (rows) and 2 columns, formatted for straightforward use in data analysis and machine learning applications.
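A two-column CSV of this shape can be read with Python's built-in `csv` module. The following is a minimal sketch under stated assumptions: the file has a single header row, experience is the first column and salary the second, and the header names and values in the in-memory sample are hypothetical stand-ins. The `load_salary_rows` helper is likewise an illustrative name, not part of the dataset.

```python
import csv
import io


def load_salary_rows(fileobj):
    """Read (years_of_experience, salary) pairs from a two-column CSV.

    Assumes one header row, with experience in the first column and
    salary in the second. Columns are read by position, so the exact
    header names do not matter.
    """
    reader = csv.reader(fileobj)
    next(reader)  # skip the header row (assumed to be present)
    rows = []
    for record in reader:
        if not record:  # ignore blank lines
            continue
        rows.append((float(record[0]), float(record[1])))
    return rows


# In-memory stand-in with hypothetical header names and values.
sample = io.StringIO("YearsExperience,Salary\n1.5,40000\n3.0,55000\n")
rows = load_salary_rows(sample)
print(rows)

# For the real file, something along these lines should work:
# with open("Salary Data.csv", newline="") as f:
#     rows = load_salary_rows(f)
```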
Usage
This dataset is ideally suited for:
- Building and testing simple linear regression models for salary prediction.
- Educational purposes in machine learning courses, particularly for understanding basic regression concepts.
- Demonstrating data analysis workflows involving numerical features.
- Exploratory data analysis to understand the relationship between experience and salary.
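As a rough illustration of the first use case, simple linear regression can be fitted in closed form with ordinary least squares: the slope is the covariance of experience and salary divided by the variance of experience, and the intercept follows from the two means. The sketch below uses only plain Python; the function names and the four data points are hypothetical and chosen to lie on an exact line, so they say nothing about the real dataset.

```python
def fit_simple_linear_regression(x, y):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    # Sum of squared deviations of x, and of cross-deviations of x and y.
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return slope, intercept


def r_squared(x, y, slope, intercept):
    """Coefficient of determination for the fitted line."""
    y_mean = sum(y) / len(y)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - y_mean) ** 2 for yi in y)
    return 1 - ss_res / ss_tot


# Hypothetical points on an exact line; not rows from the dataset.
experience = [1.0, 2.0, 3.0, 4.0]
salary = [30000.0, 35000.0, 40000.0, 45000.0]

slope, intercept = fit_simple_linear_regression(experience, salary)
predicted = intercept + slope * 5.0  # prediction for 5 years of experience
score = r_squared(experience, salary, slope, intercept)
print(slope, intercept, predicted, score)
```

In practice a library such as scikit-learn is the more common choice; the hand-written version is shown here because it makes the underlying arithmetic visible.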
Coverage
The dataset focuses purely on the relationship between years of experience and salary. It does not provide specific geographic, time range, or demographic details for the individuals represented.
License
CC0: Public Domain
Who Can Use It
This dataset is suitable for a wide range of users, including:
- Students and educators studying or teaching machine learning, particularly regression analysis.
- Data analysts looking for a clean, simple dataset to practise fundamental analytical techniques.
- Beginner data scientists building their first predictive models.
- Anyone interested in the correlation between experience and earnings.
Dataset Name Suggestions
- Experience-Salary Prediction Dataset
- Simple Linear Regression Salary Data
- Years of Experience Salary Dataset
- Machine Learning A-Z Salary Data
Attributes
Original Data Source: Simple Linear Regression Salary Data