Simple Insurance Premium Prediction Data
Data Science and Analytics
Free
About
This dataset is designed for machine learning practice, specifically simple linear regression and prediction. It has two columns, 'Age' and 'Premium', and is intended for exploring the relationship between an individual's age and their insurance premium. It is particularly useful for teaching in data analytics and computer science.
Columns
- Age: This column represents the age of an individual. There are 7 total values, all of which are valid and contain no missing data. The mean age is 25.9, with a standard deviation of 4.88. The ages range from a minimum of 18 to a maximum of 33.
- Premium: This column gives the insurance premium for each individual. It also contains 7 valid values with no missing data. The mean premium is about 20,200, with a standard deviation of about 5,820. Premiums range from a minimum of 10,000 to a maximum of 27,000.
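The per-column figures above (count, mean, standard deviation, minimum, maximum) could be reproduced with a short sketch like the one below. The helper name `summarize` is hypothetical, and the listing does not say whether its standard deviation is the sample or the population form, so both are computed. The example values are illustrative only and are not rows from this dataset.

```python
import statistics


def summarize(values):
    """Return count, mean, spread, and range for one numeric column."""
    return {
        "count": len(values),
        "mean": statistics.mean(values),
        # The listing does not state which form it reports, so keep both.
        "sample_std": statistics.stdev(values),       # divides by n - 1
        "population_std": statistics.pstdev(values),  # divides by n
        "min": min(values),
        "max": max(values),
    }


# Illustrative values only; these are NOT the dataset's actual rows.
example_ages = [20, 25, 30]
print(summarize(example_ages))
```

With only 7 rows the two standard deviation forms differ noticeably, so it is worth checking which one matches the published figure.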
Distribution
The dataset is provided in CSV format and is structured with two columns. It comprises 7 records, or rows, each containing a pair of 'Age' and 'Premium' values. The file size is 83 B.
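A two-column CSV like this can be read with Python's standard `csv` module. The sketch below assumes the file has a header row spelled exactly `Age,Premium`, which the listing does not confirm. The function name `load_age_premium` is hypothetical, and the inline CSV text is placeholder content rather than the real file.

```python
import csv
import io


def load_age_premium(fileobj):
    """Read (age, premium) pairs from a CSV file object with a header row."""
    reader = csv.DictReader(fileobj)
    ages, premiums = [], []
    for row in reader:
        # Assumed header names; adjust if the real file spells them differently.
        ages.append(float(row["Age"]))
        premiums.append(float(row["Premium"]))
    return ages, premiums


# Placeholder CSV text standing in for the real file; NOT the dataset's rows.
sample_csv = io.StringIO("Age,Premium\n20,12000\n30,18000\n")
ages, premiums = load_age_premium(sample_csv)
print(ages, premiums)
```

For a file on disk, the same function can be given a handle from `open(path, newline="")`, where `path` is wherever the downloaded CSV was saved.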
Usage
This dataset is ideally suited for a variety of applications, including:
- Developing and testing machine learning models, particularly those focused on simple linear regression.
- Practising prediction techniques to forecast insurance premiums based on age.
- Serving as a practical tool for educational exercises in areas such as data analytics, computer science, and statistics.
- Facilitating data visualisation using libraries like Matplotlib to illustrate linear relationships.
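The regression and prediction uses listed above can be sketched with the closed-form least-squares solution for a single predictor, using only the standard library. The function names `fit_line` and `predict` are hypothetical, and the example values are illustrative, not the dataset's rows.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(age, slope, intercept):
    """Predict a premium for a given age from the fitted line."""
    return slope * age + intercept


# Illustrative values only; these are NOT the dataset's actual rows.
example_ages = [20, 25, 30]
example_premiums = [12000, 15000, 18000]
slope, intercept = fit_line(example_ages, example_premiums)
print(slope, intercept, predict(28, slope, intercept))
```

Because the dataset only covers ages 18 to 33, predictions outside that range are extrapolations and should be treated with caution.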
Coverage
The dataset covers individuals aged 18 to 33. The source does not specify a geographic region or time period. All 7 records are valid, with no missing or mismatched values.
License
CC0: Public Domain
Who Can Use It
This dataset is beneficial for:
- Students and educators seeking to learn and teach core concepts in machine learning, linear regression, and data analysis.
- Data scientists and analysts who require a clean, straightforward dataset for rapid model prototyping or for demonstrating fundamental predictive analytics.
- Individuals interested in insurance data modelling or exploring basic correlations between demographic factors and financial metrics.
Dataset Name Suggestions
- Age-Premium Linear Regression Dataset
- Simple Insurance Premium Prediction Data
- Individual Age and Insurance Cost Data
- Machine Learning Regression Practice Set
Attributes
Original Data Source: Simple Insurance Premium Prediction Data