Synthetic HR Employee Records
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset features 30,000 synthetic employee records, offering key details such as names, ages, departments, positions, salaries, and joining dates. It is specifically designed for HR analytics, salary trend analysis, and various machine learning applications. Being entirely synthetic, its primary purpose is for educational, research, and analytical use.
Columns
- Employee_ID: A unique identifier assigned to each employee.
- Employee_Name: A randomly generated full name for each employee.
- Age: The employee's age, ranging from 22 to 60 years.
- Country: The country of employment, selected from 10 distinct countries.
- Department: The assigned department, such as HR, Finance, or Engineering.
- Position: The employee's job role, for instance, Manager, Developer, or Analyst.
- Salary: The annual salary, generated to be between $30,000 and $150,000.
- Joining_Date: The employee's start date, randomly chosen from the past decade.
Distribution
The dataset comprises 30,000 individual employee records, formatted as a CSV file (
employee_records.csv
) with a size of 2.01 MB. It includes 8 distinct columns. All records across all columns are validated, showing no missing or mismatched values. The age distribution spans from 22 to 60 years, while salaries range from $30,000 to $150,000. Joining dates cover a period from 11th March 2015 to 8th March 2025.Usage
This dataset is ideal for:
- HR analytics: Enabling the study of workforce demographics and departmental distributions.
- Salary trend analysis: Facilitating the examination of compensation patterns across various job roles and geographical regions.
- Employee attrition prediction: Serving as a foundation for building machine learning models aimed at gaining insights into employee retention.
- Workforce planning: Supporting simulations for hiring scenarios and salary forecasting.
Coverage
- Geographic: The data includes employees from 10 different countries, with Germany and Australia each representing 10% of the entries.
- Time Range: Employee joining dates are spread over the last 10 years, specifically from 11th March 2015 to 8th March 2025.
- Demographic Scope: The age of employees in the dataset ranges from 22 to 60 years.
License
CC0: Public Domain
Who Can Use It
This dataset is particularly useful for:
- HR professionals and analysts: For in-depth workforce analysis and understanding demographic trends.
- Data scientists and machine learning engineers: For developing predictive models, such as those for salary forecasting or employee retention.
- Academics and students: As a practical resource for educational projects, research, and statistical analysis.
- Business strategists: For strategic workforce planning and scenario modelling related to hiring and compensation.
Dataset Name Suggestions
- Synthetic HR Employee Records
- Workforce Data for Analytics
- Employee Salary and Demographics
- HR Metrics Simulation Dataset
- Employee Data for Machine Learning
Attributes
Original Data Source: Synthetic HR Employee Records