Opendatabay APP

Customer Churn in Financial Services

Finance & Banking Analytics

Tags and Keywords

Churn

Banking

Customer

Prediction

Finance

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Customer Churn in Financial Services Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset is designed for predicting customer churn in the banking industry using machine learning. It contains detailed information on bank customers, indicating whether they have left the bank or remain a customer. The primary purpose is to enable the development of predictive models to identify customers at risk of churning, offering insights into factors that influence customer retention within banking services.

Columns

  • Customer ID: A unique identifier for each customer.
  • Surname: The customer's surname or last name.
  • Credit Score: A numerical value representing the customer's credit score.
  • Geography: The country where the customer resides, limited to France, Spain, or Germany.
  • Gender: The customer's gender, either Male or Female.
  • Age: The customer's age in years.
  • Tenure: The number of years the customer has been with the bank.
  • Balance: The customer's account balance.
  • NumOfProducts: The number of bank products the customer currently uses (e.g., savings account, credit card).
  • HasCrCard: Indicates whether the customer possesses a credit card (1 for yes, 0 for no).
  • IsActiveMember: Indicates whether the customer is considered an active member (1 for yes, 0 for no).
  • EstimatedSalary: The customer's estimated salary.
  • Exited: The target variable, indicating whether the customer has churned (1 for yes, 0 for no).
  • RowNumber: A sequential number for each row in the dataset.

Distribution

The dataset is provided as a CSV file named Churn_Modelling.csv, with a file size of 685 kB. It consists of 10,000 records across 14 columns. All records are valid, with 0% mismatched data. A small number of missing values (1 record) are noted for the 'Geography', 'Age', 'HasCrCard', and 'IsActiveMember' columns.
Key statistical summaries for numerical columns include:
  • RowNumber: Mean 5,000, Std. Deviation 2.89k, ranging from 1 to 10,000.
  • Customer ID: Mean 15.7m, Std. Deviation 71.9k, ranging from 15.6m to 15.8m.
  • Credit Score: Mean 651, Std. Deviation 96.7, ranging from 350 to 850.
  • Age: Mean 38.9, Std. Deviation 10.5, ranging from 18 to 92.
  • Tenure: Mean 5.01, Std. Deviation 2.89, ranging from 0 to 10 years.
  • Balance: Mean 76.5k, Std. Deviation 62.4k, ranging from 0 to 251k. A notable proportion of customers have a balance of 0.
  • NumOfProducts: Mean 1.53, Std. Deviation 0.58, ranging from 1 to 4.
  • HasCrCard: Mean 0.71, Std. Deviation 0.46.
  • IsActiveMember: Mean 0.51, Std. Deviation 0.5.
  • EstimatedSalary: Mean 100k, Std. Deviation 57.5k, ranging from 11.6 to 200k.
  • Exited: Mean 0.2, Std. Deviation 0.4.

Usage

This dataset is ideal for:
  • Developing machine learning models for customer churn prediction, particularly using techniques like Neural Networks and Random Forest.
  • Data visualisation to explore patterns and relationships between customer attributes and churn behaviour.
  • Data cleaning exercises as part of a machine learning workflow.
  • Analysing customer behaviour in the banking sector to understand factors contributing to customer attrition.
  • Building solutions for E-Commerce services related to customer retention strategies.

Coverage

  • Geographic Scope: Customers primarily reside in France (most common at 50%), Spain, and Germany.
  • Demographic Scope: Includes customer gender (55% Male, 45% Female) and age, with customers ranging from 18 to 92 years old.
  • Time Range: The dataset does not specify a particular time range for the customer data.

License

CC0: Public Domain

Who Can Use It

This dataset is suitable for:
  • Data Scientists and Machine Learning Engineers looking to build and test churn prediction models.
  • Business Analysts in the banking and financial services sector who aim to identify at-risk customers and inform retention strategies.
  • Researchers studying customer behaviour, loyalty, and attrition in the financial industry.
  • Students and educators for learning and teaching concepts in predictive analytics and customer relationship management.

Dataset Name Suggestions

  • Bank Customer Churn Prediction Dataset
  • Banking Churn Analytics
  • Customer Churn in Financial Services
  • Bank Client Retention Data

Attributes

Listing Stats

VIEWS

5

DOWNLOADS

3

LISTED

20/07/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format