Synthetic Dating Behaviour Data
Synthetic Data Generation
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides a detailed view into the dynamics of online matchmaking interactions, capturing essential variables that influence the likelihood of successful matches across different genders. It aims to assist in understanding how factors such as VIP subscription status, income levels, parental status, age, and self-perceived attractiveness contribute to the outcomes of online dating endeavours. This data is designed for educational purposes, allowing researchers and analysts to explore and model these influential elements.
Columns
- Gender: Indicates the user's gender, represented as 0 for Male and 1 for Female.
- PurchasedVIP: A binary flag showing whether a user has a VIP subscription, with 0 for No and 1 for Yes.
- Income: The user's annual income, expressed in USD.
- Children: The numerical count of children a user has.
- Age: The age of the user.
- Attractiveness: A subjective rating of a user's attractiveness, ranging from 1 to 10.
- Matches: The number of matches a user has obtained, serving as the target variable and an indicator of online dating success.
Distribution
This dataset, titled Online_Dating_Behavior_Dataset.csv, is available in CSV format and consists of 1000 records. It comprises 7 distinct columns and is structured as a tabular dataset. It is important to note that 1000 records is considered a relatively low count within this category of datasets.
Usage
Ideal applications and use cases for this dataset include:
- Analysing gender-specific dating preferences and behaviours.
- Developing and evaluating models for predicting match success in online dating.
- Exploring the impact of user attributes on matchmaking outcomes.
- Supporting educational data science and machine learning projects focusing on social dynamics.
Coverage
The dataset focuses on the behaviour of online dating users, encompassing various demographics such as gender, age, income, parental status, and self-perceived attractiveness. Data was captured intermittently over different periods. It is a synthetic dataset generated for educational purposes, meaning the findings may not perfectly represent the real dating world. Due to confidentiality, only users with variables showing a high correlation with the matching variable were included, and some match categories and crucial variables are absent.
License
the Attribution 4.0 International (CC BY 4.0)
Who Can Use It
- Researchers and Analysts: To investigate the underlying factors that contribute to success in online dating.
- Data Scientists and Machine Learning Practitioners: For building predictive models and engaging in educational projects related to online matchmaking.
Dataset Name Suggestions
- Online Dating Match Predictor
- Digital Romance Success Factors
- Dating App Engagement Analytics
- User Profile Match Outcomes
- Synthetic Dating Behaviour Data
Attributes
Original Data Source: Synthetic Dating Behaviour Data