Titanic Beginners Dataset

Tags and Keywords

Titanic, Survival, Passengers, Demographics, Visualisation

Free

About

This dataset provides key information on Titanic passengers, primarily for visualising survival outcomes and demographic trends. It merges the Titanic test file with the gender submission data, so each passenger record carries a survival label alongside the demographic fields. This makes it a convenient resource for creating charts and exploring the factors associated with survival. It is particularly suitable for beginners starting out with data visualisation and foundational predictive modelling tasks.

Columns

  • PassengerId: Unique identifier for each passenger.
  • Survived: Indicates whether the passenger survived (1) or died (0).
  • Pclass: The passenger's ticket class (1st, 2nd, or 3rd).
  • Name: The full name of the passenger.
  • Sex: The gender of the passenger (male or female).
  • Age: The age of the passenger. Note that approximately 21% of age values are missing.
  • SibSp: The number of siblings or spouses accompanying the passenger.
  • Parch: The number of parents or children accompanying the passenger.
  • Ticket: The ticket number.
  • Fare: The fare paid for the ticket.
  • Cabin: The cabin number. Note that approximately 78% of cabin values are missing.
  • Embarked: The port where the passenger embarked (Cherbourg, Queenstown, or Southampton).
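
As a minimal sketch of how the columns above might be read into typed records, the following uses only Python's standard library. The function name `parse_passengers`, the choice of which columns are integers or floats, and the sample rows are assumptions made for illustration; the sample rows are made up and are not taken from the dataset, and the encoding of the Embarked column in the real file is not stated in this listing.

```python
import csv
import io

# Column types inferred from the column descriptions (an assumption,
# not a schema published with the dataset).
INT_COLUMNS = ("PassengerId", "Survived", "Pclass", "SibSp", "Parch")
FLOAT_COLUMNS = ("Age", "Fare")


def parse_passengers(csv_text):
    """Parse CSV text into a list of dicts, storing empty cells as None."""
    rows = []
    for raw in csv.DictReader(io.StringIO(csv_text)):
        row = {}
        for column, value in raw.items():
            value = (value or "").strip()
            if value == "":
                row[column] = None          # missing value (e.g. Age, Cabin)
            elif column in INT_COLUMNS:
                row[column] = int(value)
            elif column in FLOAT_COLUMNS:
                row[column] = float(value)
            else:
                row[column] = value         # Name, Sex, Ticket, Cabin, Embarked
        rows.append(row)
    return rows


# Made-up rows for illustration only; not real passenger records.
# The one-letter Embarked codes here are an assumed encoding.
SAMPLE_CSV = """PassengerId,Survived,Pclass,Name,Sex,Age,SibSp,Parch,Ticket,Fare,Cabin,Embarked
1,0,3,"Doe, Mr. John",male,34.5,0,0,A100,7.25,,Q
2,1,1,"Roe, Mrs. Jane",female,,1,0,B200,71.0,C85,C
"""

passengers = parse_passengers(SAMPLE_CSV)
print(passengers[0]["Name"], passengers[1]["Age"])
```

To use this with the downloaded file, the text passed to `parse_passengers` would come from reading the CSV from disk; the file name depends on how the download is saved.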

Distribution

The dataset is provided as a 29.47 kB CSV file with 12 columns and 418 records. The 'Age', 'Fare', and 'Cabin' columns have missing values, and most 'Cabin' entries are missing.
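
One way to check the missing-value figures before charting or modelling is to count empty cells per column. The sketch below assumes that missing values appear as empty cells in the CSV; the function name `missing_share` and the sample rows are hypothetical and not part of the dataset.

```python
import csv
import io


def missing_share(csv_text):
    """Return {column: fraction of empty cells} for CSV text with a header row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = reader.fieldnames or []
    empty = {column: 0 for column in columns}
    total = 0
    for row in reader:
        total += 1
        for column in columns:
            if not (row[column] or "").strip():
                empty[column] += 1
    if total == 0:
        return {column: 0.0 for column in columns}
    return {column: count / total for column, count in empty.items()}


# Made-up rows for illustration only; not real passenger records.
SAMPLE_CSV = """PassengerId,Age,Fare,Cabin
1,34.5,7.25,
2,,71.0,C85
3,22,,
4,40,8.05,
"""

for column, share in missing_share(SAMPLE_CSV).items():
    print(f"{column}: {share:.0%} missing")
```

Run against the full file, the 'Age' and 'Cabin' shares should land near the approximate percentages quoted in the column descriptions if the empty-cell assumption holds.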

Usage

This dataset is ideal for:
  • Creating visualisations to understand the Titanic passenger demographics and survival rates.
  • Developing and testing basic predictive models to determine survival probability.
  • Exploratory data analysis for beginners in data science.
  • Studying historical passenger data and its implications.
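
A simple starting point for the visualisation and modelling uses above is the survival rate per group, for example by Sex or Pclass. The sketch below is one possible approach using the standard library; `survival_rate_by`, the text bar chart, and the sample rows are illustrative assumptions rather than anything shipped with the dataset. Because the listing says the survival labels come from the gender submission data, it may be worth checking how closely Survived tracks Sex before building a model on it.

```python
import csv
import io
from collections import defaultdict


def survival_rate_by(csv_text, column):
    """Return {group value: share of passengers with Survived == 1}."""
    counts = defaultdict(lambda: [0, 0])  # group -> [survivors, passengers]
    for row in csv.DictReader(io.StringIO(csv_text)):
        group = row[column]
        counts[group][0] += int(row["Survived"])
        counts[group][1] += 1
    return {group: survived / total
            for group, (survived, total) in counts.items()}


# Made-up rows for illustration only; not real passenger records.
SAMPLE_CSV = """PassengerId,Survived,Pclass,Sex
1,0,3,male
2,1,1,female
3,1,3,female
4,0,2,male
5,1,1,male
"""

# Print a rough text bar chart, 20 characters wide at a rate of 100%.
for group, rate in sorted(survival_rate_by(SAMPLE_CSV, "Pclass").items()):
    bar = "#" * round(rate * 20)
    print(f"Pclass {group}: {rate:.0%} {bar}")
```

The same function can be called with "Sex" or "Embarked" as the grouping column, and the resulting dictionary can be passed to any plotting library in place of the text bars.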

Coverage

The dataset covers passengers from the Titanic voyage, with demographic details including age, sex, and passenger class. Geographic scope is limited to the embarkation ports: Cherbourg, Queenstown, and Southampton. The data reflects the conditions and outcomes of the Titanic disaster; no time range is stated beyond that historical context.

License

CC0: Public Domain. No URL for the license is provided.

Who Can Use It

This dataset is beneficial for:
  • Beginner data scientists: For learning data visualisation, cleaning, and introductory machine learning.
  • Students: Undertaking projects related to historical data analysis or predictive modelling.
  • Data analysts: For quick insights and chart generation.
  • Researchers: Interested in socio-economic aspects and survival analysis of historical events.

Dataset Name Suggestions

  • Titanic Passenger Survival Data
  • Titanic Demographics and Survival
  • Titanic Beginners Dataset
  • Titanic Merged Passenger Records

Attributes

Original Data Source: Titanic Beginners Dataset

Listing Stats

Views: 2
Downloads: 0
Listed: 08/07/2025
Region: Global
Universal Data Quality Score (UDQS): 5 / 5
Version: 1.0
