Opendatabay APP

Video Game Content Rating Prediction Dataset

Product Reviews & Feedback

Tags and Keywords

Gaming

ESRB

Ratings

Content

Video



Free

About

This dataset supports predicting ESRB video game ratings from the content descriptors assigned to each game. It covers 1895 video games and records each game's title, console, and 34 binary content descriptors that indicate the presence or absence of specific content types. The console is encoded as a binary value and the ESRB content features as a binary vector. The 'RP', 'EC', and 'A' ratings are not available in this version of the data, but they may be added in future updates.

Columns

The dataset contains the following columns, each with a description of its content:
  • title (string): The name of the video game.
  • console (integer): The console on which the game was released. A value of 0 indicates 'PS4', while 1 indicates 'PS4 & Xbox_one'.
  • Alcohol_Reference (integer): Indicates references to and/or images of alcoholic beverages. 0 = no, 1 = yes.
  • Animated_Blood (integer): Denotes discoloured and/or unrealistic depictions of blood. 0 = no, 1 = yes.
  • Blood (integer): Signifies depictions of blood. 0 = no, 1 = yes.
  • Blood_and_Gore (integer): Indicates depictions of blood or the mutilation of body parts. 0 = no, 1 = yes.
  • Cartoon_Violence (integer): Refers to violent actions involving cartoon-like situations and characters, which may include violence where a character is unharmed after the action has been inflicted. 0 = no, 1 = yes.
  • Crude_Humor (integer): Specifies depictions or dialogue involving vulgar antics, including "bathroom" humour. 0 = no, 1 = yes.
  • Drug_Reference (integer): Indicates references to and/or images of illegal drugs. 0 = no, 1 = yes.
  • Fantasy_Violence (integer): Denotes violent actions of a fantasy nature, involving human or non-human characters in situations easily distinguishable from real life. 0 = no, 1 = yes.
  • Intense_Violence (integer): Refers to graphic and realistic-looking depictions of physical conflict, potentially involving extreme and/or realistic blood, gore, weapons, and depictions of human injury and death. 0 = no, 1 = yes.
  • Language (integer): Signifies moderate use of profanity. 0 = no, 1 = yes.
  • Lyrics (integer): Indicates references to profanity, sexuality, violence, alcohol, or drug use in music. 0 = no, 1 = yes.
  • Mature_Humor (integer): Specifies depictions or dialogue involving "adult" humour, including sexual references. 0 = no, 1 = yes.
  • Mild_Blood (integer): Denotes some blood. 0 = no, 1 = yes.
  • Mild_Cartoon_Violence (integer): Refers to some violent actions involving cartoon-like situations and characters. 0 = no, 1 = yes.
  • Mild_Fantasy_Violence (integer): Indicates some violent actions of a fantasy nature. 0 = no, 1 = yes.
  • Mild_Language (integer): Signifies mild to moderate use of profanity. 0 = no, 1 = yes.
  • Mild_Lyrics (integer): Denotes mild references to profanity, sexuality, violence, alcohol, or drug use in music. 0 = no, 1 = yes.
  • Mild_Suggestive_Themes (integer): Refers to some provocative references or materials. 0 = no, 1 = yes.
  • Mild_Violence (integer): Indicates some scenes involving aggressive conflict. 0 = no, 1 = yes.
  • No_Descriptors (integer): Signifies no content descriptors are present. 0 = no, 1 = yes.
  • Nudity (integer): Refers to graphic or prolonged depictions of nudity. 0 = no, 1 = yes.
  • Partial_Nudity (integer): Denotes brief and/or mild depictions of nudity. 0 = no, 1 = yes.
  • Sexual_Content (integer): Indicates non-explicit depictions of sexual behaviour, possibly including partial nudity. 0 = no, 1 = yes.
  • Sexual_Themes (integer): Signifies references to sex or sexuality. 0 = no, 1 = yes.
  • Simulated_Gambling (integer): Denotes a player can gamble without betting or wagering real cash or currency. 0 = no, 1 = yes.
  • Strong_Language (integer): Refers to explicit and/or frequent use of profanity. 0 = no, 1 = yes.
  • Strong_Sexual_Content (integer): Indicates explicit and/or frequent depictions of sexual behaviour, possibly including nudity. 0 = no, 1 = yes.
  • Suggestive_Themes (integer): Signifies provocative references or materials. 0 = no, 1 = yes.
  • Use_of_Alcohol (integer): Denotes the consumption of alcoholic beverages. 0 = no, 1 = yes.
  • Use_of_Drugs_and_Alcohol (integer): Refers to the consumption of alcoholic beverages and drugs. 0 = no, 1 = yes.
  • Violence (integer): Indicates scenes involving aggressive conflict, which may contain bloodless dismemberment. 0 = no, 1 = yes.
  • ESRB_rating (string): The assigned ESRB rating. The rating scale is RP, EC, E, E10+, T, M, or A, but RP, EC, and A do not appear in this version of the data.
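
The column descriptions above can be turned into a simple record check. The following is a minimal sketch, not part of the dataset itself: it assumes the CSV header uses the names shown in this listing (compared case-insensitively, since the exact casing in the file is not confirmed), that the drug descriptor is spelled `Drug_Reference`, and that rating labels are written exactly as in the `ESRB_rating` description. The helper name `validate_row` is hypothetical.

```python
# Descriptor names copied from the column listing; the exact header casing in
# the CSV is an assumption, so lookups below are case-insensitive.
DESCRIPTORS = [
    "Alcohol_Reference", "Animated_Blood", "Blood", "Blood_and_Gore",
    "Cartoon_Violence", "Crude_Humor", "Drug_Reference", "Fantasy_Violence",
    "Intense_Violence", "Language", "Lyrics", "Mature_Humor", "Mild_Blood",
    "Mild_Cartoon_Violence", "Mild_Fantasy_Violence", "Mild_Language",
    "Mild_Lyrics", "Mild_Suggestive_Themes", "Mild_Violence", "No_Descriptors",
    "Nudity", "Partial_Nudity", "Sexual_Content", "Sexual_Themes",
    "Simulated_Gambling", "Strong_Language", "Strong_Sexual_Content",
    "Suggestive_Themes", "Use_of_Alcohol", "Use_of_Drugs_and_Alcohol",
    "Violence",
]
BINARY_COLUMNS = ["console"] + DESCRIPTORS
# Rating labels as written in the listing (assumed to match the file).
RATING_SCALE = {"RP", "EC", "E", "E10+", "T", "M", "A"}


def validate_row(row):
    """Return a list of human-readable problems found in one record."""
    lowered = {key.lower(): value for key, value in row.items()}
    problems = []
    if not str(lowered.get("title") or "").strip():
        problems.append("missing title")
    for name in BINARY_COLUMNS:
        value = lowered.get(name.lower())
        # Accept both integers and their string form, as read from a CSV.
        if str(value).strip() not in {"0", "1"}:
            problems.append(f"{name} should be 0 or 1, got {value!r}")
    rating = str(lowered.get("esrb_rating") or "").strip()
    if rating not in RATING_SCALE:
        problems.append(f"unexpected ESRB_rating {rating!r}")
    return problems


# Made-up record for illustration only; it is not taken from the dataset.
row = {"title": "Example Game", "console": 1, "ESRB_rating": "T"}
row.update({name: 0 for name in DESCRIPTORS})
row["Fantasy_Violence"] = 1
print(validate_row(row))  # an empty list means no problems were found
```

An empty result means the record matches the documented schema; each string in a non-empty result names one offending column.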

Distribution

The dataset is provided in CSV file format. It consists of a training set, Video_games_esrb_rating.csv, and a test set, test_esrb.csv. The Video_games_esrb_rating.csv file has a size of 168.9 kB and contains 1895 individual records (games). Each record is structured with a game title, a binary indicator for the console, and 34 binary content features, along with the corresponding ESRB rating string.
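
As a rough illustration of the record structure described above, the sketch below reads a CSV of this shape into titles, binary feature rows, and rating labels using only the Python standard library. The function name `load_esrb` is hypothetical, the sample rows are made up and use only three descriptor columns for brevity, and the code assumes every row is complete and that all columns other than the title and rating hold 0 or 1.

```python
import csv
import io


def load_esrb(fileobj):
    """Read an ESRB ratings CSV into titles, feature names, 0/1 rows, labels."""
    reader = csv.DictReader(fileobj)
    titles, features, labels = [], [], []
    feature_names = None
    for record in reader:
        # Normalise header casing, since the file's exact casing is assumed.
        row = {key.strip().lower(): value.strip() for key, value in record.items()}
        if feature_names is None:
            # Everything except the title and the rating is a binary feature.
            feature_names = [k for k in row if k not in ("title", "esrb_rating")]
        titles.append(row["title"])
        labels.append(row["esrb_rating"])
        features.append([int(row[name]) for name in feature_names])
    return titles, feature_names or [], features, labels


# Made-up sample in the documented layout; not real rows from the dataset.
SAMPLE = """title,console,Blood,Fantasy_Violence,Strong_Language,ESRB_rating
Example Game A,1,0,1,0,T
Example Game B,0,1,0,1,M
"""

titles, names, X, y = load_esrb(io.StringIO(SAMPLE))
print(names)
print(X, y)
```

To read the actual training file, the same function could be called on `open("Video_games_esrb_rating.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption, as the listing does not state one.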

Usage

This dataset is ideally suited for:
  • Developing and training machine learning models to predict the ESRB rating of video games based on their content descriptors.
  • Conducting analysis to understand the correlation and impact of various content elements on the final ESRB ratings.
  • Researching trends and patterns in video game content and its classification by rating systems.
  • Aiding game developers and publishers in understanding the criteria that influence ESRB ratings during game development.
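
The first two uses above can be sketched with the standard library alone. The example below computes, for each rating, the share of games carrying each descriptor, and predicts a rating by majority vote among the nearest training rows under Hamming distance. This is one simple baseline chosen for illustration, not a method recommended by the listing; the function names and the toy rows (three descriptor columns: Blood, Fantasy_Violence, Strong_Language) are made up.

```python
from collections import Counter


def descriptor_rates(X, y):
    """For each rating, return the share of games with each descriptor set."""
    counts = Counter(y)
    sums = {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0] * len(row))
        for i, value in enumerate(row):
            acc[i] += value
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def hamming(a, b):
    """Number of positions at which two binary vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def predict_knn(X_train, y_train, query, k=3):
    """Majority vote among the k training rows closest in Hamming distance."""
    ranked = sorted(zip(X_train, y_train), key=lambda pair: hamming(pair[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy data for illustration only; columns are Blood, Fantasy_Violence,
# Strong_Language. These rows are not taken from the dataset.
X = [[0, 1, 0], [0, 1, 0], [1, 0, 1], [1, 0, 1], [0, 0, 0]]
y = ["T", "T", "M", "M", "E"]

print(descriptor_rates(X, y))
print(predict_knn(X, y, [1, 0, 1]))
```

On the real data, the same functions would take the full binary descriptor vectors from the training file, with test_esrb.csv held out to measure accuracy.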

Coverage

  • Geographic Scope: Not specified.
  • Time Range: Not specified.
  • Demographic Scope: Not specified.
  • Data Availability Notes: ESRB ratings for 'RP' (Rating Pending), 'EC' (Early Childhood), and 'A' (Adult) are not included in the current version of this dataset. These ratings may be incorporated in subsequent updates.

License

CC0: Public Domain

Who Can Use It

This dataset is particularly useful for:
  • Data Scientists and Machine Learning Engineers who are building classification models.
  • Researchers focusing on media content analysis, specifically within the video game industry.
  • Game Industry Professionals seeking insights into content rating standards and how specific game elements contribute to overall ratings.

Dataset Name Suggestions

  • ESRB Game Content Ratings
  • Video Game Content Rating Prediction Dataset
  • ESRB Rating Descriptors for Games

Attributes

Listing Stats

VIEWS

2

DOWNLOADS

0

LISTED

24/07/2025

REGION

GLOBAL

UNIVERSAL DATA QUALITY SCORE (UDQS)

5 / 5

VERSION

1.0
