Most Popular Kaggle Datasets

Data Science and Analytics

Tags and Keywords

Kaggle, Datasets, Popularity, Analytics, Upvoted

Free

About

This dataset explores the most popular datasets available on Kaggle. It was created to address the absence of a centralised dataset of highly upvoted datasets. The project compiles and analyses the top 100 datasets by user votes, offering insight into the popularity and usability of datasets on Kaggle.

Columns

  • Dataset_Name: The title of the dataset. There are 99 unique dataset titles.
  • Author: The creator or organisation responsible for the dataset. The most common author is Murat KOKLU.
  • Last_Update: The most recent date the dataset was updated. Dates range from 2016 to 2025.
  • Usability: A score indicating how easy the dataset is to use, with scores ranging from 50 to 100. The mean usability score is 86.5.
  • File_Count: The number of files included within the dataset. The count ranges from 1 to over 717,000 files.
  • Data_Type: The types or formats of the data files. CSV is the most common format, accounting for 61% of datasets.
  • Size: The total size of the dataset files, measured in units like KB or MB. Sizes vary, with 9 KB being a common size.
  • Upvote: The number of user upvotes or endorsements received by the dataset. Upvotes range from 1,401 to over 52,000.
  • Rank: The popularity position or medal (e.g., Gold, Bronze) among compared datasets. Gold rank is held by 90% of the datasets.
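As a rough illustration of working with the Size column, the sketch below converts size strings to a byte count so that sizes can be compared numerically. The exact string format used in the file is not documented here, so the pattern (a number followed by B, KB, MB, GB or TB, as in "9 KB") and the 1024-based multipliers are assumptions, and parse_size is a hypothetical helper.

```python
import re

# Assumed unit multipliers (1 KB = 1024 bytes); the listing does not say
# whether sizes are binary or decimal.
_UNITS = {"B": 1, "KB": 1024, "MB": 1024**2, "GB": 1024**3, "TB": 1024**4}

# Assumed format: a number, optional whitespace, then a unit such as KB or MB.
_SIZE_RE = re.compile(r"^\s*([\d.]+)\s*([KMGT]?B)\s*$", re.IGNORECASE)


def parse_size(text):
    """Convert a size string such as '9 KB' to a number of bytes.

    Returns None when the string does not match the assumed format.
    """
    match = _SIZE_RE.match(text)
    if match is None:
        return None
    try:
        value = float(match.group(1))
    except ValueError:
        # Strings such as '1.2.3' pass the regex but are not valid numbers.
        return None
    return int(value * _UNITS[match.group(2).upper()])
```

Values that do not fit the assumed pattern come back as None, so they can be inspected by hand rather than silently miscounted.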

Distribution

The dataset is provided as a single CSV file, kaggle_top_100_dataset.csv, with 9 columns and 99 rows. The file size is 7.45 KB. Among the listed datasets, CSV is the predominant data type, with other formats also present. Sample files are to be added to the platform separately.
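A minimal sketch of loading the file and checking its structure with Python's standard csv module follows. It assumes the header row uses exactly the nine column names listed under Columns; load_rows is a hypothetical helper, and the sample row (including its date format) is made up so the snippet runs without the real file.

```python
import csv
import io

# Column names as given in the Columns section; that the file's header
# matches them exactly is an assumption.
EXPECTED_COLUMNS = [
    "Dataset_Name", "Author", "Last_Update", "Usability", "File_Count",
    "Data_Type", "Size", "Upvote", "Rank",
]


def load_rows(handle):
    """Read all rows from an open CSV handle after checking the header."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


# In-memory stand-in for kaggle_top_100_dataset.csv; the row is invented.
sample = io.StringIO(
    "Dataset_Name,Author,Last_Update,Usability,File_Count,"
    "Data_Type,Size,Upvote,Rank\n"
    "Example Dataset,Example Author,2024-01-01,100,1,CSV,9 KB,1500,Gold\n"
)
rows = load_rows(sample)
print(len(rows), "row(s),", len(rows[0]), "columns")
```

With the downloaded file, the handle would instead come from something like open("kaggle_top_100_dataset.csv", newline="", encoding="utf-8") (the encoding is an assumption), and the listing suggests the result should contain 99 rows.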

Usage

This dataset is ideal for:
  • Gaining insights into dataset popularity and usability trends on Kaggle.
  • Analysing patterns in user engagement with datasets.
  • Identifying characteristics of highly upvoted datasets.
  • Researching effective dataset creation strategies for platforms like Kaggle.
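One way to start on these questions is to group datasets by Data_Type and compare upvotes. The sketch below is illustrative only: upvotes_by_type is a hypothetical helper, the rows are made up, and it assumes each row holds a single Data_Type value and that Upvote is an integer that may contain thousands separators.

```python
from collections import defaultdict
from statistics import mean


def upvotes_by_type(rows):
    """Return {data_type: (dataset_count, mean_upvotes)} for a list of row dicts."""
    grouped = defaultdict(list)
    for row in rows:
        # Strip thousands separators such as '52,000' before converting (assumed format).
        upvotes = int(str(row["Upvote"]).replace(",", ""))
        grouped[row["Data_Type"]].append(upvotes)
    return {dtype: (len(votes), mean(votes)) for dtype, votes in grouped.items()}


# Made-up rows for illustration; real rows would come from the CSV file.
rows = [
    {"Data_Type": "CSV", "Upvote": "1,500"},
    {"Data_Type": "CSV", "Upvote": "2500"},
    {"Data_Type": "JSON", "Upvote": "3000"},
]
summary = upvotes_by_type(rows)
for dtype, (count, avg) in summary.items():
    print(dtype, count, avg)
```

The same grouping could be applied to Author or Rank by changing the key, which keeps the sketch independent of any particular analysis library.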

Coverage

The dataset covers datasets hosted on the Kaggle platform.
  • Time Range: Last_Update dates span 2016 to 2025. The dataset is expected to be updated annually.
  • Geographic Scope: Not specified, as it pertains to an online platform.
  • Demographic Scope: Not specified.

License

CC0: Public Domain

Who Can Use It

This dataset is suitable for:
  • Data scientists and analysts interested in understanding data trends and popularity metrics.
  • Kaggle users and dataset creators looking to improve their dataset design and visibility.
  • Researchers studying online communities and data sharing platforms.
  • Anyone aiming to explore insights into dataset popularity and usability.

Dataset Name Suggestions

  • Kaggle Top 100 Datasets
  • Most Popular Kaggle Datasets
  • Kaggle Dataset Popularity Analysis
  • Upvoted Kaggle Datasets Overview

Attributes

Original Data Source: Most Popular Kaggle Datasets

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 20/07/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
