Opendatabay APP

Kaggle Popular Dataset Metadata

Data Science and Analytics

Tags and Keywords

Kaggle

Dataset

Metadata

Trends

Popularity

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Kaggle Popular Dataset Metadata Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset offers a snapshot of 2150 upvoted datasets from Kaggle, a popular and growing platform for data sharing. It provides detailed information on each dataset, capturing metadata as of 26 February 2018. The primary purpose is to enable exploration of Kaggle dataset characteristics, offering insights into trends, popularity, and content. Researchers and data scientists can leverage this collection to understand the dynamics of publicly shared datasets, perform data visualisation, and build predictive models for future dataset attributes.

Columns

  • Title: The main title of the Kaggle dataset.
  • Subtitle: A brief secondary description or subtitle for the dataset.
  • Owner: The creator or owner of the dataset on Kaggle.
  • Vote: The total number of votes received by the dataset, indicating its popularity.
  • Version History: Details about the different versions released for the dataset.
  • Tags: Keywords or labels associated with the dataset, aiding categorisation and search.
  • Datatype: The primary file format of the data, predominantly CSV.
  • Size: The physical size of the dataset file.
  • License: The licensing terms under which the dataset is distributed, often CC0: Public Domain.
  • Views: The total number of times the dataset page has been viewed.
  • Downloads: The total number of times the dataset has been downloaded.
  • Kernels: The number of Kaggle Kernels (notebooks) associated with the dataset.
  • Topics: The number of topics or discussion threads linked to the dataset.
  • URL: The direct web link to the dataset on Kaggle.
  • Description: A detailed explanation of the dataset's content and context.

Distribution

This dataset is provided as a CSV file (voted-kaggle-dataset.csv) and is approximately 3.88 MB in size. It comprises 2150 unique records, each representing a distinct Kaggle dataset. The data is structured across 15 distinct columns, as detailed above.

Usage

This dataset is ideal for:
  • Predictive modelling: Forecasting upcoming dataset characteristics such as topics, number of votes, or download counts.
  • Data visualisation: Creating visualisations to identify patterns and clusters within Kaggle's dataset ecosystem.
  • Exploratory data analysis: Understanding trends in dataset popularity, common topics, and ownership on the Kaggle platform.
  • Marketplace analysis: Gaining insights into the metadata and attributes that contribute to a dataset's visibility and usage.

Coverage

The dataset provides a snapshot of Kaggle datasets collected on 26 February 2018, focusing on those with at least two votes. It encompasses data from the global Kaggle platform, reflecting a wide range of topics and owners. There are no specific geographic or demographic restrictions mentioned for the scope of the datasets included; it broadly covers publicly shared datasets on the platform.

License

CC0: Public Domain

Who Can Use It

  • Data scientists: Interested in analysing trends and patterns in public datasets.
  • Machine learning engineers: For building models to predict dataset performance or characteristics.
  • Researchers: Studying data sharing ecosystems and community-driven data platforms.
  • Academics: Utilising a real-world dataset for educational purposes in data science courses.
  • Platform developers: Seeking insights to improve data marketplace features and recommendations.

Dataset Name Suggestions

  • Kaggle Popular Dataset Metadata
  • Kaggle Dataset Analytics Snapshot
  • Voted Datasets on Kaggle
  • Kaggle Data Trends 2018
  • Public Kaggle Dataset Information

Attributes

Original Data Source: Kaggle Popular Dataset Metadata

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

26/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format