Opendatabay APP

Python Library Manifest Dataset

Software and Technology

Tags and Keywords

Computer

Science

Programming

Beginner

Nlp

Clustering

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Python Library Manifest Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset provides a curated list of Python packages, along with descriptive summaries and licensing information. It was compiled due to the absence of a centralised reference for Python packages available within Kaggle notebooks. The dataset was created by gathering names of over 600 installed packages from Kaggle, cross-referencing with Anaconda's package list, and then querying the Python Package Index (PyPI) JSON API to enrich the details for approximately 400 remaining packages. It serves as a valuable manifest for understanding the Python environment in Kaggle notebooks and beyond.

Columns

  • package_name: The specific name of a Python package.
  • version: The version of this package that the notebook uses.
  • summary: A brief description outlining what this package does.
  • license: Details on how the package is licensed.
  • metadata_source: The origin from which the summary and license information were gathered, such as PyPI or Anaconda.

Distribution

The dataset typically takes the form of a data file, structured as a list where each entry represents a unique Python package. It contains information for 630 unique package names. Specific row or record counts are not available, but the initial compilation involved over 600 packages, which were then refined to about 400 with detailed metadata.

Usage

This dataset is ideally suited for:
  • Discovering and exploring available Python packages and their functionalities.
  • Researchers and developers needing quick access to package summaries and licensing details.
  • Analysing trends in Python package usage and distribution within environments like Kaggle.
  • Educational purposes, providing a structured overview of commonly used Python libraries.

Coverage

The dataset’s coverage is global in scope, reflecting Python packages widely used across various regions. It was listed on 27th June 2025. There are no specific notes on demographic scope.

License

CC0

Who Can Use It

This dataset is beneficial for a wide range of users, including:
  • Data scientists seeking to identify suitable Python libraries for their projects.
  • Software developers looking for package details and licensing information.
  • Academics and students conducting research on programming languages, specifically Python, or learning about its ecosystem.
  • Beginners in programming who need clear descriptions of Python packages.

Dataset Name Suggestions

  • Python Library Manifest
  • Kaggle Python Package Index
  • Python Package Overview
  • Installed Python Packages

Attributes

Original Data Source: Python Package List

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

27/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free