Opendatabay APP

Superconductor Feature Engineering Data

Data Science and Analytics

Tags and Keywords

Superconductor

Physics

Materials

Prediction

Critical

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Superconductor Feature Engineering Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This data product enables researchers and modellers to build predictive models for determining a superconductor's critical temperature based purely on its elemental and structural features. It includes raw data derived from public scientific sources, offering a valuable resource for studying the properties that govern superconductivity. The primary goal is to facilitate accurate predictive analysis for material discovery and characterisation.

Columns

The dataset includes 81 distinct feature attributes, along with the critical temperature (Tc) as the label column, totalling 82 columns in the main file (train.csv). The features are derived characteristics of the superconducting material's constituent elements.
Key attributes include:
  • number_of_elements: The count of chemical elements present.
  • mean_atomic_mass: The average atomic mass of the constituent elements.
  • wtd_mean_atomic_mass: The weighted average atomic mass.
  • gmean_atomic_mass: The geometric mean of atomic mass.
  • wtd_gmean_atomic_mass: The weighted geometric mean of atomic mass.
  • entropy_atomic_mass: The entropy related to atomic mass.
  • range_atomic_mass: The range of atomic mass values.
  • std_atomic_mass: The standard deviation of atomic mass.
A separate file (unique_m.csv) contains the chemical formula broken down for each material.

Distribution

The dataset is structured into two main files, typically delivered in CSV format. The primary file, train.csv, contains 21,263 records/superconductors and occupies approximately 23.86 MB. The data is notably clean, showing 100% validity and 0% missing values across the sampled features.

Usage

This data is ideally suited for developing advanced statistical models and machine learning algorithms, such as regression models, intended to predict the critical temperature of new or existing materials. It serves as foundational training data for academic research in condensed matter physics, materials informatics, and data science competitions focused on physical property prediction.

Coverage

The scope of this dataset covers 21,263 distinct superconducting compounds. The data is physics-centric, focusing on the properties of the material composition and structure. Specific geographical, time range, or demographic scope limitations are not applicable to this physical science data.

License

CC0: Public Domain

Who Can Use It

  • Materials Scientists: For validating theories on superconducting properties and material composition.
  • Data Scientists/Machine Learning Engineers: For training and evaluating predictive models (e.g., critical temperature regression).
  • Researchers in Physics: Studying the relationship between elemental attributes and superconductivity onset.

Dataset Name Suggestions

  • Superconductivity Critical Temperature Prediction Dataset
  • Superconductor Feature Engineering Data
  • Materials Science Tc Regression Data
  • Data-Driven Superconductor Model Features

Attributes

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

31/10/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in ZIP Format