Opendatabay APP

Modified Human Sperm Morphology Dataset

Patient Health Records & Digital Health

Tags and Keywords

Sperm

Morphology

Infertility

Images

Acrosome

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Modified Human Sperm Morphology Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This collection features grayscale images of human sperm intended for machine learning applications in medical diagnostics. The data was gathered from 235 patients dealing with male factor infertility. Experts have meticulously labelled each sperm image to indicate the status of four crucial morphological components: the acrosome, the head, the vacuole, and the tail. The purpose of this data is to advance research in automated sperm morphology analysis, which could ultimately assist in the diagnosis and clinical treatment of male infertility.

Columns

The data is structured as numerical arrays stored in .npy files, representing both image pixels (x files) and corresponding binary labels (y files). There are no traditional columns as such, but rather key data components:
  • Image Data (x_ files): Contains the grayscale images of individual sperm cells, available in two resolutions: 128x128 pixels and 64x64 pixels.
  • Label Data (y_ files): Contains binary labels (0 for normal, 1 for abnormal) for four separate morphological features: Acrosome, Head, Tail, and Vacuole.

Distribution

The dataset is provided in Python/NumPy format (.npy files) and is partitioned into standard Training, Validation, and Test sets to enable robust model evaluation. The images are grayscale, focusing on a single sperm cell with the head approximately centred, though the sperm tail may not be fully visible in every image.
The image quantities are structured as follows:
  • 128x128 resolution: 1000 training images, 240 validation images, and 300 test images.
  • 64x64 resolution: 1000 training images, 240 validation images, and 300 test images.

Usage

This data product is suited for several types of automated analysis tasks, including:
  • Binary Classification: Developing distinct models to classify the normality or abnormality of each of the four identified morphological features.
  • Multi-label Classification: Creating sophisticated models capable of simultaneously predicting the abnormality status across all four features in a single instance.
  • Image Analysis: Facilitating research into the visual features most indicative of sperm abnormalities.
  • Benchmarking: Serving as a standardised resource for comparing the effectiveness of different computer vision algorithms designed for sperm morphology analysis.

Coverage

The data focuses exclusively on morphological analysis of human sperm samples. The demographic scope includes samples sourced from 235 patients suffering from male factor infertility. Specific geographical origin or time range for the collection is not explicitly detailed.

License

CC BY-NC 3.0

Who Can Use It

This dataset is designed for use by researchers in computer science and biology, particularly those focused on medical imaging, automated diagnostics, and computer vision. Intended users include academics developing new machine learning models for clinical diagnostics and engineers aiming to benchmark image analysis performance.

Dataset Name Suggestions

  • MHSMA (Sperm Morphology Analysis Dataset)
  • Modified Human Sperm Morphology Dataset
  • Automated Male Fertility Diagnosis Images
  • Sperm Morphological Abnormality Classifier Data

Attributes

Listing Stats

VIEWS

2

DOWNLOADS

0

LISTED

02/11/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in ZIP Format