Opendatabay APP

Forensic Glass Composition Dataset

Data Science and Analytics

Tags and Keywords

Glass

Forensic

Classification

Chemical

Evidence

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Forensic Glass Composition Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset focuses on the identification of different types of glass, primarily for forensic science applications. Originating from the USA Forensic Science Service, it was created by B. German and donated by Vina Spiehler in September 1987. The data was gathered to assist in criminological investigations, particularly in correctly identifying glass fragments found at crime scenes, which can serve as crucial evidence. The dataset has been used in comparative tests of classification algorithms, including rule-based systems like BEAGLE, nearest-neighbour algorithms, and discriminant analysis, where the nearest-neighbour method performed well relative to the rule-based system.

Columns

The dataset comprises 10 continuously valued attributes, including an ID number, plus a class attribute indicating the type of glass. There are no missing attribute values.
  • Id: A unique identification number for each instance, ranging from 1 to 214.
  • RI: Refractive index, a measure of how light propagates through the glass.
  • Na: Sodium content, measured as weight percent in the corresponding oxide.
  • Mg: Magnesium content, measured as weight percent in the corresponding oxide.
  • Al: Aluminium content, measured as weight percent in the corresponding oxide.
  • Si: Silicon content, measured as weight percent in the corresponding oxide.
  • K: Potassium content, measured as weight percent in the corresponding oxide.
  • Ca: Calcium content, measured as weight percent in the corresponding oxide.
  • Ba: Barium content, measured as weight percent in the corresponding oxide.
  • Fe: Iron content, measured as weight percent in the corresponding oxide.
  • Type of glass: The classification attribute, indicating the glass type:
    • 1: building windows (float processed)
    • 2: building windows (non-float processed)
    • 3: vehicle windows (float processed)
    • 4: vehicle windows (non-float processed) – none in this dataset
    • 5: containers
    • 6: tableware
    • 7: headlamps

Distribution

This tabular dataset consists of 214 instances (rows) and 11 attributes, including the class attribute, all of which are continuously valued. The dataset is provided in a CSV format, specifically glass.csv, and is approximately 10.6 kB in size. The class distribution across the 214 instances is as follows:
  • Window glass (163 instances):
    • Float processed: 87 instances (70 building windows, 17 vehicle windows)
    • Non-float processed: 76 instances (76 building windows, 0 vehicle windows)
  • Non-window glass (51 instances):
    • Containers: 13 instances
    • Tableware: 9 instances
    • Headlamps: 29 instances

Usage

This dataset is ideally suited for applications in machine learning, particularly for classification tasks and rule induction. It is highly valuable for:
  • Forensic Science Research: Developing and testing methods for glass identification in criminological investigations.
  • Algorithm Comparison: Benchmarking and comparing the performance of various classification algorithms, such as nearest-neighbour, rule-based systems, and discriminant analysis.
  • Material Science Studies: Analysing the chemical composition of different glass types and their classification.
  • Educational Purposes: Providing a practical example for teaching data classification, attribute analysis, and the application of machine learning in real-world scenarios.

Coverage

The data originates from the USA Forensic Science Service. The collection date is September 1987. The scope of the dataset covers various types of glass commonly found in building windows, vehicle windows, containers, tableware, and headlamps. It is noted that there are no instances of 'vehicle windows (non-float processed)' within this particular database.

License

CC0: Public Domain

Who Can Use It

This dataset is beneficial for a range of users, including:
  • Forensic scientists and researchers: For developing and refining techniques in glass evidence analysis.
  • Data scientists and machine learning engineers: For building and evaluating classification models.
  • Academics and students: For educational purposes in fields such as data science, chemistry, and criminology.
  • Anyone interested in material classification: To explore the relationship between chemical composition and glass type.

Dataset Name Suggestions

  • Forensic Glass Composition Dataset
  • Glass Type Classification Data
  • Chemical Glass Identification Data
  • Criminological Glass Analysis Dataset
  • Material Property Glass Classification

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

10/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format