Mushroom Edibility Predictor Dataset
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset details the physical characteristics of mushrooms, enabling their classification as poisonous or edible. It includes descriptions for 23 hypothetical species belonging to the Agaricus and Lepiota Families. Each species is identified as either definitely edible or definitely poisonous, with species of unknown edibility grouped with the poisonous category. The dataset highlights that there is no simple, universal rule for determining a mushroom's edibility, much like the caution against Poisonous Oak and Ivy.
Columns
The dataset comprises 23 distinct columns, each detailing a specific attribute of the mushrooms:
- cap-shape: Describes the shape of the mushroom cap, with values such as bell, conical, convex, flat, knobbed, or sunken.
- cap-surface: Characterises the texture of the cap surface, including fibrous, grooves, scaly, or smooth.
- cap-color: Indicates the colour of the mushroom cap, with options such as brown, buff, cinnamon, grey, green, pink, purple, red, white, or yellow.
- bruises?: A binary indicator showing whether the mushroom bruises (true) or not (false).
- odor: Describes the mushroom's scent, with categories like almond, anise, creosote, fishy, foul, musty, none, pungent, or spicy.
- gill-attachment: Details how the gills are attached to the stalk, for instance, attached, descending, free, or notched.
- gill-spacing: Refers to the spacing between the gills, either close, crowded, or distant.
- gill-size: Indicates the size of the gills, being either broad or narrow.
- gill-color: Specifies the colour of the gills, including black, brown, buff, chocolate, grey, green, orange, pink, purple, red, white, or yellow.
- stalk-shape: Describes the shape of the mushroom stalk, either enlarging or tapering.
- stalk-root: Details the base of the stalk, such as bulbous, club, cup, equal, rhizomorphs, rooted, or missing.
- stalk-surface-above-ring: Characterises the stalk surface above the ring, including fibrous, scaly, silky, or smooth.
- stalk-surface-below-ring: Describes the stalk surface below the ring, with options like fibrous, scaly, silky, or smooth.
- stalk-color-above-ring: Indicates the stalk colour above the ring, with possibilities like brown, buff, cinnamon, grey, orange, pink, red, white, or yellow.
- stalk-color-below-ring: Specifies the stalk colour below the ring, similar to above-ring colours.
- veil-type: Describes the type of veil, either partial or universal (only partial is observed in this dataset).
- veil-color: Indicates the colour of the veil, such as brown, orange, white, or yellow.
- ring-number: States the number of rings present on the stalk, with options none, one, or two.
- ring-type: Details the style of the ring, including cobwebby, evanescent, flaring, large, none, pendant, sheathing, or zone.
- spore-print-color: Describes the colour of the mushroom's spore print, with options such as black, brown, buff, chocolate, green, orange, purple, white, or yellow.
- population: Indicates the mushroom's population density, like abundant, clustered, numerous, scattered, several, or solitary.
- habitat: Specifies the mushroom's natural environment, for example, grasses, leaves, meadows, paths, urban, waste, or woods.
- class: The target variable, classifying the mushroom as edible or poisonous.
Distribution
The dataset is provided as a CSV file named
mushroom.csv
, with a file size of 942.69 kB. It contains 8124 records (rows) and 23 columns. All records across all columns are valid, with no missing or mismatched entries observed.Usage
This dataset is ideally suited for machine learning tasks, particularly for developing binary classification models. Potential applications include:
- Predictive Modelling: Building models to predict whether a mushroom is poisonous or edible based on its attributes.
- Educational Purposes: A valuable resource for students and practitioners learning about classification algorithms and data analysis.
- Feature Engineering: Exploring the significance of different physical characteristics in determining edibility.
Coverage
The dataset focuses on 23 species of gilled mushrooms within the Agaricus and Lepiota Families. It describes their physical characteristics for classification purposes. There is no explicit geographic or temporal scope defined for the data samples.
License
Attribution 4.0 International (CC BY 4.0)
Who Can Use It
This dataset is particularly useful for:
- Data Scientists and Machine Learning Engineers: For developing and testing classification algorithms.
- Students and Educators: As an accessible dataset for teaching and learning fundamental concepts in data science and binary classification.
- Researchers: Those interested in mycological data analysis or developing tools for natural resource identification.
Dataset Name Suggestions
- Mushroom Edibility Predictor Dataset
- Fungus Characteristics Classification
- Poisonous/Edible Mushroom Attributes
- Mycological Classification Data
Attributes
Original Data Source:Mushroom Edibility Predictor Dataset