Iris Flower Extended Dataset
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is an advanced version of the widely-used Iris dataset, expanding its utility with new and realistic features. It introduces additional attributes, increasing the dataset to 1200 rows. These new features offer a more detailed understanding of iris flower biology and ecology. The dataset is particularly well-suited for educational purposes, ecological research, and machine learning applications that benefit from varied and multi-dimensional data. Key additions include environmental factors like elevation and soil type, alongside advanced morphological features such as petal curvature and texture, providing an enriched view of iris characteristics.
Columns
- Species: The specific species of the iris flower (options include Setosa, Versicolor, Virginica).
- Elevation: The elevation level (in meters) where the iris plant was discovered.
- Soil Type: The classification of soil where the iris was found (e.g., Loamy, Sandy, Clayey).
- Sepal Length (cm): The length of the sepal, measured in centimetres.
- Sepal Width (cm): The width of the sepal, measured in centimetres.
- Petal Length (cm): The length of the petal, measured in centimetres.
- Petal Width (cm): The width of the petal, measured in centimetres.
- Sepal Area (cm^2): The calculated area of the sepal, derived from Sepal Length multiplied by Sepal Width.
- Petal Area (cm^2): The calculated area of the petal, derived from Petal Length multiplied by Petal Width.
- Sepal Aspect Ratio: The ratio between Sepal Length and Sepal Width.
- Petal Aspect Ratio: The ratio between Petal Length and Petal Width.
- Sepal-to-Petal Length Ratio: The ratio comparing Sepal Length to Petal Length.
- Sepal-to-Petal Width Ratio: The ratio comparing Sepal Width to Petal Width.
- Sepal-Petal Length Difference: The numerical difference between Sepal Length and Petal Length.
- Sepal-Petal Width Difference: The numerical difference between Sepal Width and Petal Width.
- Petal Curvature (mm): A measurement of the petal's curvature in millimetres.
- Petal Texture (trichomes/mm^2): The count of trichomes per square millimetre on the petal surface.
- Leaf Area (cm^2): The typical leaf area of the iris plant, in square centimetres.
- Sepal Area Sqrt: The square root of the Sepal Area.
- Petal Area Sqrt: The square root of the Petal Area.
- Area Ratios: A custom attribute encapsulating various area ratios within the dataset.
Distribution
This dataset is provided as a CSV file, named
iris_extended.csv
, with a size of approximately 278.66 kB. It contains 1200 records or rows and features 21 distinct columns. There are no missing values reported across the listed columns, ensuring data integrity for analysis.Usage
This enriched dataset is ideally suited for:
- Advanced botanical and ecological studies requiring multi-dimensional data analysis.
- Educational case studies in biology, ecology, or data science curricula.
- Developing and evaluating machine learning algorithms that benefit from diverse and multi-dimensional input.
- Exploratory data analysis and visualisation to gain deeper insights into iris flower characteristics.
- Classification tasks, building upon the foundational use cases of the original Iris dataset.
Coverage
The dataset focuses on the biological and ecological aspects of iris flowers, covering species Setosa, Versicolor, and Virginica. While specific geographic locations or time ranges are not explicitly detailed, the inclusion of elevation and soil type attributes suggests a range of environmental conditions. The data provides a snapshot for research and study, without specific notes on data availability for certain groups or years beyond the included species.
License
Attribution 4.0 International (CC BY 4.0)
Who Can Use It
- Data Scientists and Machine Learning Engineers: For training and testing classification models, or for advanced feature engineering exercises.
- Botanists and Ecologists: For in-depth studies of iris flower biology, morphology, and environmental interactions.
- Educators and Students: As a practical resource for teaching data science principles, biological concepts, and statistical analysis.
- Researchers: Seeking multi-dimensional data to explore complex biological relationships and patterns.
- Data Analysis Enthusiasts: Interested in exploring and visualising an expanded version of a classic dataset.
Dataset Name Suggestions
- Iris Flower Extended Dataset
- Multi-Feature Iris Data
- Iris Environmental & Morphological Data
- Advanced Iris Plant Dataset
- Iris 2.0 Scientific Dataset
Attributes
Original Data Source: Iris Flower Extended Dataset