Dataset Iris

Il dataset Iris è un dataset multivariato introdotto da Ronald Fisher nel 1936. Consiste in 150 istanze di Iris misurate da Edgar Anderson e classificate secondo tre specie: Iris setosa, Iris virginica e Iris versicolor. Le quattro variabili considerate sono la lunghezza e la larghezza del sepalo e del petalo. A causa di errori, esistono diverse versioni del dataset utilizzate nella letteratura scientifica.^[1]

Il dataset Iris viene utilizzato nell'ambito dell'apprendimento automatico come esempio di classificazione statistica.^[2]^[3]

^ (EN) Bezdek, J.C., Keller, J.M.; Krishnapuram, R.; Kuncheva, L.I.; Pal, N.R., Will the real iris data please stand up?, in IEEE Transactions on Fuzzy Systems, vol. 7, n. 3, IEEE, 1999, pp. 368-369, DOI:10.1109/91.771092, ISSN 1063-6706 (WC · ACNP).
^ (EN) An introduction to machine learning with scikit-learn, su scikit-learn.
^ (EN) Yanchang Zhao, R and Data Mining: Examples and Case Studies (PDF), 26 aprile 2013.

[1] (EN) Bezdek, J.C., Keller, J.M.; Krishnapuram, R.; Kuncheva, L.I.; Pal, N.R., Will the real iris data please stand up?, in IEEE Transactions on Fuzzy Systems, vol. 7, n. 3, IEEE, 1999, pp. 368-369, DOI:10.1109/91.771092, ISSN 1063-6706 (WC · ACNP).

[2] (EN) An introduction to machine learning with scikit-learn, su scikit-learn.

[3] (EN) Yanchang Zhao, R and Data Mining: Examples and Case Studies (PDF), 26 aprile 2013.

[1]

[2]

[3]