Comparison of silhouette‐based reallocation methods for vegetation classification

dc.contributor.authorLengyel, Attila
dc.contributor.authorRoberts, David W.
dc.contributor.authorBotta-Dukát, Zoltán
dc.date.accessioned2022-09-09T18:24:26Z
dc.date.available2022-09-09T18:24:26Z
dc.date.issued2021-01
dc.description.abstractAims: Vegetation classification seeks to partition the variability of vegetation into relatively homogeneous but distinct types. There are many ways to evaluate, and potentially improve, such a partitioning. One effective approach involves calculating silhouette widths which measure the goodness-of-fit of plots to their cluster. We introduce a new iterative reallocation clustering method — Reallocation of Misclassified Objects based on Silhouette width (REMOS) — and compare its performance with an existing algorithm — OPTimizing SILhouette widths (OPTSIL). REMOS reallocates misclassified objects to their nearest-neighbour cluster iteratively. Of its two variants, REMOS1 reallocates only the object with the lowest silhouette width, while REMOS2 reallocates all objects with negative silhouette width in each iteration. We test how REMOS1, REMOS2 and OPTSIL perform in terms of: (a) cluster homogeneity and separation; (b) the number of diagnostic species; and (c) runtime. Methods: We classified simulated data with the flexible-beta algorithm for values of beta from −1 to 0. These classifications were subsequently optimized by REMOS1, REMOS2 and OPTSIL and compared for mean silhouette widths, misclassification rate, and runtime. We classified three vegetation data sets from two to ten clusters, optimized all outcomes with the three reallocation methods, and compared their mean silhouette widths, misclassification rate, and number of diagnostic species. Results: OPTSIL achieved the highest mean silhouette width across the majority of the data sets. REMOS achieved zero or negligible misclassifications, outperforming OPTSIL on this criterion. REMOS algorithms were typically more than an order of magnitude faster to calculate than OPTSIL. There was no clear difference between REMOS and OPTSIL in the number of diagnostic species. Conclusions:REMOS algorithms may be preferable to OPTSIL when: (a) the primary objective is to reduce the number of negative silhouette widths in a classification, as opposed to maximizing mean silhouette width; or (b) when the time efficiency of the algorithm is important.en_US
dc.identifier.citationLengyel, A., Roberts, D. W., & Botta‐Dukát, Z. (2021). Comparison of silhouette‐based reallocation methods for vegetation classification. Journal of Vegetation Science, 32(1), e12984.en_US
dc.identifier.issn1100-9233
dc.identifier.urihttps://scholarworks.montana.edu/handle/1/17105
dc.language.isoen_USen_US
dc.publisherWileyen_US
dc.rightscc-by-nc-nden_US
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/en_US
dc.subjectclassicationen_US
dc.subjectclusteringen_US
dc.subjectflexible-betaen_US
dc.subjectiterativeen_US
dc.subjectoptimclassen_US
dc.subjectoptimizationen_US
dc.subjectoptsilen_US
dc.subjectremosen_US
dc.subjectsilhouetteen_US
dc.subjectvalidationen_US
dc.titleComparison of silhouette‐based reallocation methods for vegetation classificationen_US
dc.typeArticleen_US
mus.citation.extentfirstpage1en_US
mus.citation.extentlastpage10en_US
mus.citation.issue1en_US
mus.citation.journaltitleJournal of Vegetation Scienceen_US
mus.citation.volume32en_US
mus.data.thumbpage7en_US
mus.identifier.doi10.1111/jvs.12984en_US
mus.relation.collegeCollege of Letters & Scienceen_US
mus.relation.departmentEcology.en_US
mus.relation.universityMontana State University - Bozemanen_US

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
lengyel-vegetation-2021.pdf
Size:
1.27 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
826 B
Format:
Item-specific license agreed upon to submission
Description:
Copyright (c) 2002-2022, LYRASIS. All rights reserved.