"Learning Morphological Data of Tomato Fruits" by Joshua C. Thomas, Matthew Lambert et al.
 

Learning Morphological Data of Tomato Fruits

Publication Date

2011

Document Type

Conference Proceeding

Abstract

Three methods for attribute reduction in conjunction with Neural Networks, Naive Bayes, and k-Nearest Neighbor classifiers are investigated here when classifying a particularly challenging data set. The difficulty encountered with this data set is mainly due to the high dimensionality and to some inbalance between classes. As a result of this research, a subset of only 8 attributes (out of 34) is identified leading to a 92.7% classification accuracy. The confusion matrix analysis identifies class 7 as the one poorly learned across all combinations of attributes and classifiers. This information can be further used to upsample this underrepresented class or to investigate a classifier less sensitive to imbalance.

Keywords

Attribute selection, Classification, Confusion matrix

Published Version

Share

COinS