A fuzzy K-nearest neighbor classifier to deal with imperfect data

Cadenas, Jose M.; Garrido Carrera, María del Carmen; Martínez, Raquel; Muñoz, Enrique; Bonissone, Piero P.

Por favor, use este identificador para citar o enlazar este ítem: https://doi.org/10.1007/s00500-017-2567-x

RefMan EndNote BibTex RefWorks Excel CSV PDF Mendeley

Título:	A fuzzy K-nearest neighbor classifier to deal with imperfect data
Fecha de publicación:	1-abr-2017
Editorial:	Springer-Verlag
Cita bibliográfica:	Soft Comput (2018) 22: 3313–3330
Materias relacionadas:	CDU::6 - Ciencias aplicadas::62 - Ingeniería. Tecnología::621 - Ingeniería mecánica en general. Tecnología nuclear. Electrotecnia. Maquinaria::621.3 - Ingeniería eléctrica. Electrotecnia. Telecomunicaciones
Palabras clave:	k-nearest neighbors Classification Imperfect data Distance/dissimilarity measures Combination methods
Resumen:	The k-nearest neighbors method (kNN) is a nonparametric, instance-based method used for regression and classification. To classify a new instance, the kNN method computes its k nearest neighbors and generates a class value from them. Usually, this method requires that the information available in the datasets be precise and accurate, except for the existence of missing values. However, data imperfection is inevitable when dealing with real-world scenarios. In this paper, we present the kNNimp classifier, a k-nearest neighbors method to perform classification from datasets with imperfect value. The importance of each neighbor in the output decision is based on relative distance and its degree of imperfection. Furthermore, by using external parameters, the classifier enables us to define the maximum allowed imperfection, and to decide if the final output could be derived solely from the greatest weight class (the best class) or from the best class and a weighted combination of the closest classes to the best one. To test the proposed method, we performed several experiments with both synthetic and realworld datasets with imperfect data. The results, validated through statistical tests, show that the kNNimp classifier is robust when working with imperfect data and maintains a good performance when compared with other methods in the literature, applied to datasets with or without imperfection.
Autor/es principal/es:	Cadenas, Jose M. Garrido Carrera, María del Carmen Martínez, Raquel Muñoz, Enrique Bonissone, Piero P.
URI:	http://hdl.handle.net/10201/137783
DOI:	https://doi.org/10.1007/s00500-017-2567-x
Tipo de documento:	info:eu-repo/semantics/article
Número páginas / Extensión:	18
Derechos:	info:eu-repo/semantics/embargoedAccess
Aparece en las colecciones:	Artículos

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
JoseMCadenas.pdf		382,6 kB	Adobe PDF	Visualizar/Abrir Solicitar una copia

Mostrar el registro Dublin Core completo del ítem Mostrar el registro PREMIS del ítem Estadísticas