Reviewing ensemble classification methods in breast cancer

Hosni, Mohamed; Abnane, Ibtissam; Idri, Ali; Carrillo-de- Gea, Juan Manuel; FernÁndez-Alemán, José Luis

Por favor, use este identificador para citar o enlazar este ítem: https://doi.org/10.1016/j.cmpb.2019.05.019

RefMan EndNote BibTex RefWorks Excel CSV PDF Mendeley

Título:	Reviewing ensemble classification methods in breast cancer
Fecha de publicación:	20-may-2019
Editorial:	Elsevier
Cita bibliográfica:	Computer Methods and Programs in Biomedicine, 177, 89-112
ISSN:	Print: 0169-2607 Electronic: 1872-7565
Palabras clave:	Breast cancer Classification Ensemble methods Machine learning Data mining
Resumen:	Context: Ensemble methods consist of combining more than one single technique to solve the same task. This approach was designed to overcome the weaknesses of single techniques and consolidate their strengths. Ensemble methods are now widely used to carry out prediction tasks (e.g. classification and regression) in several fields, including that of bioinformatics. Researchers have particularly begun to employ ensemble techniques to improve research into breast cancer, as this is the most frequent type of cancer and accounts for most of the deaths among women. Objective and method: The goal of this study is to analyse the state of the art in ensemble classification methods when applied to breast cancer as regards 9 aspects: publication venues, medical tasks tackled, empirical and research types adopted, types of ensembles proposed, single techniques used to construct the ensembles, validation framework adopted to evaluate the proposed ensembles, tools used to build the ensembles, and optimization methods used for the single techniques. This paper was undertaken as a systematic mapping study. Results: A total of 193 papers that were published from the year 20 0 0 onwards, were selected from four online databases: IEEE Xplore, ACM digital library, Scopus and PubMed. This study found that of the six medical tasks that exist, the diagnosis medical task was that most frequently researched, and that the experiment-based empirical type and evaluation-based research type were the most dominant ap- proaches adopted in the selected studies. The homogeneous type was that most widely used to perform the classification task. With regard to single techniques, this mapping study found that decision trees, support vector machines and artificial neural networks were those most frequently adopted to build en- semble classifiers. In the case of the evaluation framework, the Wisconsin Breast Cancer dataset was the most frequently used by researchers to perform their experiments, while the most noticeable vali- dation method was k-fold cross-validation. Several tools are available to perform experiments related to ensemble classification methods, such as Weka and R Software. Few researchers took into account the optimisation of the single technique of which their proposed ensemble was composed, while the grid search method was that most frequently adopted to tune the parameter settings of a single classifier. Conclusion: This paper reports an in-depth study of the application of ensemble methods as regards breast cancer. Our results show that there are several gaps and issues and we, therefore, provide researchers in the field of breast cancer research with recommendations. Moreover, after analysing the papers found in this systematic mapping study, we discovered that the majority report positive results concerning the ac- curacy of ensemble classifiers when compared to the single classifiers. In order to aggregate the evidence reported in literature, it will, therefore, be necessary to perform a systematic literature review and meta- analysis in which an in-depth analysis could be conducted so as to confirm the superiority of ensemble classifiers over the classical techniques.
Autor/es principal/es:	Hosni, Mohamed Abnane, Ibtissam Idri, Ali Carrillo-de- Gea, Juan Manuel FernÁndez-Alemán, José Luis
Forma parte de:	MPHR- PPR1/09-2015-2018, BIZDEVOPS-Global (RTI2018-098309-B-C33)
Versión del editor:	https://www.sciencedirect.com/science/article/pii/S0169260719301907
URI:	http://hdl.handle.net/10201/149102
DOI:	https://doi.org/10.1016/j.cmpb.2019.05.019
Tipo de documento:	info:eu-repo/semantics/article
Número páginas / Extensión:	37
Derechos:	info:eu-repo/semantics/openAccess Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Descripción:	© 2019, Computer Methods and Programs in Biomedicine. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/. This document is the Accepted version of a Published Work that appeared in final form in Computer Methods and Programs in Biomedicine. To access the final edited and published work see https://doi.org/10.1016/j.cmpb.2019.05.019
Aparece en las colecciones:	Artículos

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
ReviewingEnsembleClassificationMethods.pdf	© [2019] Computer Methods and Programs in Biomedicine. Published by Elsevier. This is an Accepted Manuscript of an article published by Elsevier in Computer Methods and Programs in Biomedicine, available online: https://doi.org/10.1016/j.cmpb.2019.05.019.	1,41 MB	Adobe PDF	Visualizar/Abrir

Mostrar el registro Dublin Core completo del ítem Mostrar el registro PREMIS del ítem Estadísticas

Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons