Journal Home > Volume 23 , Issue 6

In recent years, there have been rapid developments in various bioinformatics technologies, which have led to the accumulation of a large amount of biomedical data. The biomedical data can be analyzed to enhance assessment of at-risk patients and improve disease diagnosis, treatment, and prevention. However, these datasets usually have many features, which contain many irrelevant or redundant information. Feature selection is a solution that involves finding the optimal subset, which is known to be an NP problem because of the large search space. Considering this, a new feature selection approach based on Binary Chemical Reaction Optimization algorithm (BCRO) and k-Nearest Neighbors (KNN) classifier is presented in this paper. Tabu search is integrated with CRO framework to enhance local search capacity. KNN is adopted to evaluate the quality of selected candidate subset. The results for an experiment conducted on nine standard medical datasets demonstrate that the proposed approach outperforms other state-of-the-art methods.


menu
Abstract
Full text
Outline
About this article

A Hybrid Algorithm Based on Binary Chemical Reaction Optimization and Tabu Search for Feature Selection of High-Dimensional Biomedical Data

Show Author's information Chaokun YanJingjing MaHuimin LuoJianxin Wang( )
School of Computer and Information Engineering, Henan University, Kaifeng 475000, China.
School of Information Science and Engineering, Central South University, Changsha 410083, China.

Abstract

In recent years, there have been rapid developments in various bioinformatics technologies, which have led to the accumulation of a large amount of biomedical data. The biomedical data can be analyzed to enhance assessment of at-risk patients and improve disease diagnosis, treatment, and prevention. However, these datasets usually have many features, which contain many irrelevant or redundant information. Feature selection is a solution that involves finding the optimal subset, which is known to be an NP problem because of the large search space. Considering this, a new feature selection approach based on Binary Chemical Reaction Optimization algorithm (BCRO) and k-Nearest Neighbors (KNN) classifier is presented in this paper. Tabu search is integrated with CRO framework to enhance local search capacity. KNN is adopted to evaluate the quality of selected candidate subset. The results for an experiment conducted on nine standard medical datasets demonstrate that the proposed approach outperforms other state-of-the-art methods.

Keywords: feature selection, biomedical data, chemical reaction optimization, tabu search

References(22)

[1]
Lee K., Man Z. H., Wang D. H., and Cao Z. W., Classification of microarray datasets using finite impulse response extreme learning machine for cancer diagnosis, presented at the IECON 2011—37th Annu. Conf. of the IEEE Industrial Electronics Society, Melbourne, Australia, 2011.
DOI
[2]
Liu H. and Zhao Z., Manipulating data and dimension reduction methods: Feature selection, presented at the Encyclopedia of Complexity and Systems Science, Berlin, Germany, 2015.
DOI
[3]
Hira Z. M. and Gillies D. F., A review of feature selection and feature extraction methods applied on microarray data, Advances in Bioinformatics, vol. 2015, p. 198363, 2015.
[4]
Kira K. and Rendell L. A., The feature selection problem: Traditional methods and a new algorithm, in Proc. Tenth National Conf. Artificial Intelligence, San Jose, CA, USA, 1992, pp. 129–134.
[5]
Baldi P. and Long A. D., A Bayesian framework for the analysis of microarray expression data: Regularized t-test and statistical inferences of gene changes, Bioinformatics, vol. 17, no. 6, pp. 509–519, 2001.
[6]
Verbiest N., Derrac J., Cornelis C., García S., and Herrera F., Evolutionary wrapper approaches for training set selection as preprocessing mechanism for support vector machines: Experimental evaluation and support vector analysis, Applied Soft Computing, vol. 38, pp. 10–22, 2016.
[7]
Peng Y. H., Wu Z. Q., and Jiang J. M., A novel feature selection approach for biomedical data classification, Journal of Biomedical Informatics, vol. 43, no. 1, pp. 15–23, 2010.
[8]
Xue B., Zhang M. J., Browne W. N., and Yao X., A survey on evolutionary computation approaches to feature selection, IEEE Transactions on Evolutionary Computation, vol. 20, no. 4, pp. 606–626, 2016.
[9]
Yagiura M. and Ibaraki T., On metaheuristic algorithms for combinatorial optimization problems, Systems and Computers in Japan, vol. 32, no. 3, pp. 33–55, 2001.
DOI
[10]
Lin S. W., Lee Z. J., Chen S. C., and Tseng T. Y., Parameter determination of support vector machine and feature selection using simulated annealing approach, Applied Soft Computing, vol. 8, no. 4, pp. 1505–1512, 2008.
[11]
Babatunde O. H., Armstrong L., Leng J. S., and Diepeveen D., A genetic algorithm-based feature selection, International Journal of Electronics Communication and Computer Engineering, vol. 5, no. 4, pp. 899–905, 2014.
[12]
Ghanad N. K. and Ahmadi S., Combination of PSO algorithm and naive Bayesian classification for Parkinson disease diagnosis, Advances in Computer Science: An International Journal, vol. 4, no. 4, pp. 119–125, 2015.
[13]
Hu B.,Dai Y. Q.,Su Y.,Moore P., Zhang X. W., Mao C. S., Chen J., and Xu L. X., Feature selection for optimized high-dimensional biomedical data using the improved shuffled frog leaping algorithm, presented at the IEEE/ACM Trans. on Computational Biology and Bioinformatics, 2016.
[14]
Vieira S. M., Mendonça L. F., Farinha G. J., and Sousa J. M. C., Modified binary pso for feature selection using SVM applied to mortality prediction of septic patients, Applied Soft Computing, vol. 13, no. 8, pp. 3494–3504, 2013.
[15]
Subanya B. and Rajalaxmi R. R., Feature selection using artificial bee colony for cardiovascular disease classification, presented at the 2014 Int. Conf. on Electronics and Communication Systems (ICECS), Coimbatore, India, 2014.
DOI
[16]
Xing B. and Gao W. J., Chemical-reaction optimization algorithm, in Innovative Computational Intelligence: A Rough Guide to 134 Clever Algorithms. Springer, 2014.
DOI
[17]
Alatas B., ACROA: Artificial chemical reaction optimization algorithm for global optimization, Expert Systems with Applications, vol. 38, no. 10, pp. 13170–13180, 2011.
[18]
Glover F. and Laguna M., Tabu search, in Handbook of Combinatorial Optimization, Du D. Z. and Pardalos P. M.,eds. Springer, 2013, pp. 3261–3362.
DOI
[19]
Glover F., Tabu search: A tutorial, Interfaces, vol. 20, no. 4, pp. 74–94, 1990.
[20]
Wade B. S. C., Joshi S. H., Gutman B. A., and Thompson P. M., Machine learning on high dimensional shape data from subcortical brain surfaces: A comparison of feature selection and classification methods, Pattern Recognition, vol. 63, pp. 731–739, 2017.
[21]
Franek L. and Jiang X. Y., Orthogonal design of experiments for parameter learning in image segmentation, Signal Processing, vol. 93, no. 6, pp. 1694–1704, 2013.
[22]
Cui W. G., Li X. H., Zhou S. B., and Weng J., Investigation on process parameters of electrospinning system through orthogonal experimental design, Journal of Applied Polymer Science, vol. 103, no. 5, pp. 3105–3112, 2007.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 06 April 2018
Accepted: 04 May 2018
Published: 15 October 2018
Issue date: December 2018

Copyright

© The authors 2018

Acknowledgements

This work was supported in part by the Natural Science Foundation of Henan Province (No. 14A520042), Scientific Research Foundation of the Higher Education Institutions of Henan Province (No. 18A520021), the National Natural Science Foundation of China (No. 61802114), and the National Key Technology R&D Program of China (No. 2015BAK01B06).

Rights and permissions

Return