Machine Learning and Feature Selection-Enabled Optimized Technique for Heart Disease Classification and Prediction
Abstract
When dealing with a group of patients seeking treatment for heart-related diseases, doctors who specialize in the diagnosis and treatment of heart-related disorders have a difficult but critical task. It comes as no surprise that cardiovascular disease is a leading source of morbidity and death in contemporary society. An expert system with clear categorization that may assist medical professionals in identifying heart disease condition based on the clinical data of a patient is often required by physicians. The aim of this work is to provide a method for the prediction and classification of cardiac disease based on machine learning and feature selection. The correlation-based feature selection (CFS) method is applied to the input data set in order to extract relevant features for analysis. The support vector machine with radial basis function (SVM RBF) and random forest algorithms are used here for data classification. Cleveland heart disease dataset is used in the experiment work. This dataset has 303 instances and 14 attributes. The accuracy, specificity and sensitivity of SVM RBF are higher than those of the random forest algorithm.
Keywords
machine learning, heart disease prediction, accuracy, SVM RBF, CFS feature selection,References
1. A. Singh, R. Kumar, Heart disease prediction using machine learning algorithms, [in:] 2020 International Conference on Electrical and Electronics Engineering (ICE3), Gorakhpur, India, pp. 452–457, 2020, doi: 10.1109/ICE348803.2020.9122958.2. V. Sharma, S. Yadav, M. Gupta, Heart disease prediction using machine learning techniques, [in:] 2020 2nd International Conference on Advances in Computing, Communication Control and Networking (ICACCCN), Greater Noida, India, pp. 177–181, 2020, doi: 10.1109/ICACCCN51052.2020.9362842.
3. Z.I. Attia et al., An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: A retrospective analysis of outcome prediction, The Lancet, 394(10201): 861–867, 2019, doi: 10.1016/S0140-6736(19)31721-0.
4. A.E. Ulloa-Cerna et al., rECHOmmend: An ECG-based machine learning approach for identifying patients at increased risk of undiagnosed structural heart disease detectable by echocardiography, Circulation, 146(1): 36–47, 2022, doi: 10.1161/CIRCULATIONAHA.121.057869.
5. R.J.P. Princy, S. Parthasarathy, P.S. Hency Jose, A. Raj Lakshminarayanan, S. Jeganathan, Prediction of cardiac disease using supervised machine learning algorithms, [in:] 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, pp. 570–575, 2020, doi: 10.1109/ICICCS48265.2020.9121169.
6. M. Nasser, U.K. Yusof, Deep learning based methods for breast cancer diagnosis: A systematic review and future direction, Diagnostics, 13(1): 161, 2023, doi: 10.3390/diagnostics13010161.
7. M. Sivakami, P. Prabhu, A comparative review of recent data mining techniques for prediction of cardiovascular disease from electronic health records, [in:] Intelligent Data Communication Technologies and Internet of Things, ICICI 2019, D. Hemanth, S. Shakya, Z. Baig [Eds.], Lecture Notes on Data Engineering and Communications Technologies, Vol. 38, pp. 477–484, Springer, Cham, 2020, doi: 10.1007/978-3-030-34080-3_54.
8. M. Sivakami, P. Prabhu, Classification of algorithms supported factual knowledge recovery from cardiac data set, International Journal of Current Research and Review, 13(6): 161–166, 2021, doi: 10.31782/IJCRR.2021.13614.
9. V. Chang, V.R. Bhavani, A.Q. Xu, M.A. Hossain, An artificial intelligence model for heart disease detection using machine learning algorithms, Healthcare Analytics, 2: 100016, 2022, doi: 10.1016/j.health.2022.100016.
10. A. Rajkomar et al., Scalable and accurate deep learning with electronic health records, NPJ Digital Medicine, 1: 18, 2018, doi: 10.1038/s41746-018-0029-1.
11. J.M. Kwon et al., Deep learning-based algorithm for detecting aortic stenosis using electrocardiography, Journal of the American Heart Association, 9(7): e14717, 2020, doi: 10.1161/jaha.119.014717.
12. A. Frank, A. Asuncion, UCI Machine Learning Repository, University of California, School of Information and Computer Sciences, Irvine, 2010, http://archive.ics.uci.edu/ml.
13. A. Ghorbani et al., Deep learning interpretation of echocardiograms, NPJ Digital Medicine, 3: 10, 2020, doi: 10.1038/s41746-019-0216-8.
14. V. Parthasarthy, L. Pallavi, H. Anandaram, M. Praveen, S. Arun, R. Krishnamoorthy, Prediction of coronary artery disease by adapting hybrid approach of machine learning methods, [in:] 2022 3rd International Conference on Smart Electronics and Communication (ICOSEC), Trichy, India, pp. 1233–1237, 2022, doi: 10.1109/ICOSEC54921.2022.9952111.
15. J. N., D. P., M. E., R. Santhosh, R. Reshma, D. Selvapandian, Heart attack prediction using machine learning, [in:] 2022 4th International Conference on Inventive Research in Computing Applications (ICIRCA), Coimbatore, India, pp. 854–860, 2022, doi: 10.1109/ICIRCA54612.2022.9985736.
16. J. Han, M. Kamber, J. Pei, Data Mining: Concepts and Techniques, 3rd edition, Morgan Kaufmann, 2011.
17. J.R. Quinlan, C4.5: Programs for Machine Learning, Morgan Kaufmann, 1993.
18. M.A. Jabbar, P. Chandra, B.L. Deekshatulu, Prediction of risk score for heart disease using associative classification and hybrid feature subset selection, [in:] 2012 12th International Conference on Intelligent Systems Design and Applications (ISDA), Kochi, India, pp. 628–634, 2012, doi: 10.1109/ISDA.2012.6416610.
19. A. Meena Kowshalya, R. Madhumathi, N. Gopika, Correlation based feature selection algorithms for varying datasets of different dimensionality, Wireless Personal Communication, 108: 1977–1993, 2019, doi: 10.1007/s11277-019-06504-w.
20. O. Chapelle, Training a support vector machine in the primal, Neural Computation, 19(5): 1155–1178, 2007, doi: 10.1162/neco.2007.19.5.1155.
21. C.-H. Weng, T.C.-K. Huang, R.-P. Han, Disease prediction with different types of neural network classifiers, Telematics and Informatics, 33(2): 277–292, 2016, doi: 10.1016/j.tele.2015.08.006.
22. A. Janosi, W. Steinbrunn, M. Pfisterer, R. Detrano, Heart Disease. UCI Machine Learning Repository, 1988, doi: 10.24432/C52P4X.
This work is licensed under a Creative Commons Attribution 4.0 International License.