Predicting Heart Disease using Logistic Regression

Mochammad Anshori, M. Syauqi Haris

Abstract


A common risk of death is caused by heart disease. It is critical in the field of medicine to be able to diagnose cardiac disease in order to adequately prevent and treat patients. The most accurate method of prediction has the potential to both extend the patient's life and reduce the severity of their cardiac disease. The use of machine learning is one approach that may be taken to generate predictions. In this study, patient medical record information was used in conjunction with an algorithm for logistic regression in order to make heart disease diagnoses. The outcomes of the logistic regression have been utilized to achieve a high level of accuracy in the prediction of heart disease. To get the model coefficients needed for the equation, the experiment uses an iterative form of the logistic regression test. Iteration 14 produced the best results, with an accuracy of 81.3495% and an average calculation time of 0.020 seconds. The best iteration was reached at that point. The percentage of space that lies beneath the ROC curve is 89.36%. The findings of this study have significant implications for the field of heart disease prediction and can contribute to improved patient care and outcomes. Accurate predictions obtained through logistic regression can guide healthcare professionals in identifying individuals at risk and implementing preventive measures or tailored treatment plans. The computational efficiency of the model further enhances its applicability in real-time decision support systems.

Full Text:

PDF

References


S. S. Maghdid and T. A. Rashid, “An Extensive Dataset for the Heart Disease Classification System,” Mendeley Data, 2022.

WHO, “Cardiovascular diseases,” World Health Organization, 2020. https://www.who.int/health-topics/cardiovascular-diseases#tab=tab_1 (accessed Aug. 08, 2022).

C. B. C. Latha and S. C. Jeeva, “Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques,” Informatics Med. Unlocked, vol. 16, p. 100203, 2019.

A. Alshukry et al., “Clinical characteristics of coronavirus disease 2019 (COVID-19) patients in Kuwait,” PLoS One, vol. 15, no. 11, p. e0242768, Nov. 2020.

S. M. Nagarajan, V. Muthukumaran, R. Murugesan, R. B. Joseph, M. Meram, and A. Prathik, “Innovative feature selection and classification model for heart disease prediction,” J. Reliab. Intell. Environ., vol. 8, no. 4, pp. 333–343, Dec. 2022.

S.-J. Kim, “Global Awareness of Myocardial Infarction Symptoms in General Population,” Korean Circ. J., vol. 51, no. 12, p. 997, 2021.

R. Ndejjo, G. Musinguzi, F. Nuwaha, H. Bastiaens, and R. K. Wanyenze, “Understanding factors influencing uptake of healthy lifestyle practices among adults following a community cardiovascular disease prevention programme in Mukono and Buikwe districts in Uganda: A qualitative study,” PLoS One, vol. 17, no. 2, p. e0263867, Feb. 2022.

A. K. Gárate-Escamila, A. Hajjam El Hassani, and E. Andrès, “Classification models for heart disease prediction using feature selection and PCA,” Informatics Med. Unlocked, vol. 19, p. 100330, 2020.

S. M. M. Hasan, M. A. Mamun, M. P. Uddin, and M. A. Hossain, “Comparative Analysis of Classification Approaches for Heart Disease Prediction,” in 2018 International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2), Feb. 2018, pp. 1–4.

M. Anshori, F. Mar’i, and F. A. Bachtiar, “Comparison of Machine Learning Methods for Android Malicious Software Classification based on System Call,” in 2019 International Conference on Sustainable Information Engineering and Technology (SIET), Sep. 2019, pp. 343–348.

P. Thombare, M. Ghalme, S. Raut, N. Dhakne, and P. R. Dholi, “Prediction of Heart Disease using Machine Learning Techniques,” Int. Res. J. Mod. Eng. Technol. Sci., vol. 04, no. 06, pp. 1099–1102.2022.

H. Gulfam Ahmad and M. Jasim Shah, “Prediction of Cardiovascular Diseases ( CVDs ) Using Machine Learning Techniques in Health,” Azerbaijan J. High Perform. Comput., vol. 4, no. 2, pp. 267–279, Dec. 2021.

S. D. Desai, S. Giraddi, P. Narayankar, N. R. Pudakalakatti, and S. Sulegaon, “Back-Propagation Neural Network Versus Logistic Regression in Heart Disease Classification,” in Advances in Intelligent Systems and Computing, 2019, pp. 133–144.

W. Książek, M. Gandor, and P. Pławiak, “Comparison of various approaches to combine logistic regression with genetic algorithms in survival prediction of hepatocellular carcinoma,” Comput. Biol. Med., vol. 134, p. 104431, Jul. 2021.

J. P. Li, A. U. Haq, S. U. Din, J. Khan, A. Khan, and A. Saboor, “Heart Disease Identification Method Using Machine Learning Classification in E-Healthcare,” IEEE Access, vol. 8, pp. 107562–107582, 2020.

J. Vijayashree and H. P. Sultana, “A Machine Learning Framework for Feature Selection in Heart Disease Classification Using Improved Particle Swarm Optimization with Support Vector Machine Classifier,” Program. Comput. Softw., vol. 44, no. 6, pp. 388–397, Nov. 2018.

C. M. Bhatt, P. Patel, T. Ghetia, and P. L. Mazzeo, “Effective Heart Disease Prediction Using Machine Learning Techniques,” Algorithms, vol. 16, no. 2, p. 88, Feb. 2023.

L. Ali et al., “A Feature-Driven Decision Support System for Heart Failure Prediction Based on X2 Statistical Model and Gaussian Naive Bayes,” Comput. Math. Methods Med., vol. 2019, pp. 1–8, Nov. 2019.

A. Elsayad and M. Fakhr, “Diagnosis of cardiovascular diseases with bayesian classifiers,” J. Comput. Sci., vol. 11, no. 2, pp. 274–282, 2015.

S. Asadi, S. Roshan, and M. W. Kattan, “Random forest swarm optimization-based for heart diseases diagnosis,” J. Biomed. Inform., vol. 115, p. 103690, Mar. 2021.

K. Subhadra and B. Vikas, “Neural network based intelligent system for predicting heart disease,” Int. J. Innov. Technol. Explor. Eng., vol. 8, no. 5, pp. 484–487, 2019.

L. Ali et al., “An Optimized Stacked Support Vector Machines Based Expert System for the Effective Prediction of Heart Failure,” IEEE Access, vol. 7, pp. 54007–54014, 2019.

R. TR, U. K. Lilhore, P. M, S. Simaiya, A. Kaur, and M. Hamdi, “Predictive analysis of heart diseases with machine learning approaches,” Malaysian J. Comput. Sci., pp. 132–148, Mar. 2022.

S. I. Ayon, M. M. Islam, and M. R. Hossain, “Coronary Artery Heart Disease Prediction: A Comparative Study of Computational Intelligence Techniques,” IETE J. Res., vol. 68, no. 4, pp. 2488–2507, Jul. 2022.

M. M. Ghiasi, S. Zendehboudi, and A. A. Mohsenipour, “Decision tree-based diagnosis of coronary artery disease: CART model,” Comput. Methods Programs Biomed., vol. 192, p. 105400, Aug. 2020.

T. K. Sajja and H. K. Kalluri, “A Deep Learning Method for Prediction of Cardiovascular Disease Using Convolutional Neural Network,” Rev. d’Intelligence Artif., vol. 34, no. 5, pp. 601–606, Nov. 2020.

S. Nusinovici et al., “Logistic regression was as good as machine learning for predicting major chronic diseases,” J. Clin. Epidemiol., vol. 122, pp. 56–69, Jun. 2020.

D. Maulud and A. M. Abdulazeez, “A Review on Linear Regression Comprehensive in Machine Learning,” J. Appl. Sci. Technol. Trends, vol. 1, no. 4, pp. 140–147, 2020.

Z. Huang and D. Chen, “A Breast Cancer Diagnosis Method Based on VIM Feature Selection and Hierarchical Clustering Random Forest Algorithm,” IEEE Access, vol. 10, pp. 3284–3293, 2022.

A. Swift, R. Heale, and A. Twycross, “What are sensitivity and specificity?,” Evid. Based Nurs., vol. 23, no. 1, pp. 2–4, Jan. 2020.

E. Frank, M. A. Hall, and I. H. Witten, The WEKA workbench. Morgan Kaufmann, 2016.

K. Kirasich, T. Smith, and B. Sadler, “Random Forest vs Logistic Regression: Binary Classification for Heterogeneous Datasets,” SMU Data Sci. Rev., vol. 1, no. 3, p. 9, 2018.

L. de S. Rodrigues, E. T. Matsubara, and B. M. Nogueira, “Learning a Fast Bipartite Ranker for Text Documents Using Lexicographical Rankers and ROC Curves,” in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Nov. 2017, pp. 1307–1312.




DOI: http://dx.doi.org/10.17977/um018v5i22022p188-196

Refbacks

  • There are currently no refbacks.


Copyright (c) 2022 Knowledge Engineering and Data Science

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Flag Counter

Creative Commons License


This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

View My Stats