Cardiovascular Disease Prediction through Machine Learning: A Comparative Study of Ensemble Techniques
DOI:
https://doi.org/10.1807/2f7s6c18Keywords:
Heart disease prediction, Machine Learning, heart dataset, gradient boosting, Adaboost , Random Forest, XGBoost, CatBoost, Light GBM, Extra tree classifier, Ensemble Learning, Bagging, BoostingAbstract
Cardiovascular disease is a worldwide health issue that necessitates enhancements in the evaluation of risks and early identification methods. Several lethal heart illnesses have been transmitted to humans through various recognized mechanisms. According to a figure from the World Health Organization (WHO), over 17.9 million individuals in the country perish annually, representing 32% of global mortality. The increasing annual population poses a significant challenge in terms of early-stage diagnosis and treatment. Machine learning approaches have proven highly useful in the realm of healthcare. The study employed the "heart dataset" obtained from Kaggle. The implementation was carried out using the Open editor and the Python programming language. This work aims to calculate and assess the precision of ensemble machine learning algorithms in predicting cardiac disease, along with other performance metrics. The study examined seven proven models, including Gradient Boosting, AdaBoost, XGBoost, CatBoost, Light GBM (boosting techniques), and Random Forest and Extra Tree Classifier (bagging algorithm). These models were tested and trained using the heart dataset. 80% of the dataset will be allocated for training, while the remaining 20% will be used for testing. This study examines machine learning methods to determine the optimal model for predicting cardiac disease. Upon assessing multiple models, we determined that XGBoost achieves the highest accuracy and F1-score, with values of 87.50% and 88.44% respectively, surpassing the performance of other models we employed.
References
[1] M. Pal, S. Parija, G. Panda, K. Dhama, and R. K. Mohapatra, “Risk prediction of cardiovascular disease using Machine Learning Classifiers,” Open Medicine, vol. 17, no. 1, pp. 1100–1113, Jan. 2022. doi:10.1515/med-2022-0508
[2] A. Saboor, M. Usman, S. Ali, A. Samad, M. F. Abrar, and N. Ullah, “A Method for Improving Prediction of Human Heart Disease Using Machine Learning Algorithms,” Mobile Information Systems, vol. 2022, no. 1, pp. 1–9, Mar. 2022, doi: https://doi.org/10.1155/2022/1410169.
[3] A. Srivastava, Sandhya Umrao, and S. Biswas, “Exploring Forest Transformation by Analyzing Spatial-temporal Attributes of Vegetation using Vegetation Indices,” International Journal of Advanced Computer Science and Applications, vol. 14, no. 5, Jan. 2023, doi: https://doi.org/10.14569/ijacsa.2023.01405114.
[4] S. Mohan, C. Thirumalai, and G. Srivastava, “Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques,” IEEE Access, vol. 7, no. 1, pp. 81542–81554, 2019, doi: https://doi.org/10.1109/access.2019.2923707.
[5] D. Swain, B. Parmar, H. Shah, A. Gandhi, M. R. Pradhan, H. Kaur, & B. Acharya, “Cardiovascular Disease Prediction using Various Machine Learning Algorithms,” Journal of Computer Science, vol. 18, no. 10, pp. 993–1004, Oct. 2022, doi: https://doi.org/10.3844/jcssp.2022.993.1004.
[6] H. Sharma, P. Kumar, and K. Sharma, “Identification of Device Type Using Transformers in Heterogeneous Internet of Things Traffic,” Lecture notes in networks and systems, pp. 471–481, Jan. 2023, doi: https://doi.org/10.1007/978-981-99-3010-4_40.
[7] C. Gupta, A. Saha, N. V. Subba Reddy, and U. Dinesh Acharya, “Cardiac Disease Prediction using Supervised Machine Learning Techniques.,” Journal of Physics: Conference Series, vol. 2161, no. 1, p. 012013, Jan. 2022, doi: https://doi.org/10.1088/1742-6596/2161/1/012013.
[8] A. Srivastava and P. Ahmad, “A Probabilistic Gossip-based Secure Protocol for Unstructured P2P Networks,” Procedia Computer Science, vol. 78, pp. 595–602, 2016, doi: https://doi.org/10.1016/j.procs.2016.02.122.
[9] K. S. Kaswan, J. S. Dhatterwal, H. Sharma, and K. Sood, “Big Data in Insurance Innovation,” Big Data: A Game Changer for Insurance Industry, pp. 117–136, Jul. 2022, doi: https://doi.org/10.1108/978-1-80262-605-620221008.
[10] M.M. Rahman, M.R. Rana, M. Nur-A-Alam, M.S.I. Khan, K.M.M. Uddin, “A web-based heart disease prediction system using machine learning algorithms,” Network Biology, pp. 64–80, June 2022.
[11] P. Rani, R. Kumar, N. M. O. S. Ahmed, and A. Jain, “A decision support system for heart disease prediction based upon machine learning,” Journal of Reliable Intelligent Environments, Jan. 2021, doi: https://doi.org/10.1007/s40860-021-00133-6.
[12] Nutan Gusain and Himanshu Sharma, “Communication-Efficient Federated Learning in Industrial IoT — A Framework for Real-Time Threat Detection and Secure Device Coordination,” International Journal on Computational Modelling Applications, vol. 2, no. 2, pp. 18–29, May 2025, doi: https://doi.org/10.63503/j.ijcma.2025.115.
[13] P. Sapra, Divya Paikaray, Nutan Gusain, M. Abrol, S. Ramesh, and S. Bhardwaj, “Evaluation of soft computing in methodology for calculating information protection from parameters of its distribution in social networks,” Soft Computing, Jun. 2023, doi: https://doi.org/10.1007/s00500-023-08633-8.
[14] Anubhava Srivastava, Rakesh Dubey & Susham Biswas, “Comparison of Sentinel and Landsat Data Sets over Lucknow Region Using Gradient Tree Boost Supervised Classifier,” In International Conference on Emerging Trends and Technologies on Intelligent Systems (pp. 221-232), 2023. Singapore: Springer Nature Singapore.
[15] S. Dev, F. M. Savoy, Y. H. Lee, and S. Winkler, “Nighttime sky/cloud image segmentation,” in: Proc. IEEE International Conference on Image Processing (ICIP), IEEE Xplore, pp. 345–349, Sep. 01, 2017.
[16] H. Sharma, P. Kumar, and K. Sharma, “Recurrent Neural Network based Incremental model for Intrusion Detection System in IoT,” Scalable Computing: Practice and Experience, vol. 25, no. 5, pp. 3778–3795, Aug. 2024, doi: https://doi.org/10.12694/scpe.v25i5.3004.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Revolutionary Advances in Computing and Electronics: An International Journal

This work is licensed under a Creative Commons Attribution 4.0 International License.
This work is licensed under a Creative Commons Attribution 4.0 International License.