A Weighted Majority Voting Ensemble Model for Disease Prediction Boosted by PSO: The Case of Type 2 Diabetes

Authors

  • Muljono Muljono
  • Novian Adhipurna

DOI:

https://doi.org/10.47839/ijc.22.4.3354

Keywords:

Diabetes, Machine learning, Ensemble majority voting, PSO

Abstract

Early detection of diabetes is critical to reducing the number of cases, which continues to rise year after year. Many approaches to diagnosis have been used, but they still have flaws in making clinical decisions that are more effective and efficient. The use of intelligent systems is very effective in assisting in data analysis and clinical decision support. The purpose of this article is to develop a model to predict diabetes mellitus using the Pima Indian Diabetes Dataset (PIDD). The ensemble method has shown to be quite effective at increasing accuracy, but it has the issue of determining the optimal weight. As a result, to improve prediction accuracy, this study employs PSO optimization in the selection of ensemble majority voting weights. The test results show that predictions for ensemble majoritarian voting using PSO perform well, with an accuracy rate of 0.9322, precision of 0.9412, recall of 0.8421, and F1-score of 0.8889. PSO accuracy is improved by 4% and 7%, respectively. This demonstrates that applying PSO to the ensemble model can improve accuracy.

References

S. Li, J. Wang, B. Zhang, X. Li, and Y. Liu, “Diabetes mellitus and cause-specific mortality: A population-based study,” Diabetes Metab. J., vol. 43, no. 3, p. 319, 2019. https://doi.org/10.4093/dmj.2018.0060.

M. S. Rahman et al., “Role of insulin in health and disease: An update,” Int. J. Mol. Sci., vol. 22, no. 12, p. 6403, 2021. https://doi.org/10.3390/ijms22126403.

A. Z. Woldaregay et al., “Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes,” Artif. Intell. Med., vol. 98, no. April 2018, pp. 109–134, 2019. https://doi.org/10.1016/j.artmed.2019.07.007.

H. Qteat and M. Awad, “Using hybrid model of particle swarm optimization and multi-layer perceptron neural networks for classification of diabetes,” Int. J. Intell. Eng. Syst., vol. 14, no. 3, pp. 11–22, 2021. https://doi.org/10.22266/ijies2021.0630.02.

N. Esser, K. M. Utzschneider, and S. E. Kahn, “On the causal relationships between hyperinsulinaemia, insulin resistance, obesity and dysglycaemia in type 2 diabetes: Reply to Johnson JD [letter],” Diabetologia, vol. 64, no. 10, pp. 2345–2347, 2021. https://doi.org/10.1007/s00125-021-05511-6.

P. Kostagiolas, P. Tsiligros, P. Theodorou, N. Tentolouris, and D. Niakas, “A cross-sectional survey interconnecting health information seeking behavior with clinical data of type 2 diabetes mellitus patients,” Libr. Hi Tech, vol. 39, no. 2, pp. 448–461, 2020. https://doi.org/10.1108/LHT-02-2020-0030.

International Diabetes Federation, IDF Diabetes Atlas, 9th edition, 2019.

A. U. Haq et al., “Intelligent machine learning approach for effective recognition of diabetes in e-healthcare using clinical data,” Sensors (Switzerland), vol. 20, no. 9, article no. 2649, 2020. https://doi.org/10.3390/s20092649.

G. Battineni, G. G. Sagaro, C. Nalini, F. Amenta, and S. K. Tayebati, “Comparative machine-learning approach: A follow-up study on type 2 diabetes predictions by cross-validation methods,” Machines, vol. 7, no. 4, pp. 1–11, 2019. https://doi.org/10.3390/machines7040074.

Y. Jian, M. Pasquier, A. Sagahyroon, and F. Aloul, “A machine learning approach to predicting diabetes complications,” Healthcare, vol. 9, no. 12, p. 1712, 2021. https://doi.org/10.3390/healthcare9121712.

D. D. Rufo, T. G. Debelee, A. Ibenthal, and W. G. Negera, “Diagnosis of diabetes mellitus using gradient boosting machine (Lightgbm),” Diagnostics, vol. 11, no. 9, pp. 1–14, 2021. https://doi.org/10.3390/diagnostics11091714.

H. Jafarzadeh, M. Mahdianpari, E. Gill, F. Mohammadimanesh, and S. Homayouni, “Bagging and boosting ensemble classifiers for classification of multispectral, hyperspectral and PolSAR data: A comparative evaluation,” Remote Sens., vol. 13, no. 21, p. 4405, 2021. https://doi.org/10.3390/rs13214405,

L. J. Muhammad, E. A. Algehyne, and S. S. Usman, “Predictive supervised machine learning models for diabetes mellitus,” SN Comput. Sci., vol. 1, no. 5, pp. 1–10, 2020. https://doi.org/10.1007/s42979-020-00250-8,

S. K. Kalagotla, S. V. Gangashetty, and K. Giridhar, “A novel stacking technique for prediction of diabetes,” Comput. Biol. Med., vol. 135, no. June, p. 104554, 2021. https://doi.org/10.1016/j.compbiomed.2021.104554,

S. Kumari, D. Kumar, and M. Mittal, “An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier,” Int. J. Cogn. Comput. Eng., vol. 2, no. November 2020, pp. 40–46, 2021. https://doi.org/10.1016/j.ijcce.2021.01.001.

M. Shahhosseini, G. Hu, and H. Pham, “Optimizing ensemble weights and hyperparameters of machine learning models for regression problems,” Mach. Learn. with Appl., vol. 7, p. 100251, 2022. https://doi.org/10.1016/j.mlwa.2022.100251,

Y. Xu, C. Wu, K. Zheng, X. Wang, X. Niu, and T. Lu, “Computing adaptive feature weights with PSO to improve android malware detection,” Secur. Commun. Networks, vol. 2017, Article ID 3284080, 2017. https://doi.org/10.1155/2017/3284080.

M. Maniruzzaman et al., “Accurate diabetes risk stratification using machine learning: Role of missing value and outliers,” J. Med. Syst., vol. 42, no. 5, p. 92, 2018. https://doi.org/10.1007/s10916-018-0940-7.

G. Aksu, C. O. Güzeller, and M. T. Eser, “The effect of the normalization method used in different sample sizes on the success of artificial neural network model,” Int. J. Assess. Tools Educ., vol. 6, no. 2, pp. 170–192, 2019. https://doi.org/10.21449/ijate.479404.

M. S. Sainin, R. Alfred, and F. Ahmad, “Ensemble Meta Classifier with Sampling and Feature Selection for Data with Multiclass Imbalance Problem,” J. Inf. Commun. Technol., vol. 20, no. 2, pp. 103–133, 2021. https://doi.org/10.32890/jict2021.20.2.1.

S. Y. Kim and A. Upneja, “Majority voting ensemble with a decision trees for business failure prediction during economic downturns,” J. Innov. Knowl., vol. 6, no. 2, pp. 112–123, 2021. https://doi.org/10.1016/j.jik.2021.01.001.

H. Bouziane, B. Messabih, and A. Chouarfia, “Profiles and majority voting-based ensemble method for protein secondary structure prediction,” Evol. Bioinforma., vol. 7, no. 7, p. EBO.S7931, 2011. https://doi.org/10.4137/EBO.S7931.

J. Kennedy and R. Eberhart, “Particle swarm optimization,” Proceedings of the International Conference on Neural Networks ICNN’95, 1995, vol. 4, pp. 1942–1948.

D. Ratnawati, M. Marjono, W. Widodo, and S. Anam, “PSO-ELM with time-varying inertia weight for classification of SMILES codes,” Int. J. Intell. Eng. Syst., vol. 13, no. 6, pp. 522–532, 2020. https://doi.org/10.22266/ijies2020.1231.46.

Y. Hayashi and S. Yukita, “Rule extraction using recursive-rule extraction algorithm with J48graft combined with sampling selection techniques for the diagnosis of type 2 diabetes mellitus in the Pima Indian dataset,” Informatics Med. Unlocked, vol. 2, pp. 92–104, 2016. https://doi.org/10.1016/j.imu.2016.02.001.

Downloads

Published

2023-12-31

How to Cite

Muljono, M., & Adhipurna, N. (2023). A Weighted Majority Voting Ensemble Model for Disease Prediction Boosted by PSO: The Case of Type 2 Diabetes. International Journal of Computing, 22(4), 475-484. https://doi.org/10.47839/ijc.22.4.3354

Issue

Section

Articles