Examination and Evaluation of Obesity Risk Factors with Explainable Artificial Intelligence
PDF File

Keywords

Explainable AI
Obesity detection
Machine learning
XGBoost
InterpretML

How to Cite

Examination and Evaluation of Obesity Risk Factors with Explainable Artificial Intelligence. (2024). Computers and Electronics in Medicine, 1(1), 12-17. https://doi.org/10.69882/adba.cem.2024072

Abstract

There is an increasing need for effective methods for the detection and management of obesity, which is an important public health problem worldwide and is critical for the sustainability of health systems. This study examines the effectiveness of data preprocessing and machine learning techniques in detecting obesity. Data preprocessing steps, including the removal of unnecessary data, handling missing values, and addressing data imbalance, are necessary to enhance the accuracy of machine learning algorithms. In this study, data preprocessing steps were applied to an obesity dataset to make it suitable for machine learning. Using a dataset of 2111 patients, this study evaluates the effectiveness of machine learning techniques in detecting obesity. Following the completion of data preprocessing, obesity was identified using various machine learning algorithms, including Decision Tree Classification, Random Forest Classification, Naive Bayes Classification, KNN, and XGBoosting, and their performances were compared. According to the results, the XGBoosting algorithm exhibited the highest accuracy (0.92), precision, recall, and F1-score values. Explainable Artificial Intelligence (XAI) techniques, such as SHAP and InterpretML, were employed to understand the effects of obesity parameters and determine which parameters have a greater impact on obesity. Through visualizing and analyzing the effects of obesity parameters, these techniques facilitated the identification of significant parameters in obesity detection. The findings demonstrate that the XGBoosting algorithm outperforms other algorithms in detecting obesity. Furthermore, XAI techniques play a crucial role in comprehending obesity parameters. Specifically, a family history of obesity and factors like FCVC and CAEC appear to have more significant effects compared to others.

PDF File

References

A. Moore, M. B., 2022 Xgboost, a novel explainable ai technique, in the prediction of myocardial infarction, a uk biobank cohort study. Med. Cardiovascular-Medicine.

B. Kui, R. M. M. N. N. F. N. G. A. V. J. B. S. G. J. C. I. S. A. I. P. S. R. H. G. P. M. P. Z. V. G. K. E. F. I. F. F. I. L. G. R. F., József Pintér, 2022 Easy-app: An artificial intelligence model and application for early and easy prediction of severity in acute pancreatitis.

Clinical and Translational Medicine . Berrar, D., 2019 Bayes’ theorem and naive bayes classifier .

Boateng, O. J. . A. D. A., E. Y., 2020 Basic tenets of classification algorithms k-nearest-neighbor, support vector machine, random mforest and neural network: a review. Journal of Data Analysis and Information Processing 8: 341–357.

Bray, G. A., 2003 Risks of obesity. Endocrinology and Metabolism Clinics 32: 787–804.

Chang Hu, Q. Z. Y. L. F. W. X. Z. Z. P., Qing Tan, 2022 Application of interpretable machine learning for early prediction of prognosis in acute kidney injury. Computational and Structural Biotechnology Journal .

Charbuty, . A. A., B., 2021 Classification based on decision tree algorithm for machine learning. Journal of Applied Science and Technology Trends 2: 20–28.

Chen, . G. C., T., 2016 Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACMSIGKDD International Conference on Knowledge Discovery and Data Mining pp. 785–794.

Feng,W.W. J. M. S. . T. E., D. C., 2021 Interpretable xgboost-shap machine-learning model for shear strength prediction of squat rc walls. Journal of Structural Engineering 147: 040.

Hafsa Binte Kibria, M. O. F. G. M. A. J. H., Md Nahiduzzaman, 2022 An ensemble approach for the prediction of diabetes mellitus using a soft voting classifier with an explainable ai. Sensors (Basel, Switzerland) .

Hssina, M. A. E. H. . E. M., B., 2014 A comparative study of decision tree id3 and c4.5. International Journal of Advanced Computer Science and Applications 4: 13–19.

Hui Wen Loh, S. S. P. D. B. F. M. U. R. A., Chui Ping Ooi, 2022 Application of explainable artificial intelligence for healthcare: A systematic review of the last decade (2011-2022). Computer Methods and Programs in Biomedicine.

Isfafuzzaman Tasin, S. I. R. K., Tansin Ullah Nabil, 2022 Diabetes prediction using machine learning and explainable ai techniques. Healthcare Technology Letters .

Jadhav, . C. H. P., S. D., 2016 Comparative study of k-nn, naive bayes and decision tree classification techniques. International Journal of Science and Research (IJSR) 5: 1842–1845.

Jaishree Meena, Y. H., 2022 Application of explainable artificial intelligence in the identification of squamous cell carcinoma biomarkers. Computers in Biology and Medicine .

Junfeng Peng, M. Z. Y. T. X. Z. F. Z. J. X., Kaiqiang Zou, 2021 An explainable artificial intelligence framework for the deterioration risk prediction of hepatitis patients. Journal of Medical Systems .

Jungsu Park, K. T. K. C. Y. P. S. L. T.-Y. H., Woo Hyoung Lee, 2022 Interpretation of ensemble learning to predict water quality using explainable artificial intelligence. The Science of the Total Environment .

Nori, J. S. K. P. . C. R., H., 2019 Interpretml: A unified framework for machine learning interpretability. arXiv preprint arXiv:1909.09223 .

Sejong Oh, K. J. C. S. J. K., Yuli Park, 2021 Explainable machine learning model for glaucoma diagnosis and its interpretation. Diagnostics (Basel, Switzerland) .

Tariq Aziz, Z. H. L. L., Nageen Hussain, 2024 Elucidating the role of diet in maintaining gut health to reduce the risk of obesity, cardiovascular and other age-related inflammatory diseases: recent challenges and future recommendations. Gut Microbes 16: 2297864.

Zhang, M. X. Z. J. S. D. Z. X. M. C. . W.-H., J., 2023 Insights into geospatial heterogeneity of landslide susceptibility based on the shap-xgboost model. Journal of Environmental Management 332: 117357.

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.