Predicting Diabetes in Healthy Population through Machine Learning Conference Paper uri icon


  • © 2019 IEEE. In this paper, we revisit the data of the San Antonio Heart Study, and employ machine learning to predict the future development of type-2 diabetes. To build the prediction model, we use the support vector machines and ten features that are wellknown in the literature as strong predictors of future diabetes. Due to the unbalanced nature of the dataset in terms of the class labels, we use 10-fold cross-validation to train the model and a hold-out set to validate it. The results of this study show a validation accuracy of 84.1% with a recall rate of 81.1% averaged over 100 iterations. The outcomes of this study can help in identifying the population that is at high risk of developing type-2 diabetes in the future.

author list (cited authors)

  • Alic, L., Abbas, H. T., Rios, M., AbdulGhani, M., & Qaraqe, K.

citation count

  • 4

publication date

  • June 2019