phat 19(20): e7

Research Article

Predicting Diabetes Mellitus and Analysing Risk-Factors Correlation

Download461 downloads
  • @ARTICLE{10.4108/eai.13-7-2018.164173,
        author={Md. Faisal Faruque and Asaduzzaman Asaduzzaman and Syed Md. Minhaz Hossain and Md. Hasan Furhad and Iqbal H. Sarker},
        title={Predicting Diabetes Mellitus and Analysing Risk-Factors Correlation},
        journal={EAI Endorsed Transactions on Pervasive Health and Technology},
        keywords={health informatics, machine learning, diabetes, classification, e-health services},
  • Md. Faisal Faruque
    Asaduzzaman Asaduzzaman
    Syed Md. Minhaz Hossain
    Md. Hasan Furhad
    Iqbal H. Sarker
    Year: 2019
    Predicting Diabetes Mellitus and Analysing Risk-Factors Correlation
    DOI: 10.4108/eai.13-7-2018.164173
Md. Faisal Faruque1,*, Asaduzzaman Asaduzzaman1, Syed Md. Minhaz Hossain1,2, Md. Hasan Furhad3, Iqbal H. Sarker1,*
  • 1: Department of Computer Science and Engineering, Chittagong University of Engineering and Technology, Bangladesh
  • 2: Department of Computer Science and Engineering, Premier University, Chittagong, Bangladesh
  • 3: Canberra Institute of Technology, Reid, ACT, Australia
*Contact email:,


INTRODUCTION: Diabetes mellitus is a common disease of the human body caused by a group of metabolic disorders where the sugar levels exceed a prolonged period, and that is very high than the usual time. It not only affects different organs of the human body but also harms a large number of the body system, in particular the blood veins and nerves.

OBJECTIVES: Early predictions of this phenomenon can help us to control the disease and also to save human life. For achieving the goal, this research work mainly explores various risk factors such as kidney complications, blood pressure, hearing loss, and skin complications related to this disease using machine learning techniques and make a decision.

METHODS: Machine learning techniques provide an efficient result to extract knowledge by constructing predicting models from diagnostic medical datasets collected from 200 diabetic patients from the Medical Centre Chittagong, Bangladesh using 16 attributes. Obtaining knowledge from such data can be useful to predict diabetes. In this work, we perform four popular machine learning algorithms, such as Support Vector Machine (SVM), Naive Bayes (NB), K-Nearest Neighbour (KNN) and C4.5 Decision Tree (DT), on adult population dataset to predict Diabetes Mellitus.

RESULTS: C4.5 Decision Tree performs better than other algorithms for predicting diabetes with 73.5% accuracy, 72% F-measure, and 0.69 of AUC (area under ROC curve). Besides, we determine the correlation between different risk factors of Diabetes Mellitus. The highest correlation is 0.81 for blood pressure (Hypertension) complications with diabetes.

CONCLUSION: In this study, both positive and negative correlation has been established between the various risk factors and diabetes. There is a positive correlation for predicting kidney complications (Nephropathy) and blood pressure (Hypertension) complications and a negative correlation at predicting hearing loss and skin complications (diabetes dermopathy) from diabetic patients. It helps a patient to be aware of the risk factors related to diabetes.