Abstract
Objective We build classification models and risk assessment tools for diabetes, hypertension and comorbidity using machine-learning algorithms on data from Kuwait. We model the increased proneness in diabetic patients to develop hypertension and vice versa. We ascertain the importance of ethnicity (and natives vs expatriate migrants) and of using regional data in risk assessment. Design Retrospective cohort study. Four machine-learning techniques were used: logistic regression, k-nearest neighbours (k-NN), multifactor dimensionality reduction and support vector machines. The study uses fivefold cross validation to obtain generalisation accuracies and errors. Setting Kuwait Health Network (KHN) that integrates data from primary health centres and hospitals in Kuwait. Participants 270 172 hospital visitors (of which, 89 858 are diabetic, 58 745 hypertensive and 30 522 comorbid) comprising Kuwaiti natives, Asian and Arab expatriates. Outcome measures Incident type 2 diabetes, hypertension and comorbidity. Results Classification accuracies of >85% (for diabetes) and >90% (for hypertension) are achieved using only simple non-laboratory-based parameters. Risk assessment tools based on k-NN classification models are able to assign ‘high’ risk to 75% of diabetic patients and to 94% of hypertensive patients. Only 5% of diabetic patients are seen assigned ‘low’ risk. Asian-specific models and assessments perform even better. Pathological conditions of diabetes in the general population or in hypertensive population and those of hypertension are modelled. Two-stage aggregate classification models and risk assessment tools, built combining both the component models on diabetes (or on hypertension), perform better than individual models. Conclusions Data on diabetes, hypertension and comorbidity from the cosmopolitan State of Kuwait are available for the first time. This enabled us to apply four different case–control models to assess risks. These tools aid in the preliminary non-intrusive assessment of the population. Ethnicity is seen significant to the predictive models. Risk assessments need to be developed using regional data as we demonstrate the applicability of the American Diabetes Association online calculator on data from Kuwait.
Keywords
Affiliated Institutions
Related Publications
Multiple risk factor interventions for primary prevention of coronary heart disease
Interventions using counselling and education aimed at behaviour change do not reduce total or CHD mortality or clinical events in general populations but may be effective in re...
Risk stratification in hypertension: new insights from the Framingham study*1
Five decades of epidemiologic research have established that blood pressure elevation is a common and powerful contributor to all of the major cardiovascular diseases, including...
HbA1c and all-cause mortality risk among patients with type 2 diabetes
Several prospective studies have evaluated the association between glycosylated hemoglobin (HbA1c) and death risk among diabetic patients. However, the results have been inconsi...
Effect of Antihypertensive Treatment in Patients Having Already Suffered From Stroke
Background and Purpose Drug treatment of high blood pressure has been shown to reduce the associated cardiovascular risk. Stroke represents the type of event more strongly linke...
Implementation of machine-learning classification in remote sensing: an applied review
Machine learning offers the potential for effective and efficient classification of remotely sensed imagery. The strengths of machine learning include the capacity to handle dat...
Publication Info
- Year
- 2013
- Type
- article
- Volume
- 3
- Issue
- 5
- Pages
- e002457-e002457
- Citations
- 145
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1136/bmjopen-2012-002457
- PMID
- 23676796
- PMCID
- PMC3657675