Skip to main content

HbA1c comparable to fasting glucose in the external validation of the African Diabetes Risk Score and other established risk prediction models in Black South Africans

Abstract

Background

The use of non-invasive risk scores to detect undiagnosed type 2 diabetes (T2D) ensures the restriction of invasive and costly blood tests to those most likely to be diagnosed with the disease. This study assessed and compared the performance of the African Diabetes Risk Score (ADRS) with three other diabetes risk prediction models for identifying screen-detected diabetes based on fasting plasma glucose (FPG) or glycated haemoglobin (HBA1c).

Methods

Age, sex, waist circumference, body mass index, blood pressure, history of diabetes and physical activity levels from the SA-NW-PURE study were used to externally validate the ADRS and other established risk prediction models. Discrimination was assessed and compared using C-statistics and nonparametric methods. Calibration was assessed using calibration plots, before and after recalibration.

Results

Nine hundred and thirty-seven participants were included; 14% had prevalent undiagnosed T2D according to FPG and 26% according to HbA1c. Discrimination was acceptable and was mostly similar between models for both diagnostic measures. The C-statistics for diagnosis by FPG ranged from 0.69 for the Simplified FINDRISC model to 0.77 for the ADRS model and 0.77 for the Simplified FINDRISC model to 0.79 for the ADRS model for diagnosis by HbA1c. Calibration ranged from acceptable to good, though over- and underestimation were present. All models improved significantly following recalibration.

Conclusions

The models performed comparably, with the ADRS offering a non-invasive way to identify up to 79% of cases. Based on its ease of use and performance, the ADRS is recommended for screening for T2D in certain Black population groups in South Africa. HbA1c as a means of diagnosis also showed comparable performance with FPG. Therefore, further validation studies can potentially use HbA1c as the standard to compare to.

Peer Review reports

Introduction

Diabetes is a growing problem worldwide. Type 2 diabetes (T2D) and intermediate hyperglycaemia are becoming increasingly burdensome to residents and healthcare systems in low- and middle-income countries (LMICs), even though the majority of cases remain undiagnosed [1]. Africa is expected to experience the greatest increase in diabetes prevalence by 2045 [2]. These patterns are also observed in South Africa, where the T2D epidemic occurs alongside the communicable disease epidemic of human immunodeficiency virus and tuberculosis [3]. This strains the public healthcare system, which services over 80% of the population [4].

In South Africa, 45% of the estimated T2D cases are undiagnosed [2]. Hyperglycaemia results in pathological and functional changes and may persist for an extended period before being diagnosed [5]. The chronic nature of T2D requires continuous clinical care and management that requires significant healthcare resources. In South Africa, the direct cost of managing T2D amounted to over ZAR 2.7 billion in 2018 (approximately US$143 million), half of which related to treating complications. This cost is expected to increase at least 10-fold over the next decade [6]. Detecting T2D early in the pathophysiological process enables early administration of patient-centred management to optimise glycaemia and minimise complications, subsequently reducing its associated health and economic burden [7,8,9].

As glucose testing cannot be applied broadly in resource-poor settings such as South Africa, a risk assessment score based on non-invasive predictors is a quick method of guiding healthcare professionals as to whether a blood-based diagnostic test should be performed [10]. This ensures the allocation of scarce resources to those at highest risk to prevent the progression of T2D and associated complications [11]. Risk scores can also be used as health promotion tools in LMIC settings when used as self-completion questionnaires. The information provided at the end of the questionnaire, based on the risk status, can potentially act as a catalyst for needed lifestyle changes [12].

Several non-invasive risk assessment tools have been developed in specific White, multi-racial, Asian, and Middle Eastern cohorts, which limits their generalisability to other populations, such as individuals of African descent [13,14,15,16]. Prediction equations developed in other population groups that only include non-invasive variables have performed poorly in Black African cohorts, and either underestimated or overestimated T2D risk [14, 16]. This is because there are different thresholds, for example, for age, body mass index (BMI), and waist circumference in terms of T2D risk in Africans compared to individuals of European descent [16]. To date, only one sub-Saharan African screening tool has been developed using data from Tanzania, Senegal, and Guinea, known as the African Diabetes Risk Score (ADRS). The predictive accuracy for screen-detected T2D of this screening tool was found to be good in two Black cohorts from Cape Town (CRIBSA Study) and rural Kwa-Zulu Natal [17]. As ethnicity influences T2D risk [10], the ADRS requires further validation in other ethnic groups from a variety of settings in South Africa to ensure that it reliably predicts the risk of undiagnosed T2D in Black individuals with different demographic and ethnic characteristics.

This study aimed to further validate the ADRS in the South African North-West Province arm of the Prospective Rural and Urban Epidemiology (SA-NW-PURE) study. The American Diabetes Association (ADA) risk score, a Simplified Finnish Diabetes Risk Score (FINDRISC), and the Indian Risk Score (IRS), which were developed in different phenotypic and racial populations, were also validated to compare the predictive performance of the ADRS [13, 18, 19]. Furthermore, this study evaluated whether predictive performance differed between fasting plasma glucose (FPG)-based or glycated haemoglobin (HBA1c)-based diagnoses and across age, BMI, sex, and rural-urban living subgroups.

Materials and methods

Study population and design

The PURE study is a large-scale epidemiological study that prospectively investigates lifestyle behaviours, cardiovascular risk factors and chronic non-communicable disease incidence among different communities across 27 low-, middle-, and high-income countries [20]. This study used data from SA-NW-PURE, collected at three time points (2005, 2010 and 2015). Self-reported Tswana speaking, apparently healthy Black adults older than 30 years of age were eligible for inclusion at baseline (2005). Any self-reported prior cardiovascular event or acute illness was basis for exclusion. Detailed information on participant selection and recruitment has been reported previously [20, 21]. The SA-NW-PURE study included two dwelling sites (urban and rural) and consisted of n = 2 010 participants at baseline. In 2010, n = 1 282 participants returned for follow-up, and n = 924 returned in 2015. All participants provided written informed consent prior to data collection. The study received ethical approval from the North-West University Health Research Ethics Committee (NWU-00190-22-A1) and complies with the amended Declaration of Helsinki.

Newly diagnosed T2D was defined as having FPG levels  ≥ 7 mmol/L or HbA1c  ≥ 6.5%, in line with current guidelines [22], at any of the three data collection time-points. These participants (cases) were investigated alongside individuals for whom FPG or HbA1c were below the established cut-offs throughout follow-up (participants without T2D). Individuals who were receiving diabetes medication at any time point were excluded. Data collected at the first time of diagnosis for participants with T2D (this could be either time point 1, 2 or 3), and at baseline for participants without T2D, were used in the risk prediction models. In all, n = 937 participants were included in this cross-sectional analysis. Supplemental Fig. 1 provides an overview of the case-status across data collection time points.

Data collection

Information regarding general demographics, tobacco use, alcohol consumption, dietary intake, physical activity, and medicine use (including lipid-lowering, antihypertensive, and glucose-lowering medication) was collected by trained field workers using standardised, validated questionnaires. Weight, height, and waist circumference were measured according to the International Standards of Anthropometric Assessment (ISAK). BMI was calculated by dividing weight by height squared and reported as kg/m2. Blood pressure was measured in mmHg using the OMRON HEM-757 (Omron Healthcare, Kyoto, Japan) automated digital blood pressure monitor in 2005 and 2010, and the OMRON M6 device (Omron Healthcare, Kyoto, Japan) in 2015. Fasting (overnight) blood samples were collected, processed, and stored at − 80 °C until analyses. Samples for FPG were collected in fluoride tubes, and FPG was quantified using an enzymatic reference method with hexokinase on a Vitros DT6011 Chemistry Analyzer (Ortho-Clinical Diagnostics, Rochester, New York, USA) in 2005, and a Cobas Integra 400 Roche Clinical System (Roche Diagnostics, Indianapolis, IN, USA) in 2010 and 2015. Samples for glycated haemoglobin (HbA1c) were collected in EDTA tubes and determined via ion exchange high-performance liquid chromatography with the D-10 Haemoglobin testing system at all time points (Biorad, Hercules, California, USA).

Risk prediction models

The ADRS is the only prediction model that has been developed in a sub-Saharan African population [17]. The FINDRISC model is a widely used diabetes risk score that was developed in a White population [23] and subsequently broadly validated [24, 25]. Physical activity and diet were excluded in the Simplified FINDRISC, a derivative of the FINDRISC model developed by Bergmann et al. (2007), as these variables were found to have limited relevance [23]. The ADA risk model was developed in a multi-ethnic population and is used on a large scale in the United States [13]. The IRS model was developed in an Indian population [19]. Details of the different risk prediction models are provided in Supplemental Table 1.

Statistical analysis

The participant distribution (T2D vs. no T2D) based on FPG and HbA1c diagnosis is shown in Supplemental Table 2. In terms of precision, Vergouwe et al. [26] suggest that a minimum of 100 events (T2D) and 100 non-events are required for the validation of binary outcomes. In the SA-NW-PURE study, at least 100 people were diagnosed with T2D in total using either FPG or HbA1c. The predicted probability of undiagnosed T2D for each participant was estimated using the relevant predictors for each model [13, 17,18,19]. Exclusions based on missing data were model-specific, resulting in differential cohort sizes across models. Model performance was assessed using discrimination and calibration statistics.

Discrimination refers to the ability of the model to distinguish those with prevalent undiagnosed T2D from those without T2D. Discrimination was assessed and compared using the concordance (C) statistic and non-parametric methods [27, 28]. C-statistics vary from no discrimination (0.5) to perfect discrimination (1.0), where values of 0.6–0.7 are deemed to be poor, 0.7–0.8, acceptable and 0.8–0.9, good [29]. Another popular method for displaying the discriminatory accuracy of a potential novel marker (diagnostic test), is the receiver operating characteristic (ROC) curve [30]. ROC curves are used to calculate the Youden Index, which is the optimal cut-off for maximising the potential effectiveness of a model, where sensitivity and specificity are determined for each threshold [31]. This index ranges between 0 and 1, where 1 indicates complete separation between diseased and healthy populations and 0 indicates complete overlap [30].

Calibration refers to the agreement between the probability of the outcome of interest as estimated by the model and the observed outcome frequencies [32]. This was assessed by plotting the predicted risk against the observed outcome rate in calibration plots, as well as utilising the Hosmer and Lemeshow goodness-of-fit test [29, 33]. Furthermore, the agreement between the expected and observed T2D rates (E/O) was assessed, where the 95% confidence intervals (CIs) were calculated assuming a Poisson distribution [33]. Ideally, the E/O rate should be as close to 1 as possible, values below 1 underestimate, and values above 1 overestimate the risk of undiagnosed T2D prevalence.

The Yates slope and the Brier score were also calculated. The Yates slope is the difference between the mean predicted probability of T2D for participants with and without prevalent T2D, where higher values indicate better performance [30]. The Brier score is the squared difference between the predicted probability and the actual outcome for each participant, where a perfect prediction model has a value of 1, and 0 indicates no match in prediction and outcome [32, 34].

To reduce bias brought on by differences in T2D prevalence between the development and test (validation) populations, all models were recalibrated according to the SA-NW-PURE-specific T2D prevalence using intercept adjustment. The calculated correction factor is based on the mean predicted risk and the prevalence in the SA-NW-PURE dataset and is the natural logarithm of the odds ratio of the mean observed prevalence and the mean predicted risk [35]. Furthermore, sensitivity analyses were conducted to assess the aforementioned aspects of model performance using first FPG and then HbA1c for T2D diagnosis.

Data analysis used R® statistical software version 4.1.1 (2021), and the level of statistical significance was set at p < 0.05. The models were validated in the overall sample and compared between sex (men vs. women), median age (< 51 years vs.  ≥ 51 years), median BMI (< 25 vs. ≥  25 kg/m2), and residency (rural vs. urban) subgroups upon recalibration.

Results

Participant characteristics

The prevalence of T2D, according to FPG, was approximately 14%, whereas it was just over 26%, according to HbA1c (Supplemental Table 2). Supplemental Table 3 provides the descriptive characteristics of the full analytical cohort (n = 937), and across the four case–non-case groups. More women were diagnosed with T2D by HbA1c than FPG (79.8% vs. 69.6%). Overall, those with T2D were older, had higher waist circumferences, and were more likely to be women, overweight or obese, and hypertensive. Very few participants with T2D reported having a family history of diabetes, although this data should be interpreted cautiously as almost 40% of the participants had missing family history data.

Prediction of prevalent undiagnosed T2D

The number of variables included in the risk prediction models ranged from three (ADRS) to six (ADA). All included age and waist circumference, and hypertension status was only excluded by the IRS. The Simplified FINDRISC model included characteristics with the least amount of missing data and thus had the largest sample size for analysis, whereas the ADA had the smallest sample size (Table 1).

Table 1 Performance of the original risk prediction models and the recalibrated models based on diagnosis by FPG and HbA1c

Discrimination

Table 1 includes the C-statistics and sensitivity and specificity measures at the optimal threshold, used to compare discrimination ability across models. Discrimination was acceptable overall and tended to be better in HbA1c- vs. FPG-based diagnoses. The ADRS had the highest C-statistic for T2D by FPG diagnosis and performed as well as the IRS for T2D diagnosis by HbA1c, whereas the Simplified FINDRISC performed the worst in both diagnostic groups. The ADA risk score was the most sensitive and least specific score in both diagnostic models, followed by the ADRS in each case. Overall, the IRS provided the best sensitivity/specificity profile (Fig. 1). When comparing C-statistics between subgroups (Table 2), the Simplified FINDRISC and IRS models performed better in participants below vs. above the median age for both, and there was better ADA risk score discrimination in rural- vs. urban-dwelling individuals and those with BMIs below vs. above the median. The ADRS performed equally well in all subgroups.

Fig. 1
figure 1

ROC curves for each risk prediction model by FPG-diagnosis (red) and HbA1c-diagnosis (blue), and their respective C-statistic and optimal threshold values from the Youden Index approach (black values) and the top left point approach (grey values). The numbers featured in the square brackets are co-ordinates

Table 2 Performance of the recalibrated risk prediction models according to sex, median age, median BMI, and residency based on diagnosis by FPG and HbA1c

Calibration

According to the E/O ratio, before recalibration, the ADRS, Simplified FINDRISC, and IRS models underestimated the prevalence of FPG- and HbA1c-based T2D, whereas the opposite was observed for the ADA risk score (Table 1). Recalibration improved the agreement between predicted and observed prevalent T2D rates for all models. Although the pattern of underestimation remained, the 95% CI of the E/O ratio for the ADRS, Simplified FINDRISC and IRS models all spanned 1, with the ADRS for FPG-based T2D diagnosis performing the best. The ADA risk model also improved upon recalibration but continued to overestimate T2D prevalence approximately 2-fold.

The calibration curves are presented in Fig. 2. In agreement with the E/O data, there was a systematic risk overestimation across the continuum of predicted prevalence by the ADA model for both FPG and HbA1c as the method of diagnosis. When compared to FPG as diagnostic criteria, the ADRS and IRS had a selective upper stratum (when T2D probability was higher) risk overestimation. In contrast, the ADRS had a systematic underestimation across the continuum, and the IRS had a combination of overestimation in the lower strata and underestimation in the upper strata, using HbA1c as the method of diagnosis. The Simplified FINDRISC had a lower strata underestimation and an upper strata overestimation when using FPG as the method of diagnosis, whereas it had a systematic underestimation when HbA1c was used.

Fig. 2
figure 2

Calibration curves for each risk prediction model by FPG-diagnosis (red) and HbA1c-diagnosis (blue), and their respective Brier scores and Yates slopes. The grey curves represent the original calibration, and the coloured curves represent recalibration

Regarding subgroup analysis (Table 2), the ADRS proved the most stable, with 95% CIs overlapping between each of the compared groups. The Simplified FINDRISC had two instances of non-overlapping prediction, and there was one instance for the ADA and IRS scores. Each of these three models also had one instance of very slight overlapping. The lack of overlap predominantly occurred with HbA1c-based diagnosis. The Simplified FINDRISC significantly overestimated FPG-based T2D prevalence in women compared to men. In contrast, although both were overestimated, the ADA risk score predicted much higher HbA1c-based T2D in men compared to women. Both the simplified FINDRISC and IRS predicted a higher HbA1c-based T2D prevalence for participants younger vs. older than 51 years. While both ratios remained higher than 1, the discrepancy in the expected vs. observed prevalence of HbA1c-based T2D was larger in participants with a lower compared to higher BMI for the ADA. Finally, the Simplified FINDRISC underestimated risk in urban vs. rural dwellers.

Discussion

This study further validated the unpublished ADRS, the only risk prediction model developed in a sub-Saharan African population and compared its performance to that of other widely used T2D risk prediction models. To the authors’ knowledge, this is also the first study to evaluate whether the performance of risk prediction models differed when using HbA1c or FPG as the diagnostic measure of T2D. Though many T2D risk scores have been developed, only a few have been externally validated, usually in high-income countries [36]. Since prediction equations developed in other population groups that include non-invasive variables only, have performed poorly in African cohorts [14, 16], it is important to identify a suitable risk prediction model for use in individuals of African descent. The external validation study by Masconi et al. [14] also highlighted the need to improve screening tools for use in the South African setting. In terms of a public health approach, a non-invasive risk score can be used to identify individuals at risk of prevalent undiagnosed T2D who should undergo further biochemical testing [37]. This will ensure the allocation of scarce resources to those at the highest risk to prevent the progression of T2D and associated complications in a cost-effective manner. An ideal screening tool should thus be easy to use, non-invasive, free, and easily accessible to clinicians, public health workers, researchers, and individuals to assess their level of risk of having T2D [11, 15].

Overall performance

Almost all participants with T2D reported not being previously diagnosed with T2D, which agrees with the high prevalence of undiagnosed T2D in SA [2]. In the SA-NW-PURE sample, the ADA overestimated risk, and the remaining prediction models performed relatively comparably. Except for the ADA, the other models had an overall modest-to-acceptable discriminatory ability to predict prevalent undiagnosed T2D, using both FPG and HbA1c as diagnostic criteria. However, the ADRS and IRS performed marginally best when FPG and HbA1c, respectively, were used, taking C-statistics, E/O rates, sensitivity, and specificity into account. The age cut-off in the population used to develop the Simplified FINDRISC was much higher than the other three models, and FINDRISC only included the use of blood pressure medication (and not blood pressure itself) to determine hypertensive status (thus assuming that all people with hypertension were receiving treatment), which could have negatively impacted its performance in this population.

The calibration for all models was improved through simple intercept adjustment, and the assessment of recalibration was better when FPG was used. After recalibration, the ADRS had a near-perfect E/O ratio when FPG was used for diagnosis, and the calibration curve was close to the curve of perfect calibration, with slight overestimation in the upper strata only. Furthermore, the ADRS performed equally well in all subgroups, further supporting its generalisability. This is not unexpected as the three variables that the ADRS uses; age, waist circumference, and hypertension, are known to be strong predictors of T2D [38,39,40,41,42] and commonly feature in risk prediction models. The fact that the recalibration had the smallest effect on the IRS suggests that the baseline prevalence in the population in which the IRS was developed was similar to that of the SA-NW-PURE study. In contrast, the baseline risk in the population in which the ADA model was developed was much higher, and while recalibration was slightly successful in reducing the degree of risk overestimation, this remained unacceptably high.

The number of variables used in the equations does not seem to influence the predictive performance of risk scores [12]. However, the more variables the model uses, the higher the chance of participant exclusion due to missing data. This is seen with the ADA, which had the largest number of variables (n = 6) and the smallest sample size (n = 420). Though the ADRS had the fewest variables (n = 3), the Simplified FINDRISC, with five variables, had the largest sample size (n = 727).

Considering the aforementioned results, the ADRS is recommended for use in this Tswana-speaking, Black South African population. Though the IRS performed marginally better when HbA1c was used, it includes family history of diabetes and level of physical activity as variables, which may result in missing data or incorrect information used for risk calculation. This is particularly relevant in older populations, and those with a relatively low level of education, poor access to health care, and unhealthy lifestyles [43]. The ADRS includes age, waist circumference and hypertensive status as variables, all of which are easy to obtain. Fewer variables reduce the risk of missing data and the impact this has on the performance of a model. The ADRS has been externally validated by Mayige [17] in an urban Black population in Cape Town (CRIBSA study population), and a rural Black population in Kwa-Zulu Natal, and it also performed well in these populations. Although ethnicity was not genetically determined in this or the external validation studies by Mayige [17], individuals in the North-West Province, Cape Town and Kwa-Zulu Natal typically represent different ethnicities, and these results, therefore, suggest that the ADRS can successfully be used in different ethnic groups in South Africa.

Performance according to subgroups

The models’ performance varied in the major subgroups, where they generally performed better in women, those who were younger, and those with a lower BMI. Discrimination varied in the subgroups from poor to good. Women accounted for 70% of the sample of this study. Certain risk factors for T2D may affect men and women differently, such as distinct hormonal profiles in women that influence their risk [44], or sex differences in health-related behaviour, such as smoking, excessive drinking, and unhealthy dietary habits, that can alter diabetes risk [45]. Men are also more insulin resistant than women and have greater central and hepatic fat compared to women [46, 47]. If the model has been developed using risk factors that are more common in one sex, it may not accurately assess risk in the other. For example, Mayige [17] indicated that women were over-represented in the studies used to derive and validate the ADRS. Moreover, T2D is strongly associated with obesity [10], and in individuals with lower BMI levels, other risk factors for T2D may become more significant, allowing the model to detect and weight these factors more effectively. It is thus important that individuals with a range of BMI levels are included in the development of a risk prediction model, so the diversity can improve the model’s ability to evaluate a range of risk factors and their interactions.

The median age of the SA-NW-PURE population was 51 years; thus, a large proportion of the study population was at risk of T2D, as age is a key risk factor for developing T2D [10]. Ideally, screening for T2D should begin from 35 years for all people and should take place in adults of all ages if they present with overweight or obesity and one or more risk factors for T2D [10, 48]. Younger individuals usually have a lower baseline risk of developing T2D, and they may have fewer risk factors that trigger a model’s criteria designed to detect individuals at elevated risk. The underestimation of risk in urban dwellers could result from the model inadequately accounting for the nutrition transition that occurs with urbanisation and the resultant increased risk of T2D [49].

Performance according to the method of T2D diagnosis

This study has shown that the models performed comparably, whether FPG or HbA1c was used. This has important implications for future screening of undiagnosed T2D. HbA1c may be more convenient to obtain when conducting epidemiological research as it doesn’t require fasting. It has long been accepted that FPG can be used to diagnose diabetes [50]. Relative to FPG, HbA1c has only been accepted as a screening and diagnostic test more recently, after much debate. Other than glycaemia, several factors can affect HbA1c levels [51]. For example, HbA1c can portray falsely high glucose levels when red blood cells have an increased lifespan and/or reduced turnover, as is found in untreated iron deficiency and certain haemoglobin variants [52]. Haemoglobinopathies have been found to be more prevalent in some LMICs [1].

Furthermore, it has been reported that HbA1c and FPG have slight differences in identifying different groups of people with diabetes [53, 54]. HbA1c levels have also been demonstrated to differ between races, potentially affecting the T2D prevalence when assessed by HbA1c and FPG [53, 55]. Furthermore, it has recently been suggested by Chivese et al. [56] that a different HbA1c cut-off may need to be applied for individuals of African descent since the T2D HbA1c cut-off of 6.5% missed up to 42% and 35% of people with diabetes identified by OGTT and FPG, respectively. This does not imply that glucose-based measures are more accurate in classifying T2D; however, the discrepancy between tests may result in differing estimates of the prevalence of T2D [56]. Though the performance of the risk prediction models in this study was comparable with both methods of diagnosis, they tended to underestimate risk when HbA1c was used. This finding supports the suggestion of a lower HbA1c cut-off for individuals of African descent.

Strengths and limitations

This study has assisted with the generalisability of the ADRS in Black South Africans, who account for the greatest portion of people in South Africa [57]. As prediction models commonly perform poorly in new populations compared to the development population [28], investigators tend to reject existing models and develop or fit a new one. This results in the loss of previous scientific information and creates confusion amongst healthcare professionals as to which model to use. Better practice involves updating and recalibrating existing prediction models to the population at hand [28]. This results in combined information from the original model and new individuals [58,59,60]. The combination of information may improve the generalisability of the updated model to other population groups [28]. A potential limitation of the study was the large amount of missing data for some variables, reducing the sample size available for validating each risk score. This could have negatively influenced their performance. Limited information was available for family history of T2D, which resulted in very small sample sizes for the ADA and IRS. The oral glucose tolerance test is considered to be the gold standard for diagnosing diabetes, and not having this measurement in the SA-NW-PURE dataset could therefore be considered a limitation of the study.

Conclusion

Widespread laboratory measurements are not financially feasible in resource-scarce settings such as South Africa. Ideally, scarce resources should be allocated to those at highest risk of developing or having undiagnosed T2D. A two-step approach can thus be used, where a risk prediction model comprised of non-invasive variables in the form of a questionnaire can guide healthcare professionals on whether a diagnostic test should be performed. While no model significantly outperformed others enough to be uniquely selected for routine risk stratification use, based on the ease of use and the performance, it is recommended that the ADRS be used to screen for T2D in the Black population groups in South Africa. It can also be used as a health promotion tool when used as a self-completion questionnaire. Furthermore, using HbA1c as a means of diagnosis resulted in comparable performance with FPG. Therefore, further validation of risk prediction models can potentially make use of HbA1c as a method of diagnosis, given its convenience as it does not require fasting.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ADA:

American Diabetes Association

ADRS:

African Diabetes Risk Score

BMI:

Body mass index

E/O:

Expected and observed rates

FINDRISC:

Finnish Diabetes Risk Score

rRNA:

Ribosomal RNA

FPG:

Fasting Plasma Glucose

HbA1c:

Glycated haemoglobin

AUROC :

Area Under The Receiver Operating Characteristic Curve

IRS:

Indian Risk Score

ISAK:

International Standards of Anthropometric Assessment

IGF-1:

Insulin-Like Growth Factor-1

LMICs:

Low- and Middle-Income Countries

NWU:

North West University

ROC:

Receiver Operating Characteristic

SA-NW-PURE:

South African North-West Province arm of the Prospective Rural and Urban Epidemiology

T2D:

Type 2 Diabetes

ZAR:

South African rand

References

  1. Echouffo-Tcheugui JB, Mayige M, Ogbera AO, Sobngwi E, Kengne AP. Screening for hyperglycemia in the developing world: rationale, challenges and opportunities. Diabetes Res Clin Pract. 2012;98(2):199–208. https://doi.org/10.1016/j.diabres.2012.08.003.

    Article  PubMed  Google Scholar 

  2. International Diabetes Federation. IDF Diabetes Atlas 10th edition. 2021. https://diabetesatlas.org/

  3. Oni T, Youngblood E, Boulle A, McGrath N, Wilkinson RJ, Levitt NS. Patterns of HIV, TB, and non-communicable disease multi-morbidity in peri-urban South Africa - a cross sectional study. BMC Infect Dis. 2015;15:20. https://doi.org/10.1186/s12879-015-0750-1.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Mayosi BM, Flisher AJ, Lalloo UG, Sitas F, Tollman SM, Bradshaw D. The burden of non-communicable diseases in South Africa. Lancet. 2009;374(9693):934–47. https://doi.org/10.1016/s0140-6736(09)61087-4.

    Article  PubMed  Google Scholar 

  5. The Expert Committee on the Diagnosis & Classification of Diabetes Mellitus. Report of the Expert Committee on the diagnosis and classification of diabetes Mellitus. Diabetes Care. 2003;26(suppl1):s5–20. https://doi.org/10.2337/diacare.26.2007.S5

  6. Erzse A, Stacey N, Chola L, Tugendhaft A, Freeman M, Hofman K. The direct medical cost of type 2 diabetes mellitus in South Africa: a cost of illness study. Glob Health Action. 2019;12(1):1636611. https://doi.org/10.1080/16549716.2019.1636611.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Chatterjee S, Khunti K, Davies MJ. Type 2 diabetes. Lancet. 2017;389(10085):2239–51. https://doi.org/10.1016/s0140-6736(17)30058-2.

    Article  PubMed  CAS  Google Scholar 

  8. Ding D, Chong S, Jalaludin B, Comino E, Bauman AE. Risk factors of incident type 2-diabetes mellitus over a 3-year follow-up: results from a large Australian sample. Diabetes Res Clin Pract. 2015;108(2):306–15. https://doi.org/10.1016/j.diabres.2015.02.002.

    Article  PubMed  Google Scholar 

  9. Joshi P, Joshi S. Type 2 diabetes: primary health care approach for prevention, screening and diagnosis in South Africa. SAFP. 2008;50(4):14–20.

    Google Scholar 

  10. ElSayed NA, Aleppo G, Aroda VR, Bannuru RR, Brown FM, Bruemmer D,... on behalf of the American Diabetes Association. 2. Classification and diagnosis of diabetes: Standards of Care in Diabetes—2023. Diabetes Care. 2022;46(Supplement_1):S19–S40. https://doi.org/10.2337/dc23-S002

  11. Ekure E, Ovenseri-Ogbomo G, Osuagwu UL, Agho KE, Ekpenyong BN, Ogbuehi KC,... Naidoo KS. A systematic review of diabetes risk assessment tools in sub-Saharan Africa. Int J Diabetes Dev Ctries. 2022;42(3):380–93. https://doi.org/10.1007/s13410-022-01045-8

  12. Brown N, Critchley J, Bogowicz P, Mayige M, Unwin N. Risk scores based on self-reported or available clinical data to detect undiagnosed type 2 diabetes: a systematic review. Diabetes Res Clin Pract. 2012;98(3):369–85. https://doi.org/10.1016/j.diabres.2012.09.005.

    Article  PubMed  Google Scholar 

  13. Bang H, Edwards AM, Bomback AS, Ballantyne CM, Brillon D, Teutsch MACSM,... Kern LM. A patient self-assessment diabetes screening score. Ann Intern Med. 2009;151(11):775–83. https://doi.org/10.1059/0003-4819-151-11-200912010-00005.A.

  14. Masconi K, Matsha TE, Erasmus RT, Kengne AP. Independent external validation and comparison of prevalent diabetes risk prediction models in a mixed-ancestry population of South Africa. Diabetol Metab Syndr. 2015;7:42. https://doi.org/10.1186/s13098-015-0039-y.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Mbanya V, Hussain A, Kengne AP. Application and applicability of non-invasive risk models for predicting undiagnosed prevalent diabetes in Africa: a systematic literature search. Prim Care Diabetes. 2015;9(5):317–29. https://doi.org/10.1016/j.pcd.2015.04.004.

    Article  PubMed  Google Scholar 

  16. Mugeni R, Aduwo JY, Briker SM, Hormenu T, Sumner AE, Horlyck-Romanovsky MF. A review of diabetes prediction equations in African descent populations. Front Endocrinol. 2019;10:663. https://doi.org/10.3389/fendo.2019.00663.

    Article  Google Scholar 

  17. Mayige M. Derivation and validation of a simple risk score for undiagnosed diabetes for Tanzania and other African populations. 2014. https://theses.ncl.ac.uk/jspui/handle/10443/2435

  18. Bergmann A, Li J, Wang L, Schulze J, Bornstein SR, Schwarz PE. A simplified Finnish diabetes risk score to predict type 2 diabetes risk and disease evolution in a German population. Horm Metab Res. 2007;39(9):677–82. https://doi.org/10.1055/s-2007-985353.

    Article  PubMed  CAS  Google Scholar 

  19. Ramachandran A, Snehalatha C, Vijay V, Wareham NJ, Colagiuri S. Derivation and validation of diabetes risk score for urban Asian indians. Diabetes Res Clin Pract. 2005;70(1):63–70. https://doi.org/10.1016/j.diabres.2005.02.016.

    Article  PubMed  CAS  Google Scholar 

  20. Teo K, Chow CK, Rangarajan S, Yusuf S, Vaz M, McQueen M,... Williams D. The Prospective Urban Rural Epidemiology (PURE) study: Examining the impact of societal influences on chronic noncommunicable diseases in low-, middle-, and high-income countries. Am Heart J. 2009;158(1):7.e1. https://doi.org/10.1016/j.ahj.2009.04.019

  21. de Lange Z, Pieters M, Jerling JC, Kruger A, Rijken DC. Plasma clot lysis time and its association with cardiovascular risk factors in Black africans. PLoS ONE. 2012;7(11):e48881. https://doi.org/10.1371/journal.pone.0048881.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  22. World Health Organization & International Diabetes Federation. Definition and diagnosis of diabetes mellitus and intermediate hyperglycemia. 2006;1–41. https://apps.who.int/iris/handle/10665/43588

  23. Lindström J, Tuomilehto J. The diabetes risk score: a practical tool to predict type 2 diabetes risk. Diabetes Care. 2003;26:725–31.

    Article  PubMed  Google Scholar 

  24. Gabriel R, Acosta T, Florez K, Anillo L, Navarro E, Boukichou N,... Aschner P. Validation of the Finnish Type 2 Diabetes Risk Score (FINDRISC) with the OGTT in health care practices in Europe. Diabetes Res Clin Pract. 2021;178:108976. https://doi.org/10.1016/j.diabres.2021.108976.

  25. Rokhman MR, Arifin B, Zulkarnain Z, Rauf S, Perwitasari DA. Bibliometric analysis of the utilisation of FINDRISC in patients with diabetes: 2005–2021. Borneo J Pharm. 2022;5(3). https://doi.org/10.33084/bjop.v5i3.3267.

  26. Vergouwe Y, Steyerberg EW, Eijkemans MJC, Habbema JDF. Substantial effective sample sizes were required for external validation studies of predictive logistic regression models. J Clin Epidemiol. 2005;58:475–83. https://doi.org/10.1016/j.jclinepi.2004.06.017.

    Article  PubMed  Google Scholar 

  27. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–45.

    Article  PubMed  CAS  Google Scholar 

  28. Moons KG, Kengne AP, Grobbee DE, Royston P, Vergouwe Y, Altman DG, Woodward M. Risk prediction models: II. External validation, model updating, and impact assessment. Heart. 2012;98(9):691–8. https://doi.org/10.1136/heartjnl-2011-301247.

    Article  PubMed  Google Scholar 

  29. Cole TJ. Applied logistic regression. New York: Wiley; 1989.

    Google Scholar 

  30. Fluss R, Faraggi D, Reiser B. Estimation of the Youden Index and its associated cutoff point. Biom J. 2005;47(4):458–72. https://doi.org/10.1002/bimj.200410135.

    Article  PubMed  Google Scholar 

  31. Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32–5. https://doi.org/10.1002/1097-0142(1950)3:1%3C32::aid-cncr2820030106%3E3.0.co;2-3.

    Article  PubMed  CAS  Google Scholar 

  32. Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N,... Kattan MW. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiol. 2010;21(1):128–38. https://doi.org/10.1097/EDE.0b013e3181c30fb2

  33. Dobson AJ, Kuulasmaa K, Eberle E, Scherer J. Confidence intervals for weighted sums of Poisson parameters. Stat Med. 1991;10(3):457–62. https://doi.org/10.1002/sim.4780100317.

    Article  PubMed  CAS  Google Scholar 

  34. Kengne AP, Masconi K, Mbanya VN, Lekoubou A, Echouffo-Tcheugui JB, Matsha TE. Risk predictive modelling for diabetes and cardiovascular disease. Crit Rev Clin Lab Sci. 2014;51(1):1–12. https://doi.org/10.3109/10408363.2013.853025.

    Article  PubMed  CAS  Google Scholar 

  35. Janssen KJ, Moons KG, Kalkman CJ, Grobbee DE, Vergouwe Y. Updating methods improved the performance of a clinical prediction model in new patients. J Clin Epidemiol. 2008;61(1):76–86. https://doi.org/10.1016/j.jclinepi.2007.04.018.

  36. Collins GS, Mallett S, Omar O, Yu L-M. Developing risk prediction models for type 2 diabetes: a systematic review of methodology and reporting. BMC Med. 2011;9(1):103. https://doi.org/10.1186/1741-7015-9-103.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Kengne AP, Beulens JW, Peelen LM, Moons KG, van der Schouw YT, Schulze MB,... Wareham NJ. Non-invasive risk scores for prediction of type 2 diabetes (EPIC-InterAct): a validation of existing models. Lancet Diabetes Endocrinol. 2014;2(1):19-29. https://doi.org/10.1016/s2213-8587(13)70103-7.

  38. Kahn R, Alperin P, Eddy D, Borch-Johnsen K, Buse J, Feigelman J, Wareham NJ. Age at initiation and frequency of screening to detect type 2 diabetes: a cost-effectiveness analysis. Lancet. 2010;375(9723):1365–74. https://doi.org/10.1016/S0140-6736(09)62162-0.

    Article  PubMed  Google Scholar 

  39. Peer N, Balakrishna Y, de Villiers A, Crickmore C, Mungal-Singh V. Effectiveness of a screening programme in identifying individuals with increased risk of cardiovascular disease in South Africa. J Public Health (Oxf). 2018;40(1):e34–45. https://doi.org/10.1093/pubmed/fdx012.

    Article  PubMed  Google Scholar 

  40. Hayashi T, Tsumura K, Suematsu C, Endo G, Fujii S, Okada K. High normal blood pressure, hypertension, and the risk of type 2 diabetes in Japanese men. Osaka Health Surv Diabetes Care. 1999;22(10):1683–7. https://doi.org/10.2337/diacare.22.10.1683.

    Article  CAS  Google Scholar 

  41. Conen D, Ridker PM, Mora S, Buring JE, Glynn RJ. Blood pressure and risk of developing type 2 diabetes mellitus: the women’s Health Study. Eur Heart J. 2007;28(23):2937–43. https://doi.org/10.1093/eurheartj/ehm400.

    Article  PubMed  Google Scholar 

  42. Bray GA, Jablonski KA, Fujimoto WY, Barrett-Connor E, Haffner S, Hanson RL,... Program Research Group. Relation of central adiposity and body mass index to the development of diabetes in the Diabetes Prevention Program. AJCN, 2008;87:1212–8. https://doi.org/10.1093/ajcn/87.5.1212.

  43. Tsiampalis T, Panagiotakos DB. Missing-data analysis: socio- demographic, clinical and lifestyle determinants of low response rate on self- reported psychological and nutrition related multi- item instruments in the context of the ATTICA epidemiological study. BMC Med Res Methodol. 2020;20(1):148. https://doi.org/10.1186/s12874-020-01038-3.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Kautzky-Willer A, Harreiter J, Pacini G. Sex and gender differences in risk, pathophysiology and complications of type 2 diabetes Mellitus. Endocr Rev. 2016;37(3):278–316. https://doi.org/10.1210/er.2015-1137.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  45. Olajide D, Eberth B, Ludbrook A. Analysis of multiple Health Risky behaviours and Associated Disease outcomes using Scottish linked Hospitalisation Data. Front Public Health. 2022;10. https://doi.org/10.3389/fpubh.2022.847938.

  46. Geer EB, Shen W. Gender differences in insulin resistance, body composition, and energy balance. Gend Med. 2009;6(Suppl 1):60–75. https://doi.org/10.1016/j.genm.2009.02.002.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Sattar N. Gender aspects in type 2 diabetes mellitus and cardiometabolic risk. Best Pract Res Clin Endocrinol Metab. 2013;27(4):501–7. https://doi.org/10.1016/j.beem.2013.05.006.

    Article  PubMed  CAS  Google Scholar 

  48. Chung S, Azar KM, Baek M, Lauderdale DS, Palaniappan LP. Reconsidering the age thresholds for type II diabetes screening in the U.S. Am J Prev Med. 2014;47(4):375–81. https://doi.org/10.1016/j.amepre.2014.05.012.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Wentzel-Viljoen E, Lee S, Laubscher R, Vorster HH. Accelerated nutrition transition in the North West Province of South Africa: results from the prospective urban and rural epidemiology (PURE-NWP-SA) cohort study, 2005 to 2010. Public Health Nutr. 2018;21(14):2630–41. https://doi.org/10.1017/S1368980018001118.

    Article  PubMed  PubMed Central  Google Scholar 

  50. American Diabetes Association. Diagnosis and classification of diabetes mellitus. Diabetes Care. 2011;34(Suppl 1):S62–69. https://doi.org/10.2337/dc11-S062.

    Article  PubMed Central  CAS  Google Scholar 

  51. World Health Organization. Use of glycated haemoglobin (HbA1c) in the diagnosis of diabetes mellitus. 2011. https://apps.who.int/iris/bitstream/handle/10665/70523/WHO_NMH_CHP_CPM_11.1_eng.pdf

  52. Wright LA-C, Hirsch IB. Metrics beyond hemoglobin A1C in diabetes management: time in range, hypoglycemia, and other parameters. Diabetes Technol Ther. 2017;19(S2):S16–26. https://doi.org/10.1089/dia.2017.0029.

    Article  PubMed  CAS  Google Scholar 

  53. Cowie CC, Rust KF, Byrd-Holt DD, Gregg EW, Ford ES, Geiss LS,... Fradkin JE. Prevalence of diabetes and high risk for diabetes using A1C criteria in the U.S. population in 1988–2006. Diabetes Care. 2010;33(3):562–8. https://doi.org/10.2337/dc09-1524.

  54. Lorenzo C, Wagenknecht LE, Hanley AJ, Rewers MJ, Karter AJ, Haffner SM. A1C between 5.7 and 6.4% as a marker for identifying pre-diabetes, insulin sensitivity and secretion, and cardiovascular risk factors: the insulin resistance atherosclerosis study (IRAS). Diabetes Care. 2010;33(9):2104–9. https://doi.org/10.2337/dc10-0679.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  55. Christensen DL, Witte DR, Kaduka L, Jorgensen ME, Borch-Johnsen K, Mohan V,... Vistisen D. Moving to an A1C-based diagnosis of diabetes has a different impact on prevalence in different ethnic groups. Diabetes Care. 2010;33(3):580–2. https://doi.org/10.2337/dc09-1843.

  56. Chivese T, Hirst J, Matizanadzo JT, Custodio M, Farmer A, Norris S, Levitt N. The diagnostic accuracy of HbA(1c), compared to the oral glucose tolerance test, for screening for type 2 diabetes mellitus in Africa - A systematic review and meta-analysis. Diabet Med. 2022;39(4):e14754. https://doi.org/10.1111/dme.14754.

    Article  PubMed  CAS  Google Scholar 

  57. Statista. Total population of South Africa in 2022, by ethnic groups. 2022. https://www.statista.com/statistics/1116076/total-population-of-south-africa-by-population-group/ Accessed 23 Jul 2024.

  58. Ivanov J, Tu JV, Naylor CD. Ready-made, recalibrated, or remodeled? Issues in the use of risk indexes for assessing mortality after coronary artery bypass graft surgery. Circulation. 1999;99(16):2098–104. https://doi.org/10.1161/01.cir.99.16.2098.

    Article  PubMed  CAS  Google Scholar 

  59. Moons KG, Altman DG, Vergouwe Y, Royston P. Prognosis and prognostic research: application and impact of prognostic models in clinical practice. BMJ. 2009;338:b606. https://doi.org/10.1136/bmj.b606.

    Article  PubMed  Google Scholar 

  60. Steyerberg EW, Borsboom GJ, van Houwelingen HC, Eijkemans MJ, Habbema JD. Validation and updating of predictive logistic regression models: a study on sample size and shrinkage. Stat Med. 2004;23(16):2567–86. https://doi.org/10.1002/sim.1844.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

We thank all participants, fieldworkers and supporting staff involved in the PURE data collection. We thank the PURE-SA research team and the office staff of the Africa Unit for Transdisciplinary Health Research of the Faculty of Health Sciences, NWU South Africa. In addition, we thank the PURE-International research team and the PURE-study office staff at the PHRI, Hamilton Health Sciences and McMaster University, Ontario, Canada.

Funding

Open access funding provided by North-West University. This work was supported by the North-West University, South African National Research Foundation (NRF), Population Health Research Institute and the South African Medical Research Council (SAMRC-RFA-EMU-10-2020). None of the funding bodies were involved in the design of the study, collection, analysis, or interpretation of the data or in the writing of this manuscript. Opinions expressed and conclusions arrived at are those of the authors and are not to be attributed to the funding sources.

Author information

Authors and Affiliations

Authors

Contributions

All authors made substantial contributions to the following: (1) the conception and design of the study, or acquisition of data, or analysis and interpretation of data [NR, HTC, HSK, RCDM, APK, MP], (2) drafting the article or revising it critically for important intellectual content [NR, HTC, APK, MP], (3) final approval of the version to be submitted [NR, HTC, APK, HSK, RCDM, MP].

Corresponding author

Correspondence to Marlien Pieters.

Ethics declarations

Ethics approval and consent to participate

All participants provided written informed consent prior to data collection for each leg of the SA-NW-PURE study. This study received ethical approval from the North-West University Health Research Ethics Committee (NWU-00190-22-A1) and complies with the amended Declaration of Helsinki.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Royce, N., Cronjé, H.T., Kengne, A.P. et al. HbA1c comparable to fasting glucose in the external validation of the African Diabetes Risk Score and other established risk prediction models in Black South Africans. BMC Endocr Disord 24, 213 (2024). https://doi.org/10.1186/s12902-024-01735-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12902-024-01735-w

Keywords