Evaluation and refinement of the PRESTARt tool for identifying 12–14 year olds at high lifetime risk of developing type 2 diabetes compared to a clinician's assessment of risk: a cross-sectional study

Background: Traditionally, Type 2 Diabetes Mellitus (T2DM) was associated with older age, but it is now increasingly diagnosed in younger populations because of the rising prevalence of obesity and inactivity. We aimed to evaluate whether a tool developed for community use to identify adolescents at high lifetime risk of developing T2DM agreed with a risk assessment conducted by a clinician, using data collected from five European countries. We also assessed whether the tool could be simplified.
Methods: To evaluate the tool we collected data from 636 adolescents aged 12–14 years from five European countries. Each participant's data were then assessed by two clinicians independently, who judged each participant to be at either low or high risk of developing T2DM in their lifetime. This was used as the gold standard against which the tool was evaluated and refined.
Results: The refined tool categorised adolescents as high risk if they were overweight/obese and had at least one other risk factor (high waist circumference, family history of diabetes, parental obesity, not breast fed, high sugar intake, high screen time, low physical activity and low fruit and vegetable intake). Of those found to be at high risk by the clinicians, 93% were also deemed high risk by the tool (sensitivity). Of those deemed at low risk by the clinicians, 67% were also found to be at low risk by the tool (specificity).
Conclusions: We have evaluated a tool for identifying adolescents with risk factors associated with the development of T2DM in the future. Future work to externally validate the tool using prospective data including T2DM incidence is required.
Electronic supplementary material: The online version of this article (10.1186/s12902-019-0410-3) contains supplementary material, which is available to authorized users.


Background
In 2017, 425 million people had diabetes worldwide; this is projected to increase by 48% to 629 million by 2045 [1]. Around 90% of these are diagnosed with Type 2 Diabetes Mellitus (T2DM). Worldwide, 352 million people are at risk of developing T2DM and around 1 in 2 adults with T2DM are undiagnosed [1]. Historically T2DM was associated with older age, but over the past 15 years dramatic rises in T2DM have been seen in children, adolescents and young adults [2]. A study assessing the prevalence of T2DM in people aged 10-19 years in the United States found a 30% increase in prevalence between 2001 and 2009 [3]. Similar increases have been reported in the UK [4,5]. Those with early onset T2DM (for example those diagnosed before 40 years old) seem to represent a high risk group, with long disease exposure leading to the early onset of microvascular and macrovascular complications [6]. Emerging evidence also suggests that early onset T2DM is associated with a more extreme phenotype than that seen in older adults [7]. Early onset has additional psycho-societal implications [8]. Those affected by early onset T2DM are of working age, and studies show that diabetes is associated with a significant negative impact on the ability to work [8]. To date there are very few data regarding undiagnosed T2DM or the number of children, adolescents and young adults at risk of developing diabetes. One UK-based clinical trial recruited overweight and obese  year olds. Of the 193 participants recruited, 5% had undiagnosed T2DM and 18% had elevated glucose levels putting them at risk of developing T2DM [9].
Screening to identify adults at risk of developing T2DM is recommended by national bodies, such as NICE in England and Wales [10]. Such guidance usually recommends a two stage approach where a non-invasive risk score which assesses the presence of risk factors is used to pre-screen before a blood test is taken to assess HbA1c or glucose levels in those at high risk [10,11]. Given T2DM can have a long asymptomatic phase, risk scores can also be used to identify people with undiagnosed T2DM. To date a plethora of risk scores have been developed and validated for use in adults to identify those at risk of either having undiagnosed T2DM or developing it in the future [12,13]. Risk scores reduce the number of people requiring blood tests, help people understand their modifiable risk factors and have been shown to reduce the cost of screening and increase uptake [14,15]. Identifying those at risk of developing T2DM and providing prevention programmes is very effective, with the landmark studies in the area showing a 58% reduction in the development of T2DM [16].
A collaboration between sites from five European countries (Germany, Greece, Portugal, Spain and UK) developed the PRESTARt tool to identify adolescents (defined here as 12-14 year olds) with risk factors associated with the lifetime development of T2DM and to develop a prevention programme for high risk adolescents. The tool defined adolescents as high risk if they had both high levels of screen time and were overweight/obese and had one other of the included risk factors: high waist circumference; acanthosis nigricans; first degree family history of diabetes; non-Caucasian ethnicity; metabolic syndrome; rapid weight gain in 1st year; pre-diabetes; high sugar intake; fatty liver disease; parental obesity; polycystic ovary syndrome; small for gestational age; and not breast fed.
The aim of this study was to evaluate whether adolescents identified at high lifetime risk of T2DM by the PRESTARt tool agree with a risk assessment conducted by a clinician using data collected from sites in five European countries. We also assessed whether the tool could be simplified.

Methods
We conducted a cross-sectional study of 12-14 year olds from sites in each of the five countries involved (Germany, Greece, Portugal, Spain and UK). These data were then assessed by a group of clinicians, who in their expert opinion deemed each participant to be at high or low risk of developing T2DM in their lifetime. This was used as the gold standard against which the tool was evaluated and refined. The cross-sectional study, outcome adjudication and evaluation of the tool are described in detail below.

Cross-sectional study
We collected data from adolescents aged 12-14 years inclusive and their families from sites in five European countries. At each site local research ethics and regulatory approvals were obtained before recruitment commenced.
The inclusion and exclusion criteria were purposely broad. Only 12-14 year olds inclusive were included (as stipulated by the terms of the European Union tender), and only those who were willing and able to give written informed assent (after written informed parental/guardian consent had been obtained). Individuals were ineligible if they did not meet the inclusion criteria and/or had an existing diagnosis of type 1 or type 2 diabetes mellitus.
We planned to sample across the BMI distribution, with over sampling at the higher BMI percentiles, to ensure we recruited sufficient numbers of participants with risk factors for developing type 2 diabetes to be able to assess the tool. The aim was to recruit between 10 and 25% with normal weight, between 25 and 50% overweight and between 30 and 50% obese, as defined by the World Health Organisation (WHO) BMI for age reference charts for children aged 5-19 years [17]. The target sample size was 500 adolescents (100 per country). This minimum sample size was chosen as methodological studies have suggested that 100-200 cases and 100-200 non-cases should be included for the external validation of risk prediction models [18]. Using the proposed sampling frame, we estimated that at least 100 of the 500 adolescents recruited should be at high risk. The final sample size recruited was 636 adolescents. A variety of recruitment settings were used. In Spain, Greece and Germany potential participants were identified in clinical settings, whereas schools were used in Portugal and the UK.
An extensive data set was collected from both the child and parents/guardians, covering health and family history, diet and lifestyle, anthropometric measures, puberty stage and biochemical measures (described below). Standard operating procedures (SOPs) for each of the measurements described were agreed and followed by each country, and technicians collecting data were trained using these SOPs. Data were collected on standardised data collection forms and entered into a web-based database developed by the Leicester Clinical Trials Unit.
Family history and current health status (see Additional file 1 for the data collection form used): The parents/guardians were asked about their family history of chronic disease such as T2DM, gestational diabetes, cardiovascular disease and stroke in themselves or their immediate relatives. Details of the child's own birth and health history were reported, including items such as the child's birth weight, their gestational period and whether they were breast or formula fed. Ethnicity was collected in Germany, Spain and UK only, due to ethical requirements in Greece and Portugal.
Diet and lifestyle questionnaire (see Additional file 2 for an example of the questionnaire used): A questionnaire booklet was collated to assess the child's diet and lifestyle habits and included questions about risk factors that may have an association with chronic disease risk. This included the PACE+ questionnaire to assess physical activity levels [19]; the Adolescent Sedentary Activity Questionnaire to assess time spent sedentary [20]; and questions pertaining to the frequency of breakfast, snack, fruit and vegetable, and sugary drink consumption [21][22][23].
Biological maturity status: The Tanner stage that the child had reached was self-reported using the Tanner scale pictures [24]. This questionnaire was not administered at the Portuguese site and was assessed by a paediatrician in Spain at the request of their ethics committees.
Anthropometric measurements: Weight was measured to the nearest 0.1 kg and height was measured to the nearest 0.1 cm using a clinically approved scale and a portable stadiometer, respectively. Body mass index (BMI) was calculated as weight (kg) / height (m)² and was converted to a BMI percentile based on WHO growth charts [17]. Waist, neck and upper arm circumferences were measured with an inelastic anthropometry tape. Waist circumference was measured to the nearest 0.1 cm at the midpoint between the lower costal margin and iliac crest. Neck and upper arm circumferences were also measured to the nearest 0.1 cm at the appropriate anatomical locations: upper arm circumference was measured at the mid-point on the belly of the bicep muscle (i.e. highest point), and neck circumference was measured at the mid-point of the neck. Arterial blood pressure was measured using an automated sphygmomanometer with an appropriately sized cuff while the participant was seated, having rested quietly for 5 minutes. Three blood pressure measurements were obtained and the average of the last two was used for analysis.
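The two derived quantities described above (BMI and the blood pressure value used for analysis) can be sketched as follows. This is a minimal illustration only: the function names are hypothetical, and the conversion of BMI to a WHO percentile is not shown because it depends on the WHO reference tables [17], which are not reproduced here.

```python
from typing import Sequence


def bmi(weight_kg: float, height_cm: float) -> float:
    """Body mass index in kg/m^2, from weight in kg and height in cm."""
    height_m = height_cm / 100.0
    return weight_kg / height_m ** 2


def blood_pressure_for_analysis(readings: Sequence[float]) -> float:
    """Average of the last two of three readings, as described in the text."""
    if len(readings) != 3:
        raise ValueError("expected exactly three readings")
    return (readings[1] + readings[2]) / 2.0
```

For example, a participant weighing 50.0 kg with a height of 160.0 cm has a BMI of 50.0 / 1.6² kg/m².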
Biochemical measures: Triglycerides, glucose, high density lipoprotein cholesterol (HDL-C) and total cholesterol were measured using a point-of-care testing (POCT) device (CardioChek® system), and low density lipoprotein cholesterol (LDL-C) was automatically calculated. Capillary blood samples (15 to 40 μL) were taken from each participant using the finger prick method. The CardioChek® system is certified by the Cholesterol Reference Method Laboratory Network (CRMLN) and National Cholesterol Education Program (NCEP), is FDA-cleared, CE-marked, internationally registered, and is CLIA-waived by the Centers for Medicare & Medicaid Services, USA. HbA1c was measured using POCT with the A1CNow®+ system, BHR Pharmaceuticals Limited (UK). The A1CNow+ system is annually certified by the National Glycohemoglobin Standardization Program (NGSP). Having successfully completed rigorous testing requirements, the A1CNow+ system was awarded a Certification of Traceability to the Diabetes Control and Complications Trial (DCCT) Reference Method (http://www.ngsp.org/bground.asp). Participants were not specifically required to fast for these blood tests; the time of the last meal consumed, or whether they were fasting, was recorded.

Evaluation of the tool
The development of the PRESTARt tool has been described previously [25]. Briefly, given there were no data available to develop such a tool, a novel approach was used. The American Diabetes Association (ADA) diabetes screening recommendations for children and young people [26] and the results from a systematic review assessing predictors of diabetes risk in children and adolescents were used to develop the tool. Once a pool of potential risk factors for inclusion had been identified, a Delphi study was undertaken to decide which of these should be included in the tool [25].
To assess the performance of the PRESTARt tool, the lifetime risk status of each participant from the cross-sectional study needed to be established and then compared to the result of the tool. A pool of clinicians, with two allocated to each participant, independently judged each participant's lifetime risk status using the extensive data collected during the cross-sectional study. Where the clinical assessments did not agree, a third clinician adjudicated. Each country provided a pool of three clinical assessors. A bespoke database was developed which presented each participant's anonymised data in an easy to read and accessible format; the last page of the system recorded the assessor's outcome. Missing data were shown as blank responses and therefore the reviewers could not use these items in their assessment. Initially all of the clinicians assessed the same 20 participants to train them in using the system and the process. Feedback was provided after this training exercise. This was followed by two rounds of assessment, one when half the data had been collected and cleaned and one at the end of the study. Assessors were randomised to participants in country based clusters, the idea being that for difficult cases the three assessors could discuss the case, although this was not required.
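The adjudication rule described above (two independent assessments, with a third used only on disagreement) can be written as a short function. This is a sketch under stated assumptions: the function name and the "high"/"low" string labels are hypothetical and are not taken from the study database.

```python
from typing import Optional


def adjudicated_risk(first: str, second: str, third: Optional[str] = None) -> str:
    """Return the final risk status ("high" or "low") for one participant.

    If the two independent assessors agree, their shared judgement is final.
    Otherwise the third assessor's judgement decides the outcome.
    """
    if first == second:
        return first
    if third is None:
        raise ValueError("assessors disagree; a third assessment is required")
    return third
```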
The PRESTARt tool, as previously described, gives a binary outcome of either low or high risk. Participants were defined as being at high lifetime risk if they had both high levels of screen time (≥2 h of TV/computer viewing per day) and were overweight/obese (≥85th BMI percentile) and had one other of the included risk factors (high waist circumference (defined using the following age/sex cut points: 12 male: 84.5 cm; 12 female: 81.2 cm; 13 male: 87.9 cm; 13 female: 84.1 cm; 14 male: 91.3 cm; 14 female: 86.9 cm), acanthosis nigricans, first degree family history of diabetes, non-Caucasian ethnicity, metabolic syndrome (defined as having three or more of: (i) high blood pressure; (ii) high cholesterol; (iii) high triglycerides; (iv) high blood glucose levels, but not in the diabetes range), rapid weight gain in 1st year (≥2 lb. (908 g) a month), pre-diabetes, high sugar intake (≥1.5 cans (or 532 ml) of carbonated sugar sweetened beverages/fruit juice a day), fatty liver disease, parental obesity (BMI ≥30 kg/m² if White European or 27 kg/m² for other ethnicities), polycystic ovary syndrome, small for gestational age (using published guidelines [27]), and not breast fed).
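The decision rule of the original tool can be sketched as below. The thresholds and waist cut points are those stated in the text; everything else is an assumption made for illustration: the class and function names are hypothetical, "one other" risk factor is read as "at least one", and a waist measurement equal to the cut point is treated as high.

```python
from dataclasses import dataclass

# Age/sex waist circumference cut points in cm, as listed in the text.
WAIST_CUT_POINTS_CM = {
    (12, "male"): 84.5, (12, "female"): 81.2,
    (13, "male"): 87.9, (13, "female"): 84.1,
    (14, "male"): 91.3, (14, "female"): 86.9,
}


def high_waist(age: int, sex: str, waist_cm: float) -> bool:
    """Assumption: a value at or above the cut point counts as high."""
    return waist_cm >= WAIST_CUT_POINTS_CM[(age, sex)]


@dataclass
class Participant:
    screen_hours_per_day: float
    bmi_percentile: float
    other_risk_factors: int  # number of the additional listed risk factors present


def original_tool_high_risk(p: Participant) -> bool:
    """High risk = high screen time AND overweight/obese AND >= 1 other factor."""
    high_screen_time = p.screen_hours_per_day >= 2
    overweight_or_obese = p.bmi_percentile >= 85
    return high_screen_time and overweight_or_obese and p.other_risk_factors >= 1
```

Because all three conditions must hold, an overweight participant with several other risk factors but less than 2 h of daily screen time is classified as low risk by this version of the tool.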
The tool outcome was compared to the adjudicated outcome using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and the area under the receiver operating curve (ROC) value.
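As a worked illustration of these measures, the sketch below computes them from a 2×2 table of tool outcome against adjudicated outcome. The function name is hypothetical. The cell counts are illustrative assumptions, not figures reported in the paper's tables: they were chosen to be consistent with the totals given in the Results (636 participants, 241 high risk by clinical review, 214 high risk by the tool).

```python
def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Agreement measures for a binary tool against a reference standard.

    tp: high risk by both tool and clinicians
    fp: high risk by tool, low risk by clinicians
    fn: low risk by tool, high risk by clinicians
    tn: low risk by both
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# Illustrative counts only (assumed, see lead-in); they sum to 636.
m = diagnostic_metrics(tp=154, fp=60, fn=87, tn=335)
for name, value in m.items():
    print(f"{name}: {value:.1%}")
```

With these assumed counts the four measures round to 64%, 85%, 72% and 79%, which matches the percentages reported for the original tool.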

Refinement of the tool
To assess whether the tool could be simplified to reduce the burden on the completer without losing statistical performance (in comparison to the clinicians' assessment of risk), we assessed the effect of removing each of the included risk factors on the statistical measures of performance (area under the receiver operating curve, sensitivity, specificity, PPV and NPV). We also sought opinions from the PRE-STARt Collaborative on ways to simplify and/or amend the tool. These included suggestions from the members about other modifiable risk factors that could be important for improving the healthy lifestyle messages provided by the tool. An example of such an approach is the FINDRISC diabetes score for use in adults. This score includes questions asking about physical activity and fruit and vegetable intake [28], not because these improve the statistical performance of the score but because they are modifiable risk factors and are therefore included for education purposes. A risk score which contains only non-modifiable risk factors may give the impression to completers that their risk cannot be changed. The evaluation was repeated for the final tool.

Results
Cross-sectional study
In total 636 participants were recruited into the study (Greece 100 (15.7%), Germany 100 (15.7%), Portugal 226 (35.5%), Spain 129 (20.3%), and UK 81 (12.7%)). Of these, 52.2% were male, with a mean age of 13.3 years. The full results are given in Table 1 and Additional file 3: Tables S1-S7. There was large variation between countries in the BMI of those recruited: overall 56% of participants had a BMI over the 85th percentile, but this ranged from 32% of those recruited in Portugal up to 91% of those recruited in Spain. Although we aimed to quota recruit by BMI, this was not possible in all countries. The majority of participants from the sites in Germany and Spain were Caucasian. In the UK site 54% were of non-white ethnicity, reflecting the ethnic diversity of the area where recruitment took place. We were unable to collect ethnicity data in Greece and Portugal. In terms of cardiovascular risk factors, 29 participants (5%) had high blood pressure and 15 (3%) had an HbA1c over 6.0%, which would be deemed high risk in adults, with five (1%) having an HbA1c over 6.5%, i.e. indicative of undiagnosed T2DM. In terms of cholesterol, four (1%) participants had high total cholesterol and 11 (2%) had high LDL cholesterol.
When applying the PRESTARt tool to the cross-sectional study data, 214 (33.7%) participants were found to be at high risk. 241 (37.9%) participants were defined as high risk by the clinical review (Table 2). For 76% of cases the two clinical experts agreed; therefore 24% required a third party to reach a final decision on the risk status. Both when using the tool and the clinical assessment there were differences between countries in terms of the percentage of participants at high risk. This reflects the differences in participant characteristics seen between the countries. Table 3 shows the statistical performance of the tool. Overall 64% of those assessed to be at high risk by the clinicians were also found to be high risk when using the tool (sensitivity). Conversely, of the 214 who were found to be at high risk by the tool, 72% were also found to be at high risk by the clinicians (PPV). The specificity and NPV look at those found to be low risk and agreement within this group. Eighty five percent of those deemed to be low risk by the clinicians were also found to be low risk when using the tool (specificity). Of those with a low risk from the tool, 79% were also found to be low risk by the clinicians (NPV). Eighteen participants (6.41%) who were deemed to be high risk by the clinicians were not overweight/obese. The area under the ROC was 0.74 (95% CI 0.71, 0.78) before refinement. In our data this value represents the probability that a randomly selected high risk adolescent will have a higher test result than a randomly selected low risk adolescent.
Table notes: Proportion of participants that are considered to be at high risk. High risk blood pressure is defined as average systolic BP ≥ 120 and average diastolic BP ≥ 80 (red zone). Fasting participants are considered to be at high risk (red zone) when glucose ≥ 7.0 mmol/L and non-fasting participants are considered to be at high risk when glucose level ≥ 11.1 mmol/L. Footnotes 5 to 12 report 1, 10, 135, 2, 9, 78, 34 and 43 missing values respectively.
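Because the tool returns only a binary result, its ROC curve has a single operating point, and the trapezoidal area under that curve reduces to the mean of sensitivity and specificity. The sketch below illustrates this relationship using the rounded percentages reported above; the function name is hypothetical, and the calculation is offered as a consistency check rather than as the authors' method for computing the AUROC or its confidence interval.

```python
def binary_auroc(sensitivity: float, specificity: float) -> float:
    """Area under the ROC curve for a test with a single cut point.

    The curve runs (0, 0) -> (1 - specificity, sensitivity) -> (1, 1),
    and the trapezoidal area under it is (sensitivity + specificity) / 2.
    """
    return (sensitivity + specificity) / 2.0


# Rounded values reported for the original tool: 64% and 85%.
print(f"{binary_auroc(0.64, 0.85):.3f}")
```

Using the rounded inputs gives a value of about 0.745, close to the reported 0.74. For a binary test the probabilistic interpretation counts tied pairs (both adolescents receiving the same tool result) as one half.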

Refinement of the PRESTARt tool
The refinement of the PRESTARt tool was conducted in two stages: first assessing the effect of removing risk factors on the statistical performance, and secondly incorporating requests from the PRE-STARt Collaborative and again assessing the effect of these on performance. Table 4 shows the area under the ROC curve for each of the tools assessed. When removing variables the performance of the tool remained fairly consistent (tools 1-12) until the removal of rapid weight gain between 0 and 4 months, when the area under the ROC reduced to 0.73. The results of these analyses were presented to the members of the study steering committee. Members requested testing the following changes to the tool: (i) Removal of high screen time from the core risk factors (Tool 13). Although sedentary behaviour has been associated with adult diabetes [29], this is an emerging area of research in those under 18 and members felt it was premature to have screen time as a core risk factor ahead of physical inactivity. Meeting notes from stakeholder events suggested that this behaviour is one of the key modifiable behaviours that GPs report as a "problem" and for this reason could be a useful starting point in getting parents to think about their child's lifestyle. (ii) Adding back in the questions about parental obesity and breast feeding (Tool 14). As these questions are relatively straightforward for parents to complete and are somewhat representative of the family lifestyle, it was felt these risk factors would broaden the message around modifiable risk factors out to the wider family. (iii) Removal of the rapid weight gain question, as parents involved in the study reported difficulties in understanding what this meant and in remembering this detail (Tool 15). (iv) Adding back in the family history question but including 2nd degree as well as 1st degree relatives, since given the age of the participants the parents may not yet have developed diabetes (Tool 16). (v) Adding in additional modifiable risk factors: we assessed adding high sugar intake, high screen time, low physical activity (< 60 mins per day) and low fruit and vegetable intake (< 5 portions per day) to the score (Tools 17-20).
All of the suggestions given above either improved the statistical performance of the tool or did not reduce it and were therefore incorporated into the final tool. The final tool is shown in Additional file 4 and the number and percentage of participants with each risk factor in Table 5. The statistical measures from the evaluation of the refined tool are given in Table 3. The sensitivity and NPV of the updated tool improved markedly, while the specificity decreased. Of those found to be high risk by the clinicians, 93% were also deemed high risk by the tool (sensitivity, up from 64%). Of those deemed at low risk by the clinicians, 67% were also found to be at low risk when using the tool (specificity, down from 85%). The NPV shows that those receiving a low risk result from the tool are not being falsely reassured, as 94% of those with a low tool result were deemed to be low risk by the clinicians.
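The refined rule, as summarised in the abstract (overweight/obese plus at least one other risk factor), can be sketched as follows. The identifiers are hypothetical labels for the eight risk factors named in the abstract, and the thresholds that define each factor are assumed to have been applied beforehand.

```python
from typing import Iterable

# The eight additional risk factors listed for the refined tool.
REFINED_RISK_FACTORS = frozenset({
    "high_waist_circumference",
    "family_history_of_diabetes",
    "parental_obesity",
    "not_breast_fed",
    "high_sugar_intake",
    "high_screen_time",
    "low_physical_activity",
    "low_fruit_and_vegetable_intake",
})


def refined_tool_high_risk(overweight_or_obese: bool, present: Iterable[str]) -> bool:
    """High risk = overweight/obese AND at least one listed risk factor present."""
    factors = set(present)
    unknown = factors - REFINED_RISK_FACTORS
    if unknown:
        raise ValueError(f"unrecognised risk factors: {sorted(unknown)}")
    return overweight_or_obese and len(factors) >= 1
```

Compared with the original rule, screen time is no longer a mandatory condition; it is one of several modifiable factors that can accompany overweight/obesity.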

Discussion
We have collected data from over 600 adolescents from five European countries. These data have been used to evaluate and refine the PRESTARt tool for identifying adolescents with risk factors for the development of T2DM in their lifetime. We have shown that the final refined tool performs well when compared to a clinician's assessment of risk. We have taken a novel and pragmatic approach to both the development and evaluation of this tool. The standard way of developing such a tool or score would be to use existing data to model the associations between risk factors and the outcome of interest. In this case we would need data which followed up a cohort of adolescents for decades so that the relationship between risk factors present in adolescence and the development of T2DM could be assessed. No such data were available and therefore a novel and pragmatic approach was taken. The tool was developed using the results from a systematic review and consensus study which identified risk factors for inclusion [25]. The tool assesses the presence of these risk factors rather than attributing weight to them; this was based on the format used for the ADA screening guidelines [26]. This approach gives equal weighting to all of the risk factors included. Although this may not be appropriate, the performance of the tool shows that its outcome is usually in agreement with that of the clinicians. This also means that the tool is easy to use in practice as no calculations or specialist equipment are required. Therefore this tool could be completed by parents outside of a health care setting. Prior to implementation in clinical practice, all such tools should be validated [30].
Table 4 The process of refinement of the tool and the performance (measured using area under the receiver operating curve (AUROC)) of each version compared to a clinician's assessment of lifetime risk of T2DM
To validate this tool a similar data set to that required for development would be needed, i.e. a longitudinal data set which follows up a representative sample of adolescents for many decades and records whether they develop T2DM. Given this was not available we have taken a different approach. We have evaluated the tool against a surrogate marker of diabetes risk, in this case clinical opinion. All participants were assessed by two clinicians independently, with a third being used where consensus was not found. In the majority of cases the two clinicians agreed. Therefore, although this approach is not without its limitations, we believe a tool evaluated in this way does still provide useful information and can be used to identify adolescents at risk without input from clinicians, for example in a community setting. Ideally long term follow up of the cohort would allow validation against the development of T2DM to be undertaken. In the shorter term, additional validation of the final refined tool using other cross-sectional data is warranted, which could include extending the validated age range beyond 12-14 year olds. We believe this is the first such tool developed for use in this age group in a European setting. Many risk tools/scores have been developed to assess diabetes risk in adults [13,31], some of which have been validated in young adults (18-25 years) [32]. There are a number of notable differences between those tools developed for use in adults and the tool developed here. Firstly the sensitivity, the percentage of those assessed as high risk by the clinicians who also had a high risk outcome from the tool, is considerably higher than that seen for adult tools (92.5% compared to 70-80%) [13,33]. High sensitivity can be due to the proportion being defined as high risk, i.e. a tool with 100% sensitivity may have defined the whole population as high risk.
That is not the case here: 56% of those screened are defined as high risk by the refined tool, and this is in line with the tools developed for use in adults [34]. The high sensitivity may also reflect that the clinicians assessing the participants used weight as the primary driver for their assessment; this is also the primary risk factor in the tool, as to be at high risk completers have to be overweight/obese with one or more additional risk factors. One could argue that weight alone should therefore be used to assess risk, and indeed doing this does not hamper the performance of the tool (ROC 0.80, 95% CI 0.77, 0.82). However, we believe a more holistic assessment of risk is important for a number of reasons. Firstly it makes the completer aware of a number of modifiable risk factors (screen time, physical activity, sugary drinks etc.), allowing individuals to target multiple risk factors. We also include family based risk factors, such as parental obesity; this may encourage family wide improvements in lifestyle, which are important in this group. Additionally there may be stigma around having and/or being an overweight/obese child [35]; making the tool less obesity orientated and more focussed on being healthy may reduce this. As previously discussed, this tool uses a pragmatic approach and does not assign weights to risk factors. This crude scoring may have also influenced the performance of the tool. Finally, many of the risk factors included in this tool are not included in the tools developed for use in adults [13]. Again this emphasises the need for tailored risk identification and prevention approaches.
This tool could be used to identify participants for lifestyle modification programmes aimed at diabetes prevention. Such programmes are usually only offered to adults with elevated glucose levels putting them at high risk of diabetes. Studies show that the prevalence of T2DM at earlier ages, from childhood through to young adulthood, is increasing [3][4][5] and therefore, to be effective, prevention initiatives need to target younger age groups and/or provide family based approaches. This paper describes the first stage of a programme of funding; the second stage is to develop and evaluate a family based healthy living intervention. Described elsewhere, this evaluation shows that positive changes in health behaviours are associated with attendance at such programmes [36]. Therefore, this programme of research has shown that it is feasible to identify high risk adolescents using a tool and that healthy behaviour can be promoted through family based prevention workshops.
The data collected in this study has the potential to be used for further hypothesis generating cross-sectional research. The data set also highlights the risk profile of those included, adding to the growing evidence base for T2DM risk in young people in European countries.
The strengths of this study include having recruited over the planned sample size and the use of standardised data collection methods across five countries. Although the non-standard methodology used for developing, evaluating and refining the tool could be seen as a limitation, it could be used to inform the development of screening tools for other areas where no data exist on which to develop a tool; one such example may be the development of screening tools for use in developing countries. The initial protocol for this study set out to purposely sample individuals to get a specific BMI distribution for the cohort. In practice this was not possible, which has led to differences between the countries included in terms of the cohort characteristics. Also we cannot guarantee the representativeness of the included samples within each country, as the study was not designed to recruit representative samples. This is an important limitation which must be taken into account when using these data. Another important limitation is the restricted age range included within this study (12-14 year olds only). This reflects the requirement from the funders of this work; future research should assess the validity of this tool in a wider age range. Ideally further validation would be undertaken in a population based cohort. In terms of the clinical review, a strength is the use of two independent reviewers for each individual. Unfortunately we did not collect data on how the clinicians made their decisions, which data they used to form these decisions or the timeframe for developing T2DM that they used. These data could have informed which risk factors to include in the refined tool. Even though this was not done, the final tool still maintains a high level of performance. Future work could conduct a qualitative study to establish how clinicians make decisions about future risk in adolescents. Adjudication was also performed within countries, i.e. the clinicians reviewing each participant all came from the same country, and therefore we cannot assess between country differences in adjudication. The methods used for the refinement of the tool may also have affected the final tool developed. The refinement was completed in two stages. Firstly we removed risk factors one at a time to try to simplify the tool while maintaining adequate performance. The order in which risk factors were removed was arbitrary and this could have affected the final tool produced. The second stage of the refinement, however, was based on stakeholder requests, and here risk factors were removed or included based on clinical opinion, taking into account the ease of completion when used in practice. Therefore we feel that, although the final tool produced is pragmatic, the high levels of performance are reassuring.

Conclusions
In conclusion, we have refined and evaluated a tool for identifying adolescents with risk factors associated with the lifetime development of T2DM. This tool has high agreement with clinical opinion. Future work to validate the tool using prospective data is required. The PRESTARt tool could be used both by parents and health care professionals to identify adolescents for referral into diabetes prevention programmes.

Additional files
Additional file 1: This file includes the case report form which was used to capture data from participants. (PDF 1100 kb)
Additional file 2: This file includes an example (female participant version) of the questionnaire completed by all participants. (PDF 400 kb)
Additional file 3: Table S1. Family medical history summary statistics. Data reported as n (%). Table S2. Participant medical history summary statistics. Data reported as n (%) unless otherwise stated. Table S3. Socioeconomic summary statistics. Data reported as n (%). Table S4. Perinatal history summary statistics. Data reported as mean (sd) unless otherwise stated. Table S5. Summary statistics for participant's physical activity and sedentary behaviour. Data reported as n (%) unless otherwise stated. Table S6. Summary statistics for participant's diet. Data reported as mean (sd). Table S7.