Skip to main content

Metabolite biomarkers of type 2 diabetes mellitus and pre-diabetes: a systematic review and meta-analysis



We aimed to explore metabolite biomarkers that could be used to identify pre-diabetes and type 2 diabetes mellitus (T2DM) using systematic review and meta-analysis.


Four databases, the Cochrane Library, EMBASE, PubMed and Scopus were selected. A random effect model and a fixed effect model were applied to the results of forest plot analyses to determine the standardized mean difference (SMD) and 95% confidence interval (95% CI) for each metabolite. The SMD for every metabolite was then converted into an odds ratio to create an metabolite biomarker profile.


Twenty-four independent studies reported data from 14,131 healthy individuals and 3499 patients with T2DM, and 14 included studies reported 4844 healthy controls and a total of 2139 pre-diabetes patients. In the serum and plasma of patients with T2DM, compared with the healthy participants, the concentrations of valine, leucine, isoleucine, proline, tyrosine, lysine and glutamate were higher and that of glycine was lower. The concentrations of isoleucine, alanine, proline, glutamate, palmitic acid, 2-aminoadipic acid and lysine were higher and those of glycine, serine, and citrulline were lower in prediabetic patients. Metabolite biomarkers of T2DM and pre-diabetes revealed that the levels of alanine, glutamate and palmitic acid (C16:0) were significantly different in T2DM and pre-diabetes.


Quantified multiple metabolite biomarkers may reflect the different status of pre-diabetes and T2DM, and could provide an important reference for clinical diagnosis and treatment of pre-diabetes and T2DM.

Peer Review reports


Type 2 diabetes mellitus (T2DM) is a highly prevalent chronic disease that is associated with the development of complications including diabetic retinopathy, kidney disease and diabetic ketoacidosis [1, 2], which represent serious threats to human health. Between 1980 and 2014, the number of adults with diabetes increased from 108 million to 422 million [3], with T2DM accounting for > 90% of these cases [4]. Recent studies have shown that diabetes has become one of the three major diseases in the world with the increasing global prevalence rate [5]. However, the symptoms of T2DM are not very obvious or only partially manifest in the early stages of the disease. Therefore, it is particularly important to identify an early diagnosis and effective treatment for diabetes.

In view of the high incidence of T2DM and its serious consequences, the identification of novel diagnostic markers for T2DM has become a subject of intense research. The existing recognized diagnostic biomarkers of T2DM are blood glucose (including fasting blood glucose and 2 h glucose in oral glucose tolerance test) and hemoglobin A1c. The metabolomic approach aims to identify all the metabolites present in a biologic system, whether cells, tissues or living organisms, to identify their physiologic or pathologic effects [6]. The development of metabolomics makes it possible for metabolites to be identified as biomarkers that may be useful for the diagnosis or treatment of diabetes. For example, amino acids have been proposed to be useful diagnostic biomarkers because the metabolism of amino acids is considerably altered in pre-diabetes and continue to vary over the course of T2DM progression [7, 8]. In particular, tryptophan and branched chain amino acids (BCAAs, including valine, leucine and isoleucine) could represent potentially useful biomarkers of T2DM because their serum concentrations are higher in T2DM patients [9]. Additionally, plasma phospholipid such as phosphatidylinositol and sphingomyelin were capable of discriminating healthy individuals and T2DM patients [10].

It is critical to study of bring data on the appearance of metabolic profile abnormalities before the occurrence of pre-diabetes or T2DM, since this might predict and allow prevent the disease progression to pre-diabetes or T2DM. However, there is no current consensus regarding the use of metabolites as diagnostic biomarkers of T2DM, and part of the results were from clinical single-center or insufficient consideration of mixed factors such as different regions and different populations [11]. Therefore, it is a need for an effective and comprehensive evaluation method for the use of metabolites as diagnostic biomarkers of pre-diabetes or early T2DM. The study from Guasch-Ferré et al. showed that several amino acids were consistently associated with the risk of T2DM [12]. Since then, a number of original studies emerged. We hence undertook a systematic review and meta-analysis of the proposed biomarkers of T2DM or pre-diabetes revealed by published metabolomics and constructed a profile of the metabolite biomarkers. The purpose of this study is to explore metabolite biomarkers integrating biomarkers from different studies through systematic review and meta-analysis, which could provide further evidence for early diagnosis of pre-diabetes and T2DM.


The systematic review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [13].

Data sources and search strategy

The Cochrane Library, EMBASE, PubMed and Scopus were searched for studies published from the earliest available online to May 31, 2019. The search words were “metabonomics”, “metabolomics”, “metabolome”, “type 2 diabetes”, “type 2 diabetics”, “type 2 diabetes mellitus”, “insulin resistance”, “HOMA-IR”, “Impaired glucose tolerance”, “glucose intolerance”, “impaired fasting insulin”, “impaired fasting glucose”, “prediabetic”, “pre-diabetes” and “prediabetes” connected with OR and/or AND. To ensure the relevance of the retrieved results, the “Title, Abstract, Keywords” terms were used in the four databases.

Study selection and inclusion criteria

The titles, abstracts and full texts of the articles were evaluated after duplicate records were removed. Before literature screening, the inclusion criteria for the publications obtained were formulated by two authors (Long and Yang) as follows: (1) studies conducted in humans; (2) the participants in the study were not gestational diabetes mellitus (GDM), type 1 diabetes mellitus (T1DM) or subjects under 18 years of age; (3) the study included a diabetic group or a prediabetic group and diagnosis was performed according to the international diagnostic guidelines [14]; (4) the article was not a review, conference abstract, editorial or note; (5) the biologic samples analyzed were collected in the fasting state and (6) the study was not conducted with dietary interventions and (or) medications. The publications initially identified as relevant were screened independently by two investigators (Long and Yang) using Endnote X7 (Thomson ResearchSoft, Stanford, USA). If there was any disagreement regarding the selection or inclusion of a study, this was resolved by discussion or by involvement of a third author (Yan). Studies of biomarkers of human pre-diabetes and T2DM identified using metabonomic technology have been included. The prediabetic category included subject who met the above inclusion criteria and had impaired glucose tolerance (IGT) or impaired fasting glucose (IFG) [15].

Quality assessment and data extraction

The Newcastle-Ottawa Scale (NOS) criteria [16] were used to assess each publication to improve the overall reliability of the extracted data. Three domains, the comparability of cases and controls, selection of cases and controls and exposure, were subdivided into eight risk assessment items. The comparability domain was awarded a maximum of two stars and other items were awarded a maximum of one star, which indicated low, moderate or high risk of bias, respectively. High and low NOS scores reflect low and high risks of bias, respectively.

Two investigators (Long and Yang) independently extracted appropriate information, including the names of the authors and journal, year of publication, study design, population, sample sizes of the case and control groups, the biologic samples obtained, analytic method, determination method, covariates of statistical analysis in the study and the identity and concentrations of the metabolites detected [reported as mean ± standard deviation (SD) or standard error (SE)] in the case and control groups. For the publications that did not provide mean values, we extracted the hazard ratio or odds ratio (OR) and its 95% confidence interval (95% CI). We also extracted the median and interquartile range values from two publications regarding pre-diabetes.

Statistical analysis

Forest plots for each metabolite for which mean ± SD/SE values were available were produced using Review Manager 5.3 software. The raw data for each metabolite were described in the forest plots, which reflected the weighted contribution of each study. The heterogeneities of the pooled means generated using the forest plots were assessed using the I2 statistic. For continuous variables, random effect models [17] were used to assess the pooled means when I2 > 50%; otherwise, fixed effect models were used. The outcomes were considered to be statistically significant when P < 0.05.

To clearly illustrate the relationships between metabolites, pre-diabetes and T2DM, the data provided in the publications were reprocessed. We calculated estimated means and SDs for each metabolite for which median and interquartile ranges were reported in the publications [18, 19]. Because the published data were presented in different forms, using means ± SD/SE or OR value, the outcome indicators were unified to better express the results. The mean ± SD of each metabolite provided in included studies was calculated as standardized mean difference (SMD), and then the SMD was converted to OR value using formula 1 [20, 21].

$$ \ln \kern0.5em OR= SMD\kern0.5em \ast \frac{\pi }{\sqrt{3}} $$

The mean and SD for ORs were obtained using SPSS 20.0 (IBM, Inc., Armonk, NY, USA) and converted outliers were removed when their values were larger than the mean plus five times SD [22]. The ORs were used to construct scatter diagrams with Graphpad Prism 7.0 (GraphPad Software, Inc., San Diego, USA), ensuring that there were at least three sets of data for each metabolite.


Study selection

A total of 3072 publications were identified from the database, and 1549 relevant articles remained after the removal of duplicate studies. A further 1408 publications were excluded after evaluating their titles and abstracts. These comprised 971 studies unrelated to the research topics; 68 that were on inflammation or cardiovascular diseases; 25 on polycystic ovary syndrome; 41 on non-alcoholic fatty liver disease; 156 were reviews, abstracts, editorials, conference papers or notes and 147 were performed on animals. Thus, 141 publications remained for assessment of the full text. After excluding studies of T1DM or GDM and qualitative research, 34 studies remained for inclusion in the meta-analysis, 20 of which were of T2DM, 10 were of pre-diabetes and 4 were of both T2DM and pre-diabetes. The PRISMA flow diagram for the meta-analysis is presented in Fig. 1.

Fig. 1
figure 1

PRISMA flow diagram for the included studies of type 2 diabetes mellitus and pre-diabetes. After the removal of duplicates from the same database, 112 articles were found within the Cochrane Library, 984 within EMBASE, 689 within PubMed and 1287 within Scopus

Quality assessment

The scores of for the studies included in this meta-analysis, generated using the NOS criteria, were shown in Table S1 and Table S2. The maximum score, awarded on the basis of eight risk assessment items [16], was nine stars. Studies with a score of five stars or more were regarded as of medium-to-high quality; otherwise, they were to be categorized as poor-quality and excluded. However, the lowest score was six stars. This implies that all the included studies were of medium-to-high quality, meaning that the data extracted were suitable for inclusion in the meta-analysis.

Characteristics of the included studies

The characteristics of the included studies are shown in Table 1. They comprised 24 independent studies reporting data from 14131 healthy participants and 3499 T2DM patients [10, 11, 22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43]. All these studies compared T2DM patients with healthy participants. Four of the studies were prospective [26, 33, 37, 41] and four literatures were cohort studies [30, 31, 35, 43]. There were two cross-sectional studies [22, 40] and four follow-up studies [32, 34, 36, 42], and the rest were case-control studies. The results of most of the studies were presented as mean ± SD/SE, but some were presented as ORs.

Table 1 Characteristics of the data included in the meta-analysis of type 2 diabetes mellitus

As shown in Table 2, there were 14 studies of pre-diabetes included in the meta-analysis, which contained a total of 2139 prediabetic patients and 4844 healthy controls [22, 24, 25, 29, 44,45,46,47,48,49,50,51,52,53]. Among these studies, one was a cross-sectional study [22], two were follow-up studies [29, 39], one was a longitudinal study [53], one was a cohort study [50], and the remaining nine were case-control studies. The participants in one study conducted in Gothenburg were only female [29]. The results of two of the studies were presented as medians and interquartile ranges [51, 52]. Therefore, the mean and SD of the metabolite concentrations mathematically were estimated [18, 19].

Table 2 Characteristics of the publications included in the meta-analysis of biomarkers in pre-diabetes

Metabolites analyses

Characteristics of the metabolites studied

Metabolites including amino acids, lipids, saccharides and others were analyzed in the 24 studies of T2DM. The frequencies of analysis of each metabolite in the 24 studies were counted and metabolites quantified in three or more studies are shown as a bubble diagram (Fig. 2a). The four categories of metabolite are shown in pink, green, blue and purple, respectively. The ordinal numbers on the bubbles represent different metabolites and the size of each bubble is indicative of the number of studies in which it was analyzed. Eighteen amino acids, five lipids, three saccharides and three other metabolites were assayed. Thus, the most studied metabolites were amino acids, of which the four most commonly analyzed were isoleucine, valine, glycine and leucine, in 14, 13, 12 and 12 studies, respectively. Metabolites studied on less than three articles were excluded, as summarized in Table S3.

Fig. 2
figure 2

Bubble diagram of the publications on type 2 diabetes mellitus (a) and pre-diabetes (b)

For pre-diabetes, the number of metabolites studied in publications was significantly lower in the 14 studies included than for T2DM, as shown in Fig. 2b. There were 14 amino acids, 2 lipids and 3 other metabolites. The top three most commonly analyzed amino acids were leucine, isoleucine and valine, which were studied on 11, 10 and 8 occasions, respectively. Metabolites studied on less than three articles were excluded, as summarized in Table S4.

Analysis of metabolites associated with T2DM

On the basis of data extracted with means ± SD/SE forest plots for each metabolite were created using Review Manager 5.3. Because the dimensions and units used in the studies differed, SMDs were used for the forest plot outputs. For the T2DM studies, because the I2 values for glycine and tyrosine were 29 and 43%, respectively, with a P value in the Q test > 0.1, fixed effect models were used to calculate combined effect sizes. Moreover, the I2 values for valine, leucine, isoleucine, proline, glutamate, lysine, phenylalanine, alanine, histidine and serine were > 90% (Table S5). That is, random effect models were used for these metabolites [17, 54].

As shown in Fig. 3, the concentrations of BCAAs and aromatic amino acids (AAAs) were significantly higher in the serum and plasma of T2DM patients than in control participants. The SMDs of valine (0.91 [0.59, 1.23], P < 0.00001), leucine (0.93 [0.57, 1.29], P < 0.00001), isoleucine (0.93 [0.60, 1.27], P < 0.00001), phenylalanine (0.86 [0.42, 1.31], P = 0.0001) and tyrosine (0.56 [0.37, 0.75], P < 0.00001) were statistically significant. Additionally, the concentration of glycine (− 0.42 [− 0.49, − 0.34], P < 0.00001) was lower and those of proline (0.50 [0.18, 0.82], P = 0.002), glutamate (0.63 [0.19, 1.07], P = 0.005) and lysine (0.84 [0.28, 1.40], P = 0.003) were higher, in the serum and plasma of patients with T2DM than in control participants (Fig. S1). Thus, valine, leucine, isoleucine, tyrosine, glycine, proline, glutamate and lysine could be considered as biomarkers of T2DM according to their forest plots and the first five of these are likely to be most useful, given the associated P values.

Fig. 3
figure 3

Pooled analysis of valine (a), leucine (b), isoleucine (c), phenylalanine (d) and tyrosine (e) in the serum or plasma of type 2 diabetes mellitus patients and control participants

Analysis of metabolites associated with pre-diabetes

For the prediabetic studies, the I2 values for isoleucine, proline, citrulline, 2-aminoadipic acid and lysine were less than 50%, with the P value for the Q test > 0.1 and therefore fixed effect models were used to calculate the combined effect sizes. The I2 values for glycine, alanine, glutamate, serine and palmitic acid (C16:0), leucine, valine, tyrosine, phenylalanine, propionylcarnitine (C3), carnitine (C0), asparagine, tryptophan and myristate (C14:0) were > 50% (Table S6). Therefore, random effect models were used.

As shown in Fig. 4 and Table S6, the concentrations of valine (1.29 [0.75, 1.83], P < 0.00001), leucine (1.07 [0.61, 1.54], P < 0.00001), isoleucine (0.45 [0.36, 0.54], P < 0.00001), phenylalanine (0.92 [0.40, 1.43], P = 0.0004) and tyrosine (1.10 [0.58, 1.62], P < 0.0001) were significantly higher in the serum and plasma of prediabetic patients than in control participants. The concentration of glycine (− 0.76 [− 1.00, − 0.51], P < 0.00001) was lower, while those of proline (0.41 [0.23, 0.59], P < 0.00001), glutamate (0.61 [0.20, 1.02], P = 0.004) and lysine (0.36 [0.24, 0.49], P < 0.00001) were higher in the serum or plasma of prediabetic patients than in control participants (Fig. S2). Furthermore, there were statistically significant differences in the concentrations of serine, citrulline, 2-aminoadipic acid and palmitic acid (C16:0) in the serum or plasma between prediabetic and healthy participants, as shown in Fig. S3 and Table S6. The concentrations of alanine (0.57 [0.30, 0.83], P < 0.0001), 2-aminoadipic acid (0.69 [0.43, 0.95], P < 0.00001), propionylcarnitine (C3) (1.65 [0.83, 2.48], P < 0.0001) and palmitic acid (C16:0) (0.85 [0.44, 1.26], P < 0.0001) in the serum or plasma of prediabetic patients were higher than those of healthy participants, while concentrations of serine (− 0.37 [− 0.70, − 0.04], P = 0.03) and citrulline (− 0.37 [− 0.49, − 0.25], P < 0.00001) were lower. This implies that isoleucine, glycine, proline, glutamate, lysine, serine, citrulline, 2-aminoadipic acid and palmitic acid (C16:0) may represent biomarkers of prediabetes.

Fig. 4
figure 4

Pooled analysis of valine (a), leucine (b), isoleucine (c), phenylalanine (d) and tyrosine (e) in serum and plasma of pre-diabetes and control groups. Studies with several populations comparing patients with pre-diabetes and controls are described by the author name followed by A or B to indicate, for example, subdivision according to sex

Integrative analysis of the metabolite biomarkers

Forest plots were only constructed for metabolites analyzed in at least three studies included in the meta-analysis, but these may not represent the most widely applicable assays. For example, IFG and IGT were only assessed in prediabetic patients in one study [44], but more than three datasets for each metabolite can be more reliably integrated to reflect the features of pre-diabetes. Therefore, we conducted integrative profiling to scientifically combine all the data provided in the included studies.

The ORs for each metabolite provided in the included publications were analyzed to reflect the characteristics of the disease biomarkers, excluding publications containing outliers. As shown in Fig. 5a, 23 metabolites of those analyzed in T2DM patients remained after those with outliers had been excluded. In the scatter diagram, the dots represent the ORs and the colors represent the types of metabolite. The mean ORs for the isoleucine (OR = 2.19), leucine (OR = 1.95), valine (OR = 1.91), phenylalanine (OR = 1.88), lysine (OR = 2.43), arginine (OR = 0.83), methionine (OR = 1.14), glycine (OR = 0.88), tyrosine (OR = 1.99), serine (OR = 0.83), proline (OR = 1.74), alanine (OR = 1.21), glutamate (OR = 1.81), citrulline (OR = 0.69), histidine (OR = 1.44), glutamine (OR = 0.46) and ornithine (OR = 0.94) were significant. The mean ORs for lysophosphatidylcholine (LPC C18:2) and palmitic acid (C16:0) were 0.68 and 1.26, respectively. The mean ORs for glucose and mannose were 5.17 and 4.65, respectively. And the mean ORs for lactate and glycerol were 2.51 and 2.32, respectively. The mean ORs of all the metabolites were demonstrated the characteristic metabolic profile of T2DM.

Fig. 5
figure 5

Metabolite profiling diagrams. a Metabolite profiling diagrams for metabolites in type 2 diabetes mellitus patients. b Metabolite profiling diagrams for metabolites in pre-diabetic patients. Abbreviations: OR, odds ratio; LPC, lysophosphatidylcholine; L-GPC, linoleoyl-glycerophospho-choline; α-HIB, α-hydroxyisobutyric acid; β-HB, β-hydroxybutyric acid; 4-MOP, 4-methyl-2-oxopentanoic acid; 3-MOP, 3-methyl-2-oxopentanoic acid; 3-MOB, 3-methyl-2-oxobutyric acid; α-KB, ketobutyric acid; β-HIB, β-hydroxyisobutyric acid

In the pre-diabetes studies, as shown in Fig. 5b, 32 metabolites were analyzed, comprising 13 amino acids, 5 lipids and 14 other metabolites. The mean ORs of isoleucine (OR = 1.92), leucine (OR = 2.03), valine (OR = 2.24), phenylalanine (OR = 2.15), lysine (OR = 1.84), asparagine (OR = 0.42), glycine (OR = 0.55), tyrosine (OR = 2.57), serine (OR = 0.79), proline (OR = 2.05), alanine (OR = 3.23), glutamate (OR = 5.01), citrulline (OR = 0.56), LPC (C18:2) (OR = 0.73), palmitic acid (C16:0) (OR = 5.99), oleic acid (OR = 1.52), linoleoyl-glycerophospho-choline (OR = 0.62), myristic acid (C14:0) (OR = 1.68), hydroxyisovaleroyl carnitine (OR = 1.18), propionylcarnitine (C3) (OR = 2.41), carnitine (C0) (OR = 3.12), 2-aminoadipic acid (OR = 2.21), α-hydroxyisobutyric acid (OR = 2.40), β-hydroxybutyric acid (OR = 1.38), 4-methyl-2-oxopentanoic acid (OR = 1.57), 3-methyl-2-oxopentanoic acid (OR = 1.49), 3-methyl-2-oxobutyric acid (OR = 1.57), ketobutyric acid (OR = 1.90), 3-hydroxyisobutyric acid (OR = 1.62), vitamin B5 (OR = 1.27), α-ketoglutarate (OR = 1.08) and trigonelline (OR = 0.85) were significant. Unlike T2DM, no saccharides were analyzed. The mean ORs for all the metabolites were constructed to indicate the characteristics of the metabolic profile for pre-diabetes.

From Fig. 5, obviously, alanine, citrulline, glutamate, glycine, isoleucine, leucine, lysine, phenylalanine, proline, serine, tyrosine and valine amino acids, LPC (C18:2) and palmitic acid (C16:0) were statistically similar between T2DM/pre-diabetes patients and healthy controls. The obvious difference in pre-diabetes and T2DM indicates that these disease stages are associated with distinct and quantified metabolic biomarker profiles. In particular, the metabolic biomarkers alanine, glutamate and palmitic acid (C16:0) were significantly different in pre-diabetes and T2DM, which suggests that quantified concentrations of this three metabolites are potential for use as integrative biomarkers for the differentiation of pre-diabetes and T2DM.


The use of a single biomarker to diagnose a disease lacks specificity because multiple disease processes are likely to affect its concentration. Additionally, the main disadvantage of the simple addition of other biomarkers is that their discriminative ability typically overlaps, also limiting the use of this approach [55]. Some studies involving diet were excluded, because the higher intake of metabolites might falsely raise their levels in metabolomics [56]. The use of single biomarkers is limited by the effects of external factors, such as diet. Furthermore, risk models containing biomarkers derived from the pathways directly affected by the disease itself may not demonstrate high predictive value. We believe that the integration of data regarding a number of biomarkers more accurately predict the occurrence of pre-diabetes/T2DM, and map the patient’s current state in a precise manner, which might prevent the further development of T2DM, diabetic macro- and microphaties.

The present meta-analysis, which included 34 independent studies reported data from 14,515 healthy participants, 3499 patients of T2DM and 2139 with pre-diabetes, was performed based on both original OR and OR converted from SMD value. SMD could reflect the original data of each study, and reduce the deviations caused by different methods in included studies. Therefore, the comparability and reliability of meta-analysis are acceptable [21]. There were 23 metabolites concerning T2DM and 32 metabolites concerning pre-diabetes based on included studies. From Fig. 5, obviously, 12 amino acids, LPC (C18:2) and palmitic acid (C16:0) were statistically similar between T2DM/pre-diabetes patients and healthy controls. Metabolite biomarkers of T2DM and pre-diabetes revealed that the levels of alanine, glutamate and palmitic acid (C16:0) are significantly different in T2DM and pre-diabetes. These findings could reflect the different status of pre-diabetes and T2DM, and could provide an important reference for clinical diagnosis and treatment of pre-diabetes and early T2DM, which might prevent the further development of T2DM and reduce the incidence of diabetes complications.

Integrated profiling reflects a set of biomarkers in the context of a network, instead of considering only single or isolated biomarkers. As shown in Fig. S4, the pathogenesis of T2DM is complex and involves many signaling pathways, which has not yet been fully elucidated. Integration hence of the pre-existing metabolite biomarkers may be useful for the prevention and diagnosis of T2DM and pre-diabetes. This method of analysis is suitable for the integration of a number of types of data; for instance, both amino acid biomarkers, belonging to centralized data with strong regularity and a wide range of metabolic biomarkers, belonging to dispersive and isolated data with irregularity. The goals of most studies is improving the diagnosis rate of pre-diabetes and early T2DM, which could reduce the incidence of T2DM and diabetes complications through early intervention treatment. Integrative profiling of metabolic biomarkers should be able to provide reliable references for the selection of biomarkers suitable for the prediction and diagnosis of T2DM and pre-diabetes in the future. It is more potential clinical valuable for high incidence of diabetes (such as China and India) to explore metabolite biomarkers profile for identification and diagnosis of pre-diabetes and T2DM [3]. For the abnormal amino acid and lipid profiles (low levels of metabolites such as glycine, serine and LPC (18:2)), is it possible to increase their levels through external intake to reduce the incidence of pre-diabetes or T2DM? It is worthwhile to design experiments to verify this conjecture at the animal level in future. In further research, conducting clinical, multi-center cohort or prospective observation trials are necessary and important research works.

Although it has shown that quantified metabolic biomarkers could reflect T2DM and pre-diabetes, there were some limitations to the approach used. First, some relevant studies may not have been retrieved from the databases using the search terms described. Second, there were fewer studies of some of the metabolites and there is likely to be a publication bias in favor of positive findings, which may have introduced bias into our analysis. Third, all the information regarding the samples and data collected were derived from the included studies, so the potential confounding factors present in these studies, such as ethnicity, region, education and physical health of the participants might have affected the study results. Although the accuracy of the meta-analysis results was affected by the original research data form included studies, the conclusions of this study were obtained from the meta-analysis conducted in strict compliance with the included criteria and the PRISMA guidelines.


Quantified multiple metabolite biomarkers are useful strategy to differentiate pre-diabetes and T2DM, and we believe that it has potential clinical value for the diagnosis of T2DM.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.



Type 2 diabetes mellitus


Branched chain amino acids


Preferred Reporting Items for Systematic Reviews and Meta-Analyses


Gestational diabetes mellitus


Type 1 diabetes mellitus


Impaired glucose tolerance


Impaired fasting glucose


Newcastle-Ottawa Scale


Standard deviation


Standard error


Odds ratio


Confidence interval


Standardized mean difference




  1. Kostraba JN, Klein R, Dorman JS, et al. The epidemiology of diabetes complications study. IV. Correlates of diabetic background and proliferative retinopathy. Am J Epidemiol. 1991;133(4):381–91.

    Article  CAS  PubMed  Google Scholar 

  2. Roberts MD, Slover RH, Chase HP. Diabetic ketoacidosis with intracerebral complications. Pediatr Diabetes. 2001;2(3):109–14.

    Article  CAS  PubMed  Google Scholar 

  3. Collaboration NRF. Worldwide trends in diabetes since 1980: a pooled analysis of 751 population-based studies with 4.4 million participants. Lancet. 2016;387(10027):1513–30.

    Article  Google Scholar 

  4. Howarth FC, Shafiullah M, Qureshi MA. Chronic effects of type 2 diabetes mellitus on cardiac muscle contraction in the Goto-Kakizaki rat. Exp Physiol. 2007;92(6):1029–36..

    Article  CAS  PubMed  Google Scholar 

  5. Li G, Zhang P, Wang J, et al. Cardiovascular mortality, all-cause mortality, and diabetes incidence after lifestyle intervention for people with impaired glucose tolerance in the Da Qing diabetes prevention study: a 23-year follow-up study. Lancet Diabetes Endocrinol. 2014;2(6):474–80.

    Article  PubMed  Google Scholar 

  6. Roberts G. Metabolomics The Encyclopedia of Biophysics; 2013. p. 89–98.

    Book  Google Scholar 

  7. Tessari P, Cecchet D, Cosma A, et al. Insulin resistance of amino acid and protein metabolism in type 2 diabetes. Clin Nutr. 2011;30(3):267–72.

    Article  CAS  PubMed  Google Scholar 

  8. Michaliszyn SF, Sjaarda LA, Mihalik SJ, et al. Metabolomic profiling of amino acids and beta-cell function relative to insulin sensitivity in youth. J Clin Endocrinol Metab. 2012;97(11):E2119–24.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Wurtz P, Soininen P, Kangas AJ, et al. Branched-chain and aromatic amino acids are predictors of insulin resistance in young adults. Diabetes Care. 2013;36(3):648–55.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Zhu C, Liang QL, Hu P, et al. Phospholipidomic identification of potential plasma biomarkers associated with type 2 diabetes mellitus and diabetic nephropathy. Talanta. 2011;85(4):1711–20.

    Article  CAS  PubMed  Google Scholar 

  11. Liu X, Gao J, Chen J, et al. Identification of metabolic biomarkers in patients with type 2 diabetic coronary heart diseases based on metabolomic approach. Sci Rep. 2016;6:30785.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Guasch-Ferré M, Hruby A, Toledo E, et al. Metabolomics in prediabetes and diabetes: a systematic review and meta-analysis. Diabetes Care. 2016;39(5):833–46.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  13. Moher D, Liberati A, Tetzlaff J, et al. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. BMJ. 2009;339:b2535.

    Article  PubMed  PubMed Central  Google Scholar 

  14. American Diabetes A. Diagnosis and classification of diabetes mellitus. Diabetes Care. 2010;34(Suppl 1):S62–9.

    Article  Google Scholar 

  15. World Health Organisation. Definition and diagnosis of diabetes mellitus and intermediate hyperglycemia: Report of a WHO/IDF consultation. Geneva: World Health Organization; 2006.

  16. Hadwin JA, Lee E, Kumsta R, et al. Cortisol awakening response in children and adolescents with autism spectrum disorder: a systematic review and meta-analysis. Evid Based Ment Health. 2019;22(3):118–24.

    Article  PubMed  Google Scholar 

  17. Higgins JPT, Thompson SG. Quantifying heterogeneity in a meta-analysis. Stat Med. 2002;21(11):1539–58.

    Article  PubMed  Google Scholar 

  18. Wan X, Wang W, Liu J, et al. Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range. BMC Med Res Methodol. 2014;14(1):135.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Luo D, Wan X, Liu J, et al. Optimally estimating the sample mean from the sample size, median, mid-range, and/or mid-quartile range. Stat Methods Med Res. 2018;27(6):1785–805.

    Article  PubMed  Google Scholar 

  20. Hasselblad V, Hedges LV. Meta-analysis of screening and diagnostic tests. Psychol Bull. 1995;117(1):167–78.

    Article  CAS  PubMed  Google Scholar 

  21. da Costa BR, Rutjes AW, Johnston BC, et al. Methods to convert continuous outcomes into odds ratios of treatment response and numbers needed to treat: meta-epidemiological study. Int J Epidemiol. 2012;41(5):1445–59.

    Article  PubMed  Google Scholar 

  22. Wang-Sattler R, Yu Z, Herder C, et al. Novel biomarkers for pre-diabetes identified by metabolomics. Mol Syst Biol. 2012;8(1):615.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  23. Ha CY, Kim JY, Paik JK, et al. The association of specific metabolites of lipid metabolism with markers of oxidative stress, inflammation and arterial stiffness in men with newly diagnosed type 2 diabetes. Clin Endocrinol. 2012;76(5):674–82.

    Article  CAS  Google Scholar 

  24. Menni C, Fauman E, Erte I, et al. Biomarkers for type 2 diabetes and impaired fasting glucose using a nontargeted metabolomics approach. Diabetes. 2013;62(12):4270–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Kujala U, Peltonen M, Laine M, et al. Branched-chain amino acid levels are related with surrogates of disturbed lipid metabolism among older men. Front Med. 2016;3:57.

    Article  Google Scholar 

  26. Peddinti G, Cobb J, Yengo L, et al. Early metabolic markers identify potential targets for the prevention of type 2 diabetes. Diabetologia. 2017;60(9):1740–50.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Okekunle AP, Li Y, Liu L, et al. Abnormal circulating amino acid profiles in multiple metabolic disorders. Diabetes Res Clin Pract. 2017;132:45–58.

    Article  CAS  PubMed  Google Scholar 

  28. Abu Bakar MH, Sarmidi MR. Association of cultured myotubes and fasting plasma metabolite profiles with mitochondrial dysfunction in type 2 diabetes subjects. Mol BioSyst. 2017;13(9):1838–53.

    Article  CAS  PubMed  Google Scholar 

  29. Andersson-Hall U, Gustavsson C, Pedersen A, et al. Higher concentrations of BCAAs and 3-HIB are associated with insulin resistance in the transition from gestational diabetes to type 2 diabetes. J Diabetes Res. 2018;2018:1–12.

    Article  CAS  Google Scholar 

  30. Lee HS, Xu T, Lee Y, et al. Identification of putative biomarkers for type 2 diabetes using metabolomics in the Korea association REsource (KARE) cohort. Metabolomics. 2016;12:178.

    Article  CAS  Google Scholar 

  31. Floegel A, Stefan N, Yu Z, et al. Identification of serum metabolites associated with risk of type 2 diabetes using a targeted metabolomic approach. Diabetes. 2013;62(2):639–48.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Gogna N, Krishna M, Oommen AM, et al. Investigating correlations in the altered metabolic profiles of obese and diabetic subjects in a south Indian Asian population using an NMR-based metabolomic approach. Mol BioSyst. 2015;11(2):595–606.

    Article  CAS  PubMed  Google Scholar 

  33. Lu Y, Wang Y, Ong CN, et al. Metabolic signatures and risk of type 2 diabetes in a Chinese population: an untargeted metabolomics study using both LC-MS and GC-MS. Diabetologia. 2016;59(11):2349–59.

    Article  CAS  PubMed  Google Scholar 

  34. Palmer ND, Stevens RD, Antinozzi PA, et al. Metabolomic profile associated with insulin resistance and conversion to diabetes in the insulin resistance atherosclerosis study. J Clin Endocrinol Metab. 2015;100(3):E463–8.

    Article  CAS  PubMed  Google Scholar 

  35. Ng DP, Salim A, Liu Y, et al. A metabolomic study of low estimated GFR in non-proteinuric type 2 diabetes mellitus. Diabetologia. 2012;55(2):499–508.

    Article  CAS  PubMed  Google Scholar 

  36. Liu J, Semiz S, van der Lee SJ, et al. Metabolomics based markers predict type 2 diabetes in a 14-year follow-up study. Metabolomics. 2017;13(9):104.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  37. Merino J, Leong A, Liu CT, et al. Metabolomics insights into early type 2 diabetes pathogenesis and detection in individuals with normal fasting glucose. Diabetologia. 2018;61(6):1315–24.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Li L, Wang C, Yang H, et al. Metabolomics reveal mitochondrial and fatty acid metabolism disorders that contribute to the development of DKD in T2DM patients. Mol BioSyst. 2017;13(11):2392–400.

    Article  CAS  PubMed  Google Scholar 

  39. Chou J, Liu R, Yu J, et al. Fasting serum αhydroxybutyrate and pyroglutamic acid as important metabolites for detecting isolated post-challenge diabetes based on organic acid profiles. J Chromatogr B. 2018;15:6–16.

    Article  CAS  Google Scholar 

  40. Wolak-Dinsmore J, Gruppen EG, Shalaurova I, et al. A novel NMR-based assay to measure circulating concentrations of branched-chain amino acids: elevation in subjects with type 2 diabetes mellitus and association with carotid intima media thickness. Clin Biochem. 2018;54:92–9.

    Article  CAS  PubMed  Google Scholar 

  41. Lu Y, Wang Y, Liang X, et al. Serum amino acids in association with prevalent and incident type 2 diabetes in a Chinese population. J Chromatogr B. 2019;9:14.

    Google Scholar 

  42. Lu Y, Wang Y, Zou L, et al. Serum lipids in association with type 2 diabetes risk and prevalence in a Chinese population. J Clin Endocrinol Metab. 2018;103(2):671–80.

    Article  PubMed  Google Scholar 

  43. Friedrich N, Budde K, Suhre K, et al. Sex differences in urine metabolites related with risk of diabetes using NMR spectroscopy: results of the study of health in pomerania. Metabolomics. 2015;11:1405–15.

    Article  CAS  Google Scholar 

  44. Cobb J, Eckhart A, Motsinger-Reif A, et al. Alpha-Hydroxybutyric acid is a selective metabolite biomarker of impaired glucose tolerance. Diabetes Care. 2016;39(6):988–95.

    Article  CAS  PubMed  Google Scholar 

  45. Tulipani S, Palau-Rodriguez M, Minarro Alonso A, et al. Biomarkers of morbid obesity and prediabetes by metabolomic profiling of human discordant phenotypes. Clin Chim Acta. 2016;463:53–61.

    Article  CAS  PubMed  Google Scholar 

  46. Zeng M, Xiao Y, Liang YZ, et al. Metabolic alterations of impaired fasting glucose by GC/MS based plasma metabolic profiling combined with chemometrics. Metabolomics. 2010;6:303–11.

    Article  CAS  Google Scholar 

  47. Butte NF, Liu Y, Zakeri IF, et al. Global metabolomic profiling targeting childhood obesity in the Hispanic population. Am J Clin Nutr. 2015;102(2):256–67.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Gao X, Zhang W, Wang Y, et al. Serum metabolic biomarkers distinguish metabolically healthy peripherally obese from unhealthy centrally obese individuals. Nutr Metab. 2016;13:33.

    Article  CAS  Google Scholar 

  49. Kim JY, Park JY, Kim OY, et al. Metabolic profiling of plasma in overweight/obese and lean men using ultra performance liquid chromatography and Q-TOF mass spectrometry (UPLC-Q-TOF MS). J Proteome Res. 2010;9(9):4368–75.

    Article  CAS  PubMed  Google Scholar 

  50. Lee A, Jang HB, Ra M, et al. Prediction of future risk of insulin resistance and metabolic syndrome based on Korean boy's metabolite profiling. Obes Res Clin Pract. 2015;9(4):336–45.

    Article  CAS  PubMed  Google Scholar 

  51. Mastrangelo A, Martos-Moreno GÁ, García A, et al. Insulin resistance in prepubertal obese children correlates with sex-dependent early onset metabolomic alterations. Int J Obes. 2016;40(10):1494–502.

    Article  CAS  Google Scholar 

  52. Newgard CB, An J, Bain JR, et al. A branched-chain amino acid-related metabolic signature that differentiates obese and lean humans and contributes to insulin resistance. Cell Metab. 2009;9(4):311–26.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Ni Y, Zhao L, Yu H, et al. Circulating unsaturated fatty acids delineate the metabolic status of obese individuals. EBioMedicine. 2015;2(10):1513–22.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Higgins JP, Thompson SG, Deeks JJ, et al. Measuring inconsistency in meta-analyses. BMJ. 2003;327(7414):557–60.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Vaishya S, Sarwade R, Seshadri V. MicroRNA, proteins, and metabolites as novel biomarkers for prediabetes, diabetes, and related complications. Front Endocrinol. 2018;9:180.

    Article  Google Scholar 

  56. Wang T, Larson M, Vasan R, et al. Metabolite profiles and the risk of developing diabetes. Nat Med. 2011;17(4):448–53.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

Download references


Not applicable.


This study was supported by the National Natural Science Foundation of China (grant number 81773891), the National Great New Drugs Development Project of China (grant number 2017ZX09301–040) and the Beijing Municipal Science and Technology Commission (grant number 2018000021223TD09, 2018–2-2242, 7194280). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

Author information

Authors and Affiliations



DY conceived of and proposed the research topic. JL and YH designed the study and performed literature screening and data extraction. JL and DY participated in data analysis. JL, YH, LW, ZY, CP and CY contributed to writing, revising and proof-reading the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Can Yan or Dan Yan.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Long, J., Yang, Z., Wang, L. et al. Metabolite biomarkers of type 2 diabetes mellitus and pre-diabetes: a systematic review and meta-analysis. BMC Endocr Disord 20, 174 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: