Using Hashimoto thyroiditis as gold standard to determine the upper limit value of thyroid stimulating hormone in a Chinese cohort
- Yu Li†1,
- Dong-Ning Chen†1,
- Jing Cui†1,
- Zhong Xin2,
- Guang-Ran Yang2,
- Ming-Jia Niu3 and
- Jin-Kui Yang2Email authorView ORCID ID profile
© The Author(s). 2016
Received: 26 February 2016
Accepted: 28 September 2016
Published: 6 November 2016
Subclinical hypothyroidism, commonly caused by Hashimoto thyroiditis (HT), is a risk factor for cardiovascular diseases. This disorder is defined as merely having elevated serum thyroid stimulating hormone (TSH) levels. However, the upper limit of reference range for TSH is debated recently. This study was to determine the cutoff value for the upper normal limit of TSH in a cohort using the prevalence of Hashimoto thyroiditis as “gold” calibration standard.
The research population was medical staff of 2856 individuals who took part in health examination annually. Serum free triiodothyronine (FT3), free thyroxine (FT4), TSH, thyroid peroxidase antibody (TPAb), thyroglobulin antibody (TGAb) and other biochemistry parameters were tested. Meanwhile, thyroid ultrasound examination was performed. The diagnosis of HT was based on presence of thyroid antibodies (TPAb and TGAb) and abnormalities of thyroid ultrasound examination. We used two different methods to estimate the cutoff point of TSH based on the prevalence of HT.
Joinpoint regression showed the prevalence of HT increased significantly at the ninth decile of TSH value corresponding to 2.9 mU/L. ROC curve showed a TSH cutoff value of 2.6 mU/L with the maximized sensitivity and specificity in identifying HT. Using the newly defined cutoff value of TSH can detect patients with hyperlipidemia more efficiently, which may indicate our approach to define the upper limit of TSH can make more sense from the clinical point of view.
A significant increase in the prevalence of HT occurred among individuals with a TSH of 2.6–2.9 mU/L made it possible to determine the cutoff value of normal upper limit of TSH.
Patients with subclinical hypothyroidism have normal level of serum thyroid hormones (T4, T3, FT3, FT4) and elevated TSH. These patienshave a higher incidence of lipid abnormalities, coronary heart disease, psychiatric disorders and pregnancy complications [1–9], although their clinical symptoms are very mild. Proper screening and treatment of these patients may help to improve the adverse outcome of the involving diseases. Therefore, to define the upper limit of TSH precisely had an important role in detecting patients who had mild thyroid dysfunction and might benefit from early intervention.
The upper limit of reference range for “normal” TSH has been the focus of debate in the recent decade. Some authors insisted on the conventional value of 4.0–5.0 mU/L as the upper limit of normal thyroid stimulating hormone (TSH) , others suggested it narrowed to 2.5–3.0 mU/L . As mentioned in the National Academy of Clinical Biochemistry (NACB) guideline, more than 95 % of normal individuals had TSH below 2.5 mU/L. There were even data showing that African-Americans had very low incidence of HT with a mean TSH level of 1.18 mU/L This value maybe the “true normal” upper limit for TSH, because African-Americans have very low prevalence of Hashimoto thyroiditis (HT) to elevate TSH .
Another question was put forward that which was the true set point of normal upper limit. Which centile (90, 95 or 99th) should be the real normal limit? The approximate questions related to glycemic threshold for diagnosing diabetes were already answered by using diabetic retinopathy (DR) as “gold” calibration standard . Referring to that and based on the correlation between HT (as end-point) and TSH value, we tried using the prevalence of HT as the calibration standard to determine the upper limit of TSH.
A total of 2856 medical staffs with age between 20 and 60 years old participated in a health examination in the year of 2013 were included in our research. Questionnaires related to medication history and other health-related behaviors were completed beforehand. Blood sampling was performed in the morning after eight hours of fasting, followed by the detection of height, weight, waist-to-hip ratio (WHR), blood pressure and physical examinations.
The study was conducted with the approval of the Ethics Committee of Beijing Tongren Hospital, Capital Medical University.
Blood samples were collected for testing thyroid function including serum free triiodothyronine (FT3), free thyroxine (FT4) and TSH, thyroid peroxidase antibody (TPAb), thyroglobulin antibody (TGAb) and other biochemistry parameters including total cholesterol (TC), triglyceride (TG), low density lipoprotein cholesterol (LDL),, fasting plasma glucose (FPG), uric acid (UA), alanine aminotransferase (ALT), aspartate aminotransferase (AST), creatinine (Cr) and carbon dioxide combining power (CO2CP). Appliance for testing the above mentioned biochemistry factors were Immunoassay Systems (Beckman Coulter, UniCel DXI800).
The diagnostic criteria for HT
The diagnosis of HT was established by a combination of presence of thyroid antibodies (TPAb and TGAb) and abnormalities of thyroid sonogram including reduced echo or diffused heterogeneity echo of thyroid with or without nodularity . Ultrasound examination of thyroid was performed by four experienced professional ultrasound physicians. Consistent diagnosis standard was used, including description for the size, nodules and echo of the thyroid.
Framingham score (FS), a 10-year estimated risk of CHD, was calculated by the equation including: age, gender, blood pressure, smoking status and cholesterol levels . FS equaling to or over 9 was considered to be a predictor for the higher CHD risk of over 5 %.
Unpaired t-test, Mann-Whitney U test or Pearson Chi-square test was used to compare the difference between cohorts with or without HT. We used two different methods to estimate the cutoff point of TSH based on the prevalence of HT: Joinpoint regression  and ROC curve by maximizing the sensitivity and specificity. All statistical analyses were conducted with the software package SPSS version 13 for windows, and ljr package of R software (http://www.r-project.org). A two-sided p value of less than 0.05 was considered to be statistically significant.
Characteristics of the observed cohort
By the exclusion standard, those with abnormal thyroid function (FT3 and FT4 out of the normal range), being pregnant, taking thyroid related medicine, accepted thyroid operation and patients with personal history of autoimmune disorders were excluded. A total number of 2856 individuals were included in the research, with the average age of 36.0. The proportion of women (75 %) was higher than men (25 %), which was consistent with the gender characteristic of the medical staff in China.
Characteristics of observed cohort divided by the participants with Hashimoto thyroiditis (HT) or without HT (non-HT)
Free triiodothyronine (pmol/l)
Free thyroid hormone (pmol/l)
Systolic blood pressure (mmHg)
Diastolic blood pressure (mmHg)
Alanine aminotransferase (IU/l)
Aspartate aminotransferase (IU/l)
Blood urea nitrogen (mmol/l)
Total cholesterol (mmol/l)
LDL cholesterol (mmol/l)
HDL cholesterol (mmol/l)
Fasting plasma glucose (mmol/L)
Uric acid (mmol/L)
No significant differences between the HT and non-HT groups were found in age, BMI, WHR, blood pressure or biochemical parameters concerning liver function, blood glucose and lipid. However, serum creatinine and uric acid were lower in HT group, which may be related to the high prevalence of HT in females, because females have relatively low level of serum creatinine and uric acid than males.
The distribution of TSH in the total population and subgroups (HT and non-HT group)
Serum TSH levels (mU/L) in total population and subgroups categorized by age
The prevalence of HT corresponding to each decile of TSH value
ROC curve describing the cutoff value of TSH in detecting participants with HT
Comparison of metabolism-related parameters between groups divided by different cutoff values of TSH
Comparison of metabolism related parameters between groups divided by cutoff values of TSH based on different method (using Joinpoint regression, ROC curve and conventional accepted value respectively)
Derivation of cutoff values
TSH < 2.9
TSH < 2.6
TSH < 4.5
To get a more convincing reference range of TSH, previous researchers used a more stringent standard for screening the reference population proposed by the NHANES III study , and more sensitive assays were used to measure TSH of those reference individuals. The upper limit of TSH was lowered to 3.6–3.7 mU/L [3, 15]. It was believed that the skewing right tail of the TSH distribution curve was caused by the inclusion of those with occult thyroid dysfunctions most representatively as HT. Besides, other factors such as gender and age are also critical to the reference range of TSH . In this study, age does not serve as an influence factor to the upper limit of TSH. The reason may be the research population was a homogeneously young cohort. Besides, It is not an iodine deficient area in Beijing. Therefore, the daily iodine intake and total body iodine status should not be a major factor to influence TSH value in our research population. We aimed to clear up the confounding influence of those individuals with HT to determine the cutoff point of TSH in our research population.
If we defined the upper limit of normal TSH as the 95th centile as adopted by previous researchers, the value would be 4.2, which could be comparable to most publications and more approximate to the traditional opinions insisting on 4.0–5.0 mU/L as the upper limit of normal TSH.
However, there were still considerable proportion of people with occult subclinical thyroid disease (mostly HT) within the upper part of the normal TSH range. Whereas we expect the normal range as a tool to screen out abnormal individuals. We tried an alternative approach to exclude the influence to TSH value confounded by individuals with HT. Based on the fact that TSH and the prevalence of HT was well correlated, we used “the prevalence of HT” as a checkout factor to determine the cutoff upper limit of TSH. Mimicking the diagnostic methodology in determining glycemic thresholds using DR as the checkout factor, we drew a curve describing the correlation between the prevalence of HT and TSH value. From this curve we got a turning point at TSH 2.9 mU/L by Joinpoint regression analysis. The prevalence of HT above this point was 14.5 % and much higher than that below the cutoff point (4.6 %). Similarly, ROC curve revealed a cutoff value of 2.6 mU/L, by which we can effectively detect HT with the best sensitivity and specificity.
These two measures showed very close results which are more close to the proposal in recent NACB guideline about narrowing the upper limit of TSH to 2.5 mU/L . As proposed by Surks , the normal range for TSH should meet the needs to categorize population into groups 1) normal, 2) those who need to be observed closely and 3) those who need medication. Although the lowered cutoff value of TSH may not be used as a treatment threshold, it can help to find more patients with occult subclinical thyroid diseases. If the limit was increased to 4.5U/L, 2.2 % of individuals with HT would be missed.
It has been indicated by several studies [17–19], that elevated TSH within the normal range was positively related to lipid abnormalities or even to the future occurring of hypertension and hyperlipidemia. Likewise, we used the lowered cutoff value of TSH to categorize population to watch if it made sense concerning the metabolism parameters among groups. As expected, people in the group with higher TSH had a much higher age, higher rate of hyperlipidemia and a higher Framingham score. When the cutoff value of 2.6 mU/L was used, it showed better results with more significant difference between groups concerning hyperlipidemia (hyper-TC, hyper-LDL and hyper-TG). Meanwhile, if a lowered upper limit of TSH was used, we can detect more patients with hyperlipidemia with less missed diagnosis. The Framingham score went higher with the increasing of TSH. This was also consistent with Asvold’s observation  and suggested increased incidence of coronary heart disease in population with SCH. Therefore, a lowered upper limit of TSH can also help to detect individuals with increased risk of ischemic heart diseases effectively.
Certainly, there were several limitations in our study, including the higher proportion of females in the research population leading to a bias of TSH value which may be caused by the higher prevalence of HT in females. Secondly, the age of the research population was no older than 60, so the conclusion may not be applied to older people.
This study shows a high prevalence of HT occurred among individuals with a TSH of 2.6–2.9 mU/L. These values are possible to the “true” values of normal upper limit of TSH for Chinese population. From the trend of higher abnormalities in metabolism-related indicators and Framingham score with the elevation of TSH, a proper cutoff value of normal upper limit of TSH will be useful in patients with mild thyroid dysfunction.
The authors thank all the participants and staff in the study.
This work was supported by grants from National Natural Science Foundation of China (81471009, 81561128015).
Availability of data and materials
The data that support the findings of this study are available from the corresponding author upon reasonable request.
YL and DNC contributed to develop research methodology, coordinate data collection, analysis and discussion, write manuscript and submission revision. CJ coordinated to collect and analyze research data. ZX GRY and MJN contributed to the discussion. YJK contributed to the design of the study, research data analysis and discussion, wrote manuscript and coordinated submission. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
The study was approved by Ethics Committee of Beijing Tongren Hospital, Capital Medical University. Study participants have signed informed consent.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Huber G, Staub JJ, Meier C, Mitrache C, Guglielmetti M, Huber P, Braverman LE. Prospective study of the spontaneous course of subclinical hypothyroidism: prognostic value of thyrotropin, thyroid reserve, and thyroid antibodies. J Clin Endocrinol Metab. 2002;87(7):3221–6.View ArticlePubMedGoogle Scholar
- Hollowell JG, Staehling NW, Flanders WD, Hannon WH, Gunter EW, Spencer CA, Braverman LE. Serum TSH, T(4), and thyroid antibodies in the United States population (1988 to 1994): National Health and Nutrition Examination Survey (NHANES III). J Clin Endocrinol Metab. 2002;87(2):489–99.View ArticlePubMedGoogle Scholar
- Kratzsch J, Fiedler GM, Leichtle A, Brugel M, Buchbinder S, Otto L, Sabri O, Matthes G, Thiery J. New reference intervals for thyrotropin and thyroid hormones based on National Academy of Clinical Biochemistry criteria and regular ultrasonography of the thyroid. Clin Chem. 2005;51(8):1480–6.View ArticlePubMedGoogle Scholar
- Knudsen N, Bulow I, Jorgensen T, Laurberg P, Ovesen L, Perrild H. Comparative study of thyroid function and types of thyroid dysfunction in two areas in Denmark with slightly different iodine status. Eur J Endocrinol. 2000;143(4):485–91.View ArticlePubMedGoogle Scholar
- Spencer CA, Hollowell JG, Kazarosyan M, Braverman LE. National Health and Nutrition Examination Survey III thyroid-stimulating hormone (TSH)-thyroperoxidase antibody relationships demonstrate that TSH upper reference limits may be skewed by occult thyroid dysfunction. J Clin Endocrinol Metab. 2007;92(11):4236–40.View ArticlePubMedGoogle Scholar
- Caturegli P, De Remigis A, Rose NR. Hashimoto thyroiditis: clinical and diagnostic criteria. Autoimmun Rev. 2014;13(4–5):391–7.View ArticlePubMedGoogle Scholar
- Asvold BO, Bjoro T, Platou C, Vatten LJ. Thyroid function and the risk of coronary heart disease: 12-year follow-up of the HUNT study in Norway. Clin Endocrinol (Oxf). 2012;77(6):911–7.View ArticleGoogle Scholar
- Almeida C, Brasil MA, Costa AJ, Reis FA, Reuters V, Teixeira P, Ferreira M, Marques AM, Melo BA, Teixeira LB, et al. Subclinical hypothyroidism: psychiatric disorders and symptoms. Rev Bras Psiquiatr. 2007;29(2):157–9.View ArticlePubMedGoogle Scholar
- Canaris GJ, Manowitz NR, Mayor G, Ridgway EC. The Colorado thyroid disease prevalence study. Arch Intern Med. 2000;160(4):526–34.View ArticlePubMedGoogle Scholar
- Brabant G, Beck-Peccoz P, Jarzab B, Laurberg P, Orgiazzi J, Szabolcs I, Weetman AP, Wiersinga WM. Is there a need to redefine the upper normal limit of TSH? Eur J Endocrinol. 2006;154(5):633–7.View ArticlePubMedGoogle Scholar
- Wartofsky L, Dickey RA. The evidence for a narrower thyrotropin reference range is compelling. J Clin Endocrinol Metab. 2005;90(9):5483–8.View ArticlePubMedGoogle Scholar
- Colagiuri S, Lee CM, Wong TY, Balkau B, Shaw JE, Borch-Johnsen K, Group D-CW. Glycemic thresholds for diabetes-specific retinopathy: implications for diagnostic criteria for diabetes. Diabetes Care. 2011;34(1):145–50.View ArticlePubMedGoogle Scholar
- Wilson PW, D'Agostino RB, Levy D, Belanger AM, Silbershatz H, Kannel WB. Prediction of coronary heart disease using risk factor categories. Circulation. 1998;97(18):1837–47.View ArticlePubMedGoogle Scholar
- Czajkowski M, Gill R, Rempala G. Model selection in logistic joinpoint regression with applications to analyzing cohort mortality patterns. Stat Med. 2008;27(9):1508–26.View ArticlePubMedGoogle Scholar
- Chan AO, Iu YP, Shek CC. The reference interval of thyroid-stimulating hormone in Hong Kong Chinese. J Clin Pathol. 2011;64(5):433–6.View ArticlePubMedGoogle Scholar
- Surks MI, Boucai L. Age- and race-based serum thyrotropin reference limits. J Clin Endocrinol Metab. 2010;95(2):496–502.View ArticlePubMedGoogle Scholar
- Asvold BO, Bjoro T, Vatten LJ. Associations of TSH levels within the reference range with future blood pressure and lipid concentrations: 11-year follow-up of the HUNT study. Eur J Endocrinol. 2013;169(1):73–82.View ArticlePubMedGoogle Scholar
- Svare A, Nilsen TI, Bjoro T, Asvold BO, Langhammer A. Serum TSH related to measures of body mass: longitudinal data from the HUNT Study, Norway. Clin Endocrinol (Oxf). 2011;74(6):769–75.View ArticleGoogle Scholar
- Wang F, Tan Y, Wang C, Zhang X, Zhao Y, Song X, Zhang B, Guan Q, Xu J, Zhang J, et al. Thyroid-stimulating hormone levels within the reference range are associated with serum lipid profiles independent of thyroid hormones. J Clin Endocrinol Metab. 2012;97(8):2724–31.View ArticlePubMedGoogle Scholar