Abstract
BACKGROUND: Prescribing of levothyroxine and rates of thyroid function testing may be sensitive to minor changes in the upper limit of the reference range for thyroid-stimulating hormone (TSH) that increase the proportion of abnormal results. We evaluated the population-level change in levothyroxine prescribing and TSH testing after a minor planned decrease in the upper limit of the reference range for TSH in a large urban centre with a single medical laboratory.
METHODS: Using provincial administrative data, we compared predicted volumes of TSH tests with actual TSH test volumes before and after a planned change in the TSH reference range. We also determined the number of new levothyroxine prescriptions for previously untreated patients and the rate of changes to the prescribed dose for those on previously stable, long-term levothyroxine therapy before and after the change in the TSH reference range.
RESULTS: Before the change in the TSH reference range, actual and predicted monthly volumes of TSH testing followed an identical course. After the change, actual test volumes exceeded predicted test volumes by 7.3% (95% confidence interval [CI] 5.3%–9.3%) or about 3000 to 5000 extra tests per month. The proportion of patients with newly “abnormal” TSH results almost tripled, from 3.3% (95% CI 3.2%–3.4%) to 9.1% (95% CI 9.0%–9.2%). The rate of new levothyroxine prescriptions increased from 3.24 (95% CI 3.15–3.33) per 1000 population in 2013 to 4.06 (95% CI 3.96–4.15) per 1000 population in 2014. Among patients with preexisting stable levothyroxine therapy, there was a significant increase in the number of dose escalations (p < 0.001) and a total increase of 500 new prescriptions per month.
INTERPRETATION: Our findings suggest that clinicians may have responded to mildly elevated TSH results with new or increased levothyroxine prescriptions and more TSH testing. Knowledge translation efforts may be useful to accompany minor changes in reference ranges.
Subclinical hypothyroidism, defined as an increase in serum thyroid-stimulating hormone (TSH) with normal levels of free thyroxine, is common. This biochemical phenotype does not have a matching clinical phenotype by which a clinical diagnosis can be recognized.1 This entity is referred to as biochemical subclinical hypothyroidism, whereby the TSH lies between the upper limit of the reference range and 10 mIU/L in individuals younger than 65 years of age.2
Controversy exists as to the merits of treating biochemical subclinical hypothyroidism.3–5 Some cross-sectional studies have reported links to a risk of vascular events in subsets of patients with this condition.6,7 However, the largest prospective studies and pooled data do not support an association with coronary events or with cardiac or total mortality.8,9 Among larger randomized controlled trials, none have shown that levothyroxine therapy in biochemical subclinical hypothyroidism meaningfully affects clinical end points.10–12 Furthermore, many patients who initially display biochemical subclinical hypothyroidism will have spontaneous normalization without intervention. 13–15 A recent guideline16 and a Cochrane systematic review17 have cautioned against routine use of levothyroxine in such cases, particularly when the TSH is below 10 mIU/L. Clinicians may be reassured about the indication for levothyroxine therapy in patients with serum TSH above 10 mIU/L, given that 97.5% of patients who are free of thyroid disease have a TSH value below this level.18 Initiation of levothyroxine in biochemical subclinical hypothyroidism is not always benign, as a substantial proportion of patients are eventually overtreated to the point of causing hyperthyroidism,19 which may have adverse clinical consequences.20,21
The normal range for TSH measurements has previously lacked standardization, but recent harmonization efforts have permitted the adoption of common, assay-independent reference ranges.22 Population studies excluding people with signs of thyroid dysfunction have suggested that the healthy upper reference limit for TSH is between 3.0 and 4.0 mIU/L.23 Expert opinion has suggested that a more accurate TSH reference range, with reduced upper limits, be implemented.18 In cases where the laboratory-specific TSH reference limit historically exceeded 4.0 mIU/L, a change to the new limit of 4.0 mIU/L effectively creates a “new” segment of the population with an abnormal TSH result, whose results would have been considered normal if interpreted in relation to the previous reference range.24 Apparent clinical effects that are purely artifacts of reference range changes were previously seen in a study of sperm counts, in which an adjustment in the reference range produced the spurious appearance of a population decline in fertility.25 A similar effect on vitamin D deficiency rates was seen with a change in reference definitions of normality.26
We hypothesized that lowering the upper limit of the reference range for TSH (without changing the assay) would lead to an increase in the number of patients receiving levothyroxine, reflecting the tendency of clinicians to offer more levothyroxine prescriptions solely on the basis of a new biochemical abnormality. We also hypothesized that this increase in levothyroxine treatment would result in a secondary increase in the use of thyroid laboratory testing as clinicians began to follow up borderline results or new treatments.
Methods
Study setting
Our sampling frame was the Calgary site of Alberta Precision Laboratories, which is the sole provider of laboratory tests to the Calgary region (estimated catchment area 1.4 million persons) as part of the government-provided health system.
On May 1, 2014, our laboratory changed the upper limit of the reference range for TSH from 6.0 to 4.0 mIU/L as part of a provincial initiative to harmonize reference ranges across laboratories in Alberta. The previous upper reference limit had been unchanged in the preceding 20 years. We determined the monthly rate of TSH testing by the Calgary laboratory in the 4 years before the change in upper reference limit for comparison with the frequency of TSH testing in the 12 months after the change. Because of underlying seasonality in laboratory test volumes,27 we used historic test volumes to predict TSH volumes using an interrupted time-series approach with the Holt–Winters additive model, presented as stationary R2 with a mean absolute percentage of error. This model is especially useful in assessing changes in laboratory test-ordering volumes over short periods (< 1 yr) because it controls for both the underlying slope of change in volumes that may be associated with an aging population or with population growth and the marked but predictable monthly variation in test volumes.28 Using this method, our team has previously investigated the impact of changes in our laboratory’s test-ordering practices at monthly time scales.29,30
Data sources
To assess the possible change in physician prescribing behaviour related to the change in upper reference limit for TSH, we interrogated the Alberta Health Pharmaceutical Information Network to determine the number and dose of levothyroxine prescriptions recorded in Calgary Zone city pharmacies for the 6-month period from May 1 to Oct. 31 in each of 3 calendar years (2011, 2012 and 2013, when the historical upper limit of the TSH reference range was in effect) for comparison with the same time frame immediately after implementation of the new upper limit in 2014. The Pharmaceutical Information Network is a provincial program that requires all pharmacists working in community locations to report drug dispensing data to the Alberta Ministry of Health for inclusion in a central database. This database has been updated on a daily basis since 2007 and has been estimated to capture 95% of the drugs dispensed outside of the hospital setting, regardless of patients’ insurance status31 (unpublished electronic health record usage statistics, Alberta Ministry of Health, 2016). We subdivided levothyroxine prescriptions after May 2014 into 2 groups: new prescriptions (for patients with no thyroxine prescriptions recorded in the previous 12 mo) and dose increases (for patients with a stable, lower-dose thyroxine prescription recorded repeatedly for at least 24 mo before the date of the upper limit change).
Statistical analysis
To examine changes in the rates of new prescriptions (with 95% confidence intervals [CIs]) within and between each year, we used a Poisson regression model with the number of new prescriptions as the outcome variable and the month (May to October) and year (2011 to 2014) as the predictor variables. We included the population in each year as an offset, to allow estimation of the rate of new and also higher-dose prescriptions per 1000 population. We examined the consistency of monthly prescribing patterns over study years with the likelihood ratio statistic to assess for evidence against overdispersion. We estimated predicted rates from the model with 95% CIs and performed statistical calculations using Stata 11.0 (StataCorp) with p < 0.05 set as the level of statistical significance.
Ethics approval
This study was approved by the Calgary Regional Health Ethics Board (approvals REB13-1271 and REB13-0459).
Results
Over the study time frame, between 45 000 and 65 000 TSH tests were performed each month at the Calgary laboratory (Figure 1). In the 4-year period before May 2014, the actual and predicted monthly test volumes followed a tightly correlated pattern. The high volume of TSH tests and the stability of test numbers over 4 years strengthens this correlation. However, in the year after the change in reference range, a sharp and persistent separation emerged, with actual TSH test volumes exceeding those predicted by 7.3% (95% CI 5.3%–9.3%), or between 3000 and 5000 extra tests per month, as shown graphically in the Holt–Winters time-series model in Figure 1. The time-series model showed a good fit with the data, with stationary R2 of 0.74 and mean absolute percent error of 2.62 (where a mean absolute percent error less than 10 generally indicates a highly accurate forecast model).
Within the year after the change in reference limit, the prevalence of TSH-defined biochemical subclinical hypothyroidism increased threefold. From January to April 2014, there were 198 797 TSH tests, of which 6611 had results above 6.0 but less than 10.0 mIU/L, for a prevalence of biochemical subclinical hypothyroidism of 3.3% (95% CI 3.2%–3.4%). From May to December 2014, there were 315 432 TSH tests, with 28 632 having a result greater than the new upper reference limit of 4.0 mIU/L but less than 10.0 mIU/L, for a prevalence of 9.1% (95% CI 9.0%–9.2%).
There was a significant difference in the rate of new levothyroxine prescriptions per 1000 population over the 4 years of the study, potentially related to the change in the upper limit of the reference range. For the 6-month period of May to October in each year from 2011 to 2013, new prescriptions of levothyroxine remained stable. However, after the upper limit change, the number of new prescriptions dispensed rose by 25.3%, from 3.24 (95% CI 3.15–3.33) per 1000 population in 2013 to 4.06 (95% CI 3.96–4.15) per 1000 population in 2014 (Table 1).
The numbers of new and increased-dose prescriptions are shown in Figure 2 and the rates of levothyroxine dose increases per 1000 population in Figure 3, according to month of prescription. There was no significant difference in the rate of increased-dose prescriptions across the years 2011 to 2013 (p = 0.1) (Figure 3A), so these data were collapsed for comparison with the 2014 data (Figure 3B). There was a significant interaction between month and year (2 categories, p < 0.001; Figure 3A). This interaction is further illustrated in Figure 3B, where for May alone, there was no significant difference between 2014 and 2011 to 2013 combined (p = 0.1), but for every successive month in 2014 there was a sharp increase in dose escalations, followed by a persistent separation from most corresponding months in the previous years (p < 0.001 for between-year comparisons for all months except August). In absolute terms, the median number of new or increased-dose thyroxine prescriptions per month increased from 1297 (95% CI 1244–1339) in the years before the reference change to 1804 (95% CI 1614–1923) after the reference change (Figures 2 and 3).
Interpretation
We used administrative health data to study how a change in laboratory policy affected patient care and potentially health system costs. Following a minor change in the upper limit of the reference range for TSH, the resultant 7.3% increase in TSH testing added 3000 to 5000 tests per month at our laboratory and about 500 new thyroxine prescriptions per month. In a setting with a population of more than 1 million, our observed increase in new levothyroxine prescriptions and dose escalations for those already taking levothyroxine means that this reference change could have important clinical and economic implications.
We included a large number of TSH tests (about 3 million) to determine population trends. With access to data from the single provider of laboratory tests within our health care system, we were able to accurately capture all tests done within the population across several years before and after the change in reference range. The use of provincial pharmacy data also allowed accurate and comprehensive capture of population trends in the prescribing of levothyroxine.
Population-based studies defining the TSH upper limit support the decision in Calgary to reduce the upper limit of the reference range from 6 to 4 mIU/L.18 However, this does not imply that treating a patient whose TSH falls between 4 and 6 mIU/L with levothyroxine has proven merit or avoids harm.32 Several studies of the natural history of untreated biochemical subclinical hypothyroidism show low rates of conversion to overt hypothyroidism (i.e., raised TSH with frankly low free thyroxine). Indeed, one-third to one-half of patients in the cohorts studied reverted to normal TSH over 2 to 5 years of observation.13–15,33
More than a decade ago, investigators at the Mayo Clinic predicted that reducing the TSH upper limit from 5 to 3 mIU/L would lead to a fourfold rise in the “incidence” of hypothyroidism which, in their cohort, would constitute 20% of test reports.24 A recent population study from Copenhagen suggested that primary care providers were increasingly likely (compared with past years) to offer thyroxine therapy for low degrees of abnormality in TSH, often after just a single measurement.34 We have added to this observation by analyzing the actual population-level consequences of an evidence-based refinement in a laboratory test, which lowered the upper limit of the TSH reference range by 2 mIU/L. This change “instantly” tripled the cohort of patients who could be classified as having biochemical subclinical hypothyroidism. All of these “newly diagnosed” patients would have been considered biochemically euthyroid the day before the change in the reference range. Data from the United Kingdom have shown that thyroxine is the second most commonly prescribed medication in primary care.35 Our study suggests that this may be driven by subtle and nonspecific abnormalities in exquisitely sensitive laboratory tests among patients with possibly nonspecific or no symptoms.36 As such, one unintended consequence of setting a scientifically accurate, narrow reference range for a common laboratory test is a probable marked increase in prescribing behaviour where subtle deviations from “normal” are taken to indicate the presence of disease.
Changing the TSH reference range may have several ramifications. Laboratories making such a change should consider using a specific knowledge translation intervention to help prescribers understand the potential pitfalls that may accompany even minor changes in reference ranges. For example, patients who may not need treatment and whose condition may spontaneously normalize without intervention may be classified as having a “disease.” Patients may also be inconvenienced by requests for more laboratory tests, additions to their individual pill burden and the need to seek more frequent medical follow-up, all of which would be expected to increase the costs of medical care.
Limitations
The limitations of our study include the usual limitations of data derived from administrative databases. For example, our analysis of TSH testing did not directly link individual patients’ results to dispensation of a new prescription for or increased dose of levothyroxine. It is possible that some patients filled prescriptions outside of the Calgary Zone, and these would not have been captured in the pharmacy database. We cannot account for possible patient migration into or out of the Calgary Zone for either blood tests or filling prescriptions. However, our use of 3 prior years of data for both laboratory testing and prescriptions helped to establish the lack of significant population variation in either measure before the date of the reference change, increasing the plausibility of the hypothesized relationship between the change in the reference range and the observed changes in laboratory utilization and prescribing behaviour.
Clustering of effects because of variation in physicians’ prescribing and test-ordering behaviour could not be detected to determine whether the observed trends reflected widespread practice or were driven by a select few. However, with 3 million TSH measurements for a population of about 1.4 million, it is unlikely that a small subset of providers could explain the population trends that we observed.
We were unable to exclude pregnant women from our analysis. For them, trimester-specific TSH reference ranges may be more appropriate. Studying thyroid testing and thyroxine prescription in pregnancy were not objectives of this study.37 However, we consider it unlikely that the relatively small numbers of women in a pregnancy testing cohort would substantially confound the results in this much larger one.
Finally, we were unable to ascertain whether the patients in our data set had any compelling clinical indications for levothyroxine therapy.
Conclusion
A minor lowering of the upper limit of the TSH reference range resulted in a substantial increase in laboratory test use, and possibly unnecessary levothyroxine prescribing and designation of patients as having subclinical hypothyroidism. Knowledge translation efforts are important for users of the medical biochemistry laboratory when a reference range is changed. Collaboration between clinical chemists and physicians is essential to ensure consideration of all potential outcomes. Efforts to improve both clinicians’ and patients’ knowledge about subclinical hypothyroidism are warranted.
Footnotes
Competing interests: None declared.
This article has been peer reviewed.
Contributors: Christopher Symonds conceived of the project, contributed to the analysis and wrote the first draft of the manuscript. Gregory Kline co-wrote the first draft and generated the graphic displays. Inelda Gjata conducted the data gathering from the provincial Pharmaceutical Information Network. Marianne Rose performed the statistical analysis. Maggie Guo collected the data and constructed the laboratory database. Lara Cooke supervised the project construction and assisted in the ethics approval application and the primary analysis. Christopher Naugler performed statistical analysis of the laboratory data. All of the authors revised the manuscript for important intellectual content, approved the final version for publication and agreed to act as guarantors of the work.
Funding: This project was funded by the Physician Learning Program, a joint venture of the University of Alberta, the University of Calgary and the Alberta Medical Association.
Data sharing: The data used for this study are not publicly available because of privacy or ethics restrictions.
- Accepted March 17, 2020.