Routinely collected data (RCD) are increasingly used for biomedical research. Extensive resources have been invested in this field: they include the set-up of disease registries and clinical databases at regional, national or international levels; the promotion of the use of electronic health records; and making use of wearable devices for the collection of health data. Analyses of these data can yield descriptive estimates (prevalence or incidence of disease, treatments and risk factors), associations with putative risk factors, and treatment effects of interventions (e.g., drugs, surgery, psychotherapy or medical devices).
Although descriptive estimates and associations offer interesting information, treatment effects are most important for clinical decision-making. They are the core of comparative effectiveness research. In this article, we focus primarily on RCD for determining treatment effects, because they are increasingly considered mainstream options for building evidence on treatment choices. The promises and hype of personalized medicine (or precision medicine, predictive medicine, participatory medicine, 4P or stratified medicine) are similarly fueled by the widespread use of RCD. We do not use these terms here, because these promises face the same major challenges as traditional comparative effectiveness research, and to an even higher degree, because they aim to identify the best options for single patients or small subgroups rather than larger populations. In this overview, we contrast the expectations many have of the use of RCD versus their limitations, discuss which expectations can be met and suggest potential changes in the research agenda for RCD.
Main strengths and weaknesses of routinely collected data
Big data studies with enormous sample sizes, or real-world analyses presented as near-perfect representations of routine care, fuel tremendous expectations for RCD in clinical decision-making. Such extremes amplify both the strengths and the weaknesses of this research, and the traditional limitations of observational research remain. The weaknesses may be greatly magnified by challenges specific to the very nature of data not collected for the purpose of research (e.g., additional biases or errors occurring when gigantic datasets have to be assembled, cleaned, processed, linked and retrospectively analyzed).
In theory, RCD have several advantages. Data collection under real-world circumstances maximizes representativeness and generalizability, minimizes costs and effort, and allows the capture of information in large populations and many clinical events in large datasets that are continuously updated and cover long periods.
However, these theoretical advantages should be viewed cautiously. First, many RCD are collected in situations where populations, diseases, settings and/or interventions are not representative (e.g., when data are collected in tertiary referral hospitals or in health care systems where the population or use of specific interventions are selected by ability to pay or other filters). Evaluation of newly approved drugs may be difficult because there are few existing routine data, and barriers to access innovative drugs may create strong confounding by indication. Second, costs are not necessarily low in all cases (e.g., many hospitals and health care systems make large investments in infrastructure and maintenance because of the increasing popularity of electronic health records). Fragmentation of efforts escalates cost compared with centralized systems that include all health care facilities in a country (e.g., the health care system in Taiwan1). Third, large sample sizes without thorough analytical safeguards can produce highly precise but biased results, yielding confident false-positive and false-negative conclusions.
The observational nature of RCD is an inherent limitation for the study of treatment effects. Which treatment is chosen depends on various known (e.g., severity of disease) or unknown factors that may be associated with the outcome. Such confounding by indication can invalidate real-world observations. Multiple statistical methods are used to reduce these biases (e.g., propensity scores and instrumental variables analyses),2,3 but only properly designed randomized controlled trials (RCTs) can pre-emptively overcome such biases.
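To make the problem of confounding by indication concrete, the following is a minimal simulation sketch, not an analysis of any real RCD source: all variable names, parameter values and the data-generating model are illustrative assumptions. It shows how a naive treated-versus-untreated comparison is biased when sicker patients are treated more often, and how inverse-probability-of-treatment weighting based on a propensity score can reduce (though never guarantee the removal of) that bias when the confounder is measured.

```python
# Illustrative simulation of confounding by indication and a propensity-score
# correction (inverse-probability-of-treatment weighting, IPTW).
# All names and values are hypothetical assumptions for this sketch.
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

# Measured confounder: disease severity drives both treatment choice and outcome.
severity = rng.normal(size=n)
treat = (rng.random(n) < 1 / (1 + np.exp(-1.5 * severity))).astype(float)

# True treatment effect is 1.0; severity independently worsens the outcome score.
outcome = 1.0 * treat + 2.0 * severity + rng.normal(size=n)

# Naive comparison: biased, because treated patients are sicker on average.
naive_effect = outcome[treat == 1].mean() - outcome[treat == 0].mean()

# Fit a logistic propensity model P(treat | severity) by gradient descent.
X = np.column_stack([np.ones(n), severity])
beta = np.zeros(2)
for _ in range(3000):
    p = 1 / (1 + np.exp(-X @ beta))
    beta -= 0.5 * X.T @ (p - treat) / n
ps = 1 / (1 + np.exp(-X @ beta))

# Hajek (normalized) IPTW estimate of the average treatment effect.
w1, w0 = treat / ps, (1 - treat) / (1 - ps)
iptw_effect = (w1 * outcome).sum() / w1.sum() - (w0 * outcome).sum() / w0.sum()
```

The weighted estimate recovers approximately the true effect here only because the single confounder is measured and correctly modeled; with unmeasured or mismeasured confounders, as is common in RCD, no such adjustment can guarantee validity.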
Multiple errors and biases may interfere with routine data collection and processing (e.g., data linkage problems, misclassification bias and underreporting).3,4 This further reduces the validity of RCD. Additional steps, such as manual reviews of patient records, are sometimes incorporated to improve the quality of the data. However, this adds to the cost and does not solve misclassification problems that occur when risk exposures and/or outcomes are ascertained in a nonstandardized way and when coding practices differ. Differences in management practice within and across institutions can reflect differences in several other confounding factors (e.g., disease severity).
Studies of RCD or better RCTs?
To understand how best to use RCD for health care decision-making, we should revisit the limitations of RCTs (the gold standard for studying treatment effects) and ask whether overcoming these limitations requires a better RCT agenda or the use of RCD.
Generalizability and real-world relevance of clinical studies, in particular those that are used for drug approval, are often limited by narrow inclusion and exclusion criteria,5 and trial participants may have different characteristics than non-participants. Trials are frequently conducted under artificial conditions that differ from routine care (e.g., use of run-in periods, structured follow-up visits or standardized cotreatments). Certain populations are frequently underrepresented in RCTs, including children, women, older adults or patients with comorbidities and polypharmacy.6–10 Drug–drug interactions or adverse effects occurring in routine care may be overlooked. Cost considerations prohibit large studies that would be informative for subgroup-specific effects.
Some of these deficiencies may be best solved by improving the RCT agenda rather than turning to RCD. For example, the cost of RCTs can be reduced substantially, allowing very large sample sizes and better representativeness of the enrolled populations, if simple, pragmatic megatrials are adopted and RCD are used for collecting outcome information.11,12 Nevertheless, such megatrials are uncommon, and thus observational RCD studies are used to fill the evidence gap. For uncommon conditions, even megatrials would include too few patients to inform on outcomes in such subgroups. Studies using RCD can reach sample sizes that are 100- to 1000-fold larger than those of large trials. However, the planning and reporting of claims of subgroup differences in clinical research have been dismal, and most claims are not validated.13 For example, it remains unknown whether the treatment effect suggested by RCD studies involving patients over 80 years of age who have modest renal impairment and hypertension and are taking three other drugs would be more reliable than the average treatment effect suggested by an RCT that involved patients with none or few of these characteristics.
Given the limited funds for RCTs, many important health care questions are not studied. Such evidence gaps could be addressed by a better RCT research agenda that prioritizes the use of pragmatic, patient-important outcomes14 and relevant head-to-head comparisons.5,15,16 Some comparative effectiveness evidence may also be accommodated by network meta-analyses of RCTs.5,15,17 However, even then, an exhaustive evaluation of treatment effects on mortality and other patient-important outcomes (including major harms) with RCTs alone is unrealistic. Here, RCD could fill many evidence gaps. One may then decide that the RCD evidence is strong enough to lead to policy or guideline changes, or the RCD evidence may be used to guide the design of future RCTs. There are also situations where conducting RCTs would be unrealistic or perceived as unethical.18
Randomized controlled trials currently differ from RCD studies in many features besides randomization. Many of the features that improve the validity of RCTs, either directly or indirectly, may also contribute to the perceived practical disadvantages of this type of research. For example, the regulatory requirements that must be fulfilled before a trial can start are often cumbersome.19 These requirements are a direct result of the experimental nature and ethical implications of randomization.20 They include thorough reflections about the intended purpose of the research to justify randomization, study protocols clearly stating assumptions, hypotheses and calculations of sample size, and submission of protocols to regulatory authorities. Working in large collaborative groups of researchers with various backgrounds, and exchanging with involved stakeholders, ethics committees or data safety monitoring boards, generates feedback loops that may improve initial RCT research plans.
Most of these steps are often not undertaken for RCD research. Some of the perceived practical advantages of RCD studies may actually be limitations. Available datasets may be rapidly analyzed by small teams or a single researcher. Studies of RCD are often vastly overpowered, so that even tiny effects reach nominal statistical significance.21 Post hoc explanations are easily invoked, increasing confidence in spurious findings.22,23 Results can remain unpublished, or may be published selectively depending on the plausibility of explanations, preconceived hypotheses, commercial interests or the researcher's personal need for scientific reward.
In Table 1, we summarize some of the limitations of current RCTs, beginning with those that may be the most amenable to improvement of the current RCT agenda. We list ways to bypass these limitations with RCD and highlight residual caveats of RCD studies.
The status quo of routinely collected data
We recently conducted an empirical analysis of how RCD studies try to complement RCTs to understand treatment effects.24 We assessed 337 RCD studies that investigated the comparative effectiveness of medical treatments on mortality. Seventy percent of these studies were incremental research that supplemented existing RCTs but did not fill fundamental knowledge gaps (i.e., questions never evaluated in RCTs). In only six (1.8%) of these RCD studies did the authors state that conducting RCTs on their research topic would be unethical, and in only 18 (5.3%) did they state that it would be difficult. Typically, investigators conducting the RCD studies reasoned that RCT results had limited generalizability (37.6%), did not adequately address specific outcomes (31.9%) or certain populations (23.5%), or were inconclusive or inconsistent (25.8%).
Most RCD studies focus on questions that have been addressed by RCTs or could be definitively addressed by RCTs.24 Agreement between the results of such RCD studies and the results of the RCTs offers some incremental reassurance, but the benefit for clinical decision-making is limited or nonexistent. When RCTs and observational studies disagree,25 the situation becomes complicated. Much of the interpretation of inconsistent results between such sources of evidence is currently a case-by-case discussion. Eventually, residual bias owing to nonrandomization or the artificial RCT setting may be used as arguments for almost any disagreement. Consensus becomes difficult to reach.
In areas without evidence from RCTs, studies of RCD may provide the only guidance on a critical health care question, albeit with recognizable limitations. Policy or guideline changes based on RCD should acknowledge the limitations of RCD, and strategic plans should be in place to monitor the clinical impact of these changes. Unfortunately, current RCD studies do not focus on the large numbers of critical health care questions that lack evidence from RCTs.24 For example, comparisons of drug and nondrug treatments, and evaluations of inexpensive drugs, are lacking. Evidence from RCD studies would be useful in providing answers to these vital questions.
Changes in the RCD research agenda and practices
Overall, expectations about the utility of RCD studies for understanding treatment effects are probably overestimated. We discuss what improvements can be made in RCD studies and what resources would be required (Table 2).
Selecting priorities
In selecting research questions, prior evidence must be systematically reviewed; another study or analysis may not even be necessary. Routinely collected data studies should focus more on questions that have not been addressed, or that are difficult or impossible to address with other study designs.
Protocols and prespecification
Research using RCD may or may not use explicit protocols and prespecified analyses. It is important to know what was not prespecified. Exploratory analyses should be described as such; they need further prospective validation with protocol-based, prespecified studies. Wherever prespecification is not feasible, transparent and complete documentation of the conduct of the study is still useful. The validity of RCD and their proper interpretation can be improved by using falsification end points (negative controls of known null associations),27 validation datasets28 and prespecified rules for when the study hypotheses should be considered confirmed or rejected.
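The idea of a falsification end point can be sketched in a few lines. The following is an illustrative simulation, not a real analysis: all names, values and the flagging threshold are hypothetical assumptions. A negative-control outcome is one the treatment is known not to affect; if the same naive analysis pipeline nevertheless "detects" an association with it, residual confounding in that pipeline is likely.

```python
# Illustrative falsification end point (negative-control outcome) check.
# All names, values and thresholds are hypothetical assumptions.
import numpy as np

rng = np.random.default_rng(1)
n = 100_000

# Confounder: sicker patients are treated more often.
severity = rng.normal(size=n)
treat = (rng.random(n) < 1 / (1 + np.exp(-1.5 * severity))).astype(float)

# Negative control: by construction, treatment has NO effect on this outcome,
# but the confounder does.
control_outcome = 2.0 * severity + rng.normal(size=n)

# Run the same naive comparison the main analysis would use.
naive_control_diff = (control_outcome[treat == 1].mean()
                      - control_outcome[treat == 0].mean())

# Crude prespecified falsification rule: flag the pipeline if the known-null
# association exceeds an arbitrary tolerance (0.1 here, chosen for illustration).
confounding_suspected = abs(naive_control_diff) > 0.1
```

In this sketch the check correctly flags the naive pipeline, because the confounder pushes the known-null association well away from zero; a pipeline that passes such a check is not proven unbiased, but one that fails it should not be trusted.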
Registration
Registration of RCD studies that have prospective design and/or analysis elements and explicit protocols would help shape a more efficient research agenda and reduce selective reporting of methods and findings. For explorative research, it may be best to register datasets; this would facilitate planning a concerted research agenda, data-sharing activities and using datasets for validation.29,30
Reporting
Incomplete or unusable reporting wastes research resources.31 Studies using RCD are often poorly reported.32 Recently, the RECORD (REporting of studies Conducted using Observational Routinely-collected health Data) statement was published,33 which aims to improve the reporting quality specifically of observational RCD studies by providing an extension to the STROBE (STrengthening the Reporting of OBservational studies in Epidemiology) statement.34 In addition to transparent reporting, the results need to be embedded in a systematic review of the available evidence. Journals, peer reviewers, funders and authorities can help to improve the reporting quality of RCD studies.
Access to raw data
Lack of access to raw data makes it impossible to independently assess analytic errors and biases, and limits opportunities for joint analyses. Facilitated availability of different datasets would support external validation and improve standardization and efforts to enhance quality. Patients should be asked for explicit consent up front for prospective data sharing of RCD, as is required for RCTs. The misleading view that health information is not really protected data when it is routinely collected creates serious problems.35 Consent issues would be best decided during database building. Data deidentification should also be carefully planned.
Research networks
Large research networks can foster the joint use of RCT and RCD datasets. Research networks may be in the best position to face the challenges involved in establishing harmonized/standardized research. This includes outcome definitions (e.g., by developing and validating universally accepted lists of diagnostic codes for specific outcomes), time points of outcome assessments, risk exposures to be analyzed, subgroup analyses to be explored, and predetermined effect sizes and other criteria for clinically significant outcome differences. Standardized guidance can be developed for organizing and implementing data sharing. Collaborators with various levels of expertise and backgrounds would provide diverse perspectives to maximize research applicability.
Research on research
More research on the reliability of RCD results is necessary (e.g., on the performance of approaches to deal with confounding by indication, such as propensity scores, instrumental variables or the use of falsification end points). Compared with RCTs, there is little empirical guidance on the interpretation of RCD evidence. We need to develop a better understanding of and tools for assessment of risk of bias, generalizability and data validity.
Conclusion
Research using RCD is becoming increasingly popular, but its limitations should not be understated. Several of the improvements suggested here may increase the utility of this research, though they would require additional resources. Studies using RCD should be prioritized for situations where RCTs cannot be conducted; even then, they must be interpreted with caution.
KEY POINTS
Routinely collected data (RCD) are increasingly used for biomedical research; however, their utility for understanding treatment effects is probably overestimated.
Many of the perceived advantages of RCD should be viewed cautiously, because of the inevitable biases of observational research and specific biases due to the nature of these data.
Improvements may increase the utility of RCD but require resources for implementation; they include improvements in research priority setting, transparency of data and protocols, and collaborative research networks.
Although many evidence gaps may be better addressed by an improved randomized controlled trial (RCT) agenda, RCD studies may be required in situations where RCTs are difficult or impossible to perform; interpretation of these studies should be cautious.
Footnotes
See also www.cmaj.ca/lookup/doi/10.1503/cmaj.160410, www.cmaj.ca/lookup/doi/10.1503/cmaj.151470 and CMAJ Open article www.cmajopen.ca/content/4/2/E132
Competing interests: None declared.
This article has been peer reviewed.
Contributors: Lars Hemkens wrote the first draft of the article. All of the authors contributed to the writing and editing of the manuscript, revised it critically for intellectual content, approved the final version to be published and agreed to act as guarantors of the work.