Summary
A well-planned randomized controlled trial (RCT) is the optimal study design to determine whether a novel surgical intervention differs from a prevailing one. Traditionally, when we want to show that a new surgical intervention is superior to a standard one, we analyze data from an RCT to see if the null hypothesis of “no difference” (i.e., that the 2 surgical interventions have the same effect) can be rejected. A noninferiority RCT design seeks to determine whether a new intervention is not worse than a prevailing (standard) one within an acceptable margin of risk or benefit, referred to as the “noninferiority margin.” In the last decade, we have observed an increase in the publication of noninferiority RCTs. This article explores this type of study design and discusses the tools that can be used to appraise such a study.
A well-planned randomized controlled trial (RCT) is the optimal study design to determine whether a novel surgical intervention differs from a prevailing one. Traditionally, when we want to show that a new surgical intervention is superior to a standard one, we analyze data from an RCT to see if the null hypothesis of “no difference” (i.e., that the 2 surgical interventions have the same effect) can be rejected.1,2 Let’s consider a hypothetical RCT comparing laparoscopic with open appendectomy in which the measured outcome is a pain score on a Likert scale from 0 to 10. Suppose the mean pain score was 7 points following laparoscopic appendectomy and 8 points following open appendectomy, and that this 1-point difference was statistically significant. Such a result would be uncommon because it would require a large sample size, but let’s accept it for now. Although the result is statistically significant, we would not consider this 1-point difference clinically relevant. This type of thinking addresses the concept of minimum clinically important difference (MCID), which describes a threshold that might persuade us to change our surgical practice. A meaningful MCID is usually based on the best available evidence derived from previous systematic reviews, pilot/feasibility studies or clinical judgment informed by discussion with experts in the field.
In another hypothetical RCT, the length of stay (LOS) after laparoscopic appendectomy was observed to be 24 hours versus 30 hours after open appendectomy, with p < 0.05. It would be meaningless to conclude that the observed 6-hour difference reflects the truth without reporting a confidence interval (CI), as a p value alone provides no information on the degree of uncertainty (variation) in the measured difference in hospital stay.3 Briefly, a CI quantifies the uncertainty associated with the observed 6-hour difference in hospital stay; it is within the CI that the true difference will likely lie. Let’s say that in our hypothetical example the 95% CI for the 6-hour difference in hospitalization time was 1–11 hours in favour of the laparoscopic approach. This means we are 95% confident that the true difference lies somewhere between 1 and 11 hours, a wide interval that makes any definitive conclusion uncertain.
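For readers who wish to check the arithmetic, the normal-approximation CI for a difference in means can be sketched as follows. The group sizes and standard deviations below are assumed for illustration only (they were chosen so the interval matches the hypothetical 1–11 hour CI); the trial in this example is fictitious.

```python
from math import sqrt

def mean_diff_ci(m1, s1, n1, m2, s2, n2, z=1.96):
    """Normal-approximation 95% CI for the difference in means (m2 - m1)."""
    diff = m2 - m1
    se = sqrt(s1**2 / n1 + s2**2 / n2)
    return diff - z * se, diff + z * se

# Hypothetical data: LOS 24 h (laparoscopic) vs 30 h (open),
# SD 9 h, 25 patients per arm (all numbers assumed for illustration).
lo, hi = mean_diff_ci(24, 9, 25, 30, 9, 25)
print(f"95% CI for the 6-hour difference: {lo:.1f} to {hi:.1f} hours")
```

Note how a statistically significant difference (the interval excludes zero) can still be too imprecise to support a confident clinical conclusion.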
In the last decade, we have observed an increase in the publication of noninferiority RCTs. This article explores this type of study design and discusses the tools that can be used to appraise such a study.
Clinical scenario
At the last cardiac surgery weekly academic rounds there was a heated exchange between 2 surgeons, who were arguing the merit of ex-vivo heart perfusion compared with cold storage as a means of preserving donor hearts before transplantation. To resolve this dilemma, the division head has assigned you, the newest member of the division, with the task of finding the best evidence to answer this clinical question and report your findings to the group at next week’s rounds.
Finding the evidence
To identify the best evidence and inform your colleagues you begin by conducting a literature search according to the “Users’ guide to the surgical literature: how to perform a high-quality literature search.”4 You follow the PICOT format, which serves as the starting point for identification of important key words used in the search process:5
Population: heart transplant patients
Intervention: heart transplantation with ex-vivo perfusion
Comparison: heart transplantation with cold storage
Outcome: patient survival and graft survival
Time horizon: 30 days after transplant
You then conduct a literature search in PubMed Clinical Queries using the search terms “heart transplantation” AND “ex-vivo perfusion” AND “cold storage,” using the “Therapy” and “Broad” filters. You identify 10 articles: 7 ex-vivo human/animal studies,6–12 1 nonrandomized clinical study,13 1 review14 and 1 RCT.15 The RCT addresses your research question and has the benefit of being level-I evidence.16 However, when reading the article, you are perplexed that it is labelled as a “randomized noninferiority trial.”
Noninferiority RCT design
A noninferiority RCT design seeks to determine whether a new intervention is not worse than a prevailing (standard) one within an acceptable margin of risk or benefit, referred to as the noninferiority margin.17–20 It is usually assumed that the standard intervention has been shown to have a better (superior) clinical effect than a placebo or an earlier intervention. A noninferiority design is typically chosen when the new intervention offers other advantages, such as reduced costs, fewer adverse effects (harm), less invasiveness or greater convenience. In trials that investigate noninferiority, the null hypothesis is not symmetric: the new intervention is declared noninferior if its effect is not worse than that of the standard intervention by more than the margin of noninferiority for a specified outcome measure. If the new intervention is found to be superior, that is an additional benefit. Tests of noninferiority should be linked to the predefined noninferiority margin and a predefined α. An α of 0.025 for a 1-sided noninferiority hypothesis is equivalent to a 1-sided 97.5% CI, just as an α of 0.05 for a 2-sided hypothesis is equivalent to a 2-sided 95% CI.17
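The equivalence between the 1-sided test at α = 0.025 and the upper limit of a 2-sided 95% CI can be verified numerically. The sketch below uses purely illustrative values (an assumed observed difference d, standard error se and margin delta, none drawn from any trial), with the convention that larger differences favour the standard treatment.

```python
from statistics import NormalDist

# A 1-sided noninferiority test at alpha = 0.025 rejects "inferiority"
# exactly when the upper limit of the 2-sided 95% CI for the difference
# (new minus standard; higher = worse) falls below the margin.
# d, se and delta below are assumed values for illustration only.
d, se, delta = 1.0, 0.8, 3.0

z = NormalDist().inv_cdf(0.975)                  # ~1.96
upper_95 = d + z * se                            # upper limit of 2-sided 95% CI
one_sided_p = 1 - NormalDist().cdf((delta - d) / se)

print(round(upper_95, 2), round(one_sided_p, 4))
assert (upper_95 < delta) == (one_sided_p < 0.025)
```

The final assertion holds for any d, se and delta, which is why reporting the CI and reporting the 1-sided p value convey the same noninferiority decision.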
Suppose the hospital administrators would like to adopt this new surgical approach to expedite surgical patients’ hospital discharge, provided it is proven noninferior to the standard procedure within a 3-hour noninferiority margin.
Figure 1 presents some possible scenarios for noninferiority trials observing mean differences in hospital stay following laparoscopic and open approaches. Scenario A shows that laparoscopic surgery is superior to open surgery, as the CI lies to the left of the no difference line (zero). In scenario B, the CI includes the threshold of noninferiority. This means that noninferiority is not shown, as the true difference in hospital stay could be worse than the 3-hour predefined noninferiority margin for laparoscopic surgery. Scenarios C and D show that laparoscopic surgery is noninferior to open surgery because the upper confidence limit lies to the left of the 3-hour noninferiority margin in hospital stay. Scenario D shows that laparoscopic surgery is definitely noninferior to open surgery, as the CI lies to the left of the noninferiority margin and also excludes the zero line of no difference. In scenario E, laparoscopic surgery is definitely inferior to open surgery with respect to hospital stay, as the lower bound of the CI lies to the right of the noninferiority margin. Such a scenario is less likely, as it requires a very large sample size.17
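The scenarios in Figure 1 follow a simple decision rule based on where the CI falls relative to zero and the margin. The sketch below is an illustrative rendering of that logic (the 3-hour margin and the example intervals are assumed, and scenarios C and D are grouped under one label):

```python
def classify(lower, upper, margin=3.0):
    """Classify a noninferiority result from the 95% CI for the difference
    (new minus standard; positive values favour the standard treatment).
    A sketch of the decision logic behind Figure 1."""
    if lower > margin:
        return "inferior"        # whole CI beyond the margin (scenario E)
    if upper >= margin:
        return "inconclusive"    # CI crosses the margin (scenario B)
    if upper < 0:
        return "superior"        # CI entirely below zero (scenario A)
    return "noninferior"         # upper limit below the margin (C and D)

# Hypothetical intervals, in hours, mirroring the figure's scenarios:
for ci in [(-4, -1), (1, 5), (-1, 2), (0.5, 2.5), (4, 6)]:
    print(ci, "->", classify(*ci))
```

The interval (0.5, 2.5) corresponds to scenario D: the new treatment is statistically worse than the standard, yet still noninferior because the whole CI lies below the margin.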
Possible scenarios for observed mean differences in hospital stay between laparoscopic and open surgery in noninferiority designs. The dotted line represents the noninferiority margin. Reproduced from the study by Piaggio and colleagues17 with permission from the American Medical Association (license no. 4003080684018).
The choice of noninferiority margin requires sound clinical judgment.18 The noninferiority margin should be the smallest clinically meaningful difference between the 2 surgical interventions. In general, margins for mortality or serious adverse events should be more stringent than those for symptom control or quality of life.18 Many experts have stipulated that the noninferiority margin for efficacy outcomes should be no more than 50%, and preferably no more than 20% of the treatment effect for the standard treatment, as established in placebo-controlled superiority RCTs.18,19 Unfortunately no validated rules exist for calculating the noninferiority margin, and many trials use margins that statisticians consider to be too liberal.21 It is important that, whenever possible, this margin be validated by published expert consensus22 and not left to the sole discretion of the investigators or sponsors of the study.18
Returning to the clinical scenario
The article you identified is a prospective, open-label, multicentre, randomized noninferiority trial, by Ardehali and colleagues15 that took place at 10 heart transplant centres in the United States and Europe. Eligible heart transplant patients were randomly assigned to receive either donor hearts preserved with the organ care system (OCS; ex-vivo heart perfusion) or standard cold storage (SCS). The key methodological characteristics of the study are summarized in Figure 2 and Table 1.15
Methodological characteristics of the PROCEED II trial.15 *One patient experienced clinically significant ventricular assist device-related complications while waiting for a second donor offer; 1 patient needed a combined heart and kidney transplant while waiting for a second offer; 1 patient became ventilator-dependent on the day of transplant, and 1 patient deteriorated and was delisted; 1 donor heart had left-ventricular hypertrophy (> 1.3 cm), and 1 donor was older than 60 years. †Two opened randomization envelopes before confirming availability of the Organ Care System team, 1 organ procurement organization refused to retrieve the Organ Care System because of absence of research consent, and 1 donor heart had a high concentration of serum lactate before retrieval. ‡After turning down initial donor heart offers, 1 recipient had more than 2 sternotomies while waiting for a third offer and 1 recipient withdrew before a second offer. §The randomization card was misread. ¶Deviations in the organ care system group were due to user error in cannulation and 1 recipient receiving unassigned treatment; in the standard cold storage group 1 recipient was enrolled in another pharmaceutical trial and 4 received unassigned treatment. Reproduced from the study by Ardehali and colleagues15 with permission from Elsevier (license no. 4003081323980).
Primary and secondary outcomes of the noninferiority trial selected for study*
To effectively appraise a noninferiority surgical RCT, you use a similar framework to that of previous users’ guide articles (Box 1).1,3,23
Framework for critical appraisal of an article that deals with a surgical noninferiority randomized controlled trial (RCT)
Are the results valid?
Did the novel and standard surgical intervention groups start with similar prognostic factors?
Was the prognostic balance between the 2 surgical groups maintained as the RCT progressed?
Did the investigators guard against an unwarranted conclusion of noninferiority?
Did the investigators analyze patients according to the surgical treatment they received, as well as to the groups to which they were assigned?
Did the investigators report a predefined noninferiority margin?
Did the investigators power the study for test of noninferiority?
What are the results?
Were all patient-important outcomes considered?
Were the results precise?
Did the investigators appropriately interpret the concept of noninferiority?
How can I apply the results to my patient or clinical practice?
Were the study patients similar to my patient?
Were all patient-important outcomes considered?
Are the likely advantages of the novel surgical treatment worth the potential harm and costs?
Are the results valid?
Did the novel and standard surgical intervention groups start with similar prognostic factors?
As with the more commonly seen superiority RCT, the noninferiority RCT is expected to minimize the risk of bias by ensuring concealment of randomization, balance between known and unknown prognostic factors, blinding of patients, surgeons and outcome assessors to treatment allocation, and complete follow-up of all patients. In reviewing the noninferiority trial by Ardehali and colleagues,15 you see that an independent biostatistician prepared sealed and masked randomization envelopes, which were assigned to the research trial sites. The investigators, however, did not report if the envelopes were opened sequentially and one at a time. Patients, investigators and medical personnel were not blinded to group allocation. They chose an open-label design because the method of donor heart preservation made blinding of medical staff infeasible. In reviewing Table 1 of their article, you see no glaring differences in main demographic characteristics of patients assigned to the 2 competing approaches; these characteristics included age, sex, height, body mass index (BMI), diagnosis of cardiomyopathy of the recipient patients, and the cause of death of the donor patients.
There was, however, some imbalance in the preservation time before the heart transplantation. The preservation time was longer in the OCS group than in the SCS group (324 ± 79 min v. 195 ± 65 min, p < 0.001); however, the mean total ischemia time was significantly shorter in the OCS group than in the SCS group (113 ± 27 min v. 195 ± 65 min, p < 0.001).
Was the prognostic balance between the 2 surgical groups maintained as the RCT progressed?
As heart transplantation is a definitive procedure, there is probably little room to provide differential care to affect the prognostic balance after the event. Figure 2 of the article by Ardehali and colleagues15 shows the flow of the patients in the 2 groups. It details the results of the randomization protocol, wherein 130 patients were randomly assigned to either group: 67 to OCS and 63 to SCS. There appear to have been deviations from the protocol in 2 patients in the OCS group and 5 in the SCS group. It is important to note that 2 patients in the OCS group and 1 patient in the SCS group crossed over (i.e., these patients were transplanted using the other respective system).
Ardehali and colleagues15 did not report details regarding postoperative care, so you do not know if there was differential care between the 2 groups. Therefore, you cannot conclude with any certainty whether the 2 groups were balanced in this regard.
Did the investigators guard against an unwarranted conclusion of noninferiority?
In the present noninferiority trial, the investigators declared the noninferiority margin (Δ) to be 0.10 (10%). Unfortunately, they did not provide any evidence to support this difference, which leads you to wonder if a smaller noninferiority margin (e.g., 5%) could have, or indeed should have, been accepted.
Did the investigators analyze patients according to the surgical treatment they received, as well as to the groups to which they were assigned?
The purpose of randomization is to ensure that prognostic factors are balanced between the surgical interventions. Patients who do not adhere to the allocated treatment as specified in the study protocol may have a different prognosis than those who do.24 Omission of patients who do not adhere to the novel intervention is likely to bias results toward overestimation of treatment effects in a superiority trial. An intention-to-treat analysis, wherein patients are analyzed according to the group to which they were assigned, provides an unbiased estimate of treatment effectiveness, irrespective of their adherence to the study protocol.
Ardehali and colleagues15 conducted both the intention-to-treat and the as-per-protocol analyses and found similar results in both. You are therefore reassured that the authors’ analysis of the results is likely appropriate. However, the authors could have further reassured readers by statistically addressing the missing data (e.g., with multiple imputation or best- and worst-case scenario analyses).
Did the investigators report a predefined noninferiority margin?
In a noninferiority RCT it is important that the investigators report the noninferiority margin and the rationale for choosing it. In the statistical analysis section of their methods, Ardehali and colleagues15 mentioned a 10% noninferiority margin, but provided no rationale for choosing it.
Did the investigators power the study for a test of noninferiority?
The investigators reported in the Methods section of their article that they calculated the 1-sided 95% upper confidence bound based on the normal approximation for the difference between the 2 population proportions. An upper confidence bound less than the 10% noninferiority margin would reject the null hypothesis. For the purpose of sample-size calculation, they assumed πOCS = 0.95 and πSCS = 0.94. On the basis of these assumptions, a normal approximation test and a 1-sided α level of 0.05, inclusion of 54 patients per treatment group would provide 80% power.
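The authors did not publish their exact formula, but a standard normal-approximation sample-size calculation for a noninferiority comparison of 2 proportions reproduces their figure of 54 patients per group under the stated assumptions. The sketch below is one common formulation, not necessarily the authors’ own method:

```python
from math import ceil
from statistics import NormalDist

def n_per_group(p_new, p_std, margin, alpha=0.05, power=0.80):
    """Approximate per-group sample size for a noninferiority test of two
    proportions (normal approximation); a sketch of one common formula."""
    z_a = NormalDist().inv_cdf(1 - alpha)   # 1.645 for 1-sided alpha of 0.05
    z_b = NormalDist().inv_cdf(power)       # 0.842 for 80% power
    var = p_new * (1 - p_new) + p_std * (1 - p_std)
    effect = margin - (p_std - p_new)       # margin minus expected difference
    return ceil((z_a + z_b) ** 2 * var / effect ** 2)

# Trial assumptions: survival 0.95 (OCS) and 0.94 (SCS), 10% margin.
print(n_per_group(0.95, 0.94, 0.10))  # → 54
```

Note how sensitive the result is to the margin: rerunning the same calculation with a 5% margin roughly quadruples the required sample size, which helps explain why liberal margins are attractive to investigators.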
There should be a justification for choosing a superiority versus a noninferiority study design. The authors justified the choice of the noninferiority design to some degree, in that the OCS provides certain benefits, such as the potential of “distant procurement for donor hearts, thus expanding the donor pool,” in contrast to standard cold storage. The justification of the study design rests on the research question asked and the hypothesis, specifically on the clinical advantages of the novel intervention. The measured outcomes play an important role in the sample size calculation through the choice of the MCID. Survival should demand a smaller MCID than, for example, a quality of life (QOL) outcome. The choice of 10% as the noninferiority margin seems very liberal; most surgeons or patients would not accept it in a case of life or death. A noninferiority margin of 1%–2% would likely be a better choice. This raises the concern that the study may originally have been designed as a superiority study.
What are the results?
Were all patient-important outcomes considered?
The investigators found that the 30-day patient and heart transplant survival rate (primary outcome) was 94% in the OCS group and 97% in the SCS group (p = 0.45). The intention-to-treat analysis (94% v. 97%, p = 0.36) and the as-per-protocol analysis (93% v. 97%, p = 0.39) supported the overall estimate.
Multiple clinically important outcomes were included, such as graft failure and left and right ventricular dysfunction, with a time horizon of 30 days. Some surgeons may consider this short-term time frame of limited value; a longer follow-up would have been more appropriate. Patient-important outcomes, such as quality of life, were not considered. A validated patient-reported outcome scale would have provided more information on the merits of the comparative interventions. You note this as a limitation of this noninferiority RCT.
The secondary outcomes — serious adverse events, incidence of severe rejection and median length of stay in the intensive care unit (ICU) — were similar for the 2 approaches. Based on these results, the investigators concluded that the OCS approach was not inferior to the SCS approach. You believe that their conclusion is reasonable based on the results of the study.
Were the results precise?
The precision of the results is normally presented as a confidence interval (CI). In this noninferiority study the authors provided the CI for both the primary outcomes (30-d patient and graft survival) and the secondary outcomes (cardiac-related serious adverse events, incidence of severe rejection and ICU length of stay). They provided this for the intention-to-treat, as-treated and as-per-protocol analyses (Table 1).
Did the investigators appropriately interpret the concept of noninferiority?
The authors reported the 30-day patient and graft survival rates to be 94% in the OCS group and 97% in the SCS group. The patient and graft survival rates were the same, as no repeat heart transplant surgeries were performed. The authors reported that the upper bound of the 95% CI for the percentage differences in the primary effectiveness outcome between the 2 populations was 8.8%, which is less than 10%, so the null hypothesis was rejected in favour of the alternative hypothesis. Based on this finding you concur that noninferiority was shown.
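The reported comparison of the upper confidence bound (8.8%) with the margin (10%) can be sketched as follows. The group sizes below are hypothetical (the exact analysis counts appear in the published report and are not reproduced here), so this sketch illustrates the method rather than recomputing the authors’ exact figure:

```python
from math import sqrt
from statistics import NormalDist

def upper_bound(x1, n1, x2, n2, alpha=0.05):
    """One-sided (1 - alpha) upper confidence bound on the difference in
    proportions p1 - p2 (normal approximation)."""
    p1, p2 = x1 / n1, x2 / n2
    se = sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    return (p1 - p2) + NormalDist().inv_cdf(1 - alpha) * se

# Hypothetical counts chosen to mirror the reported rates (~97% survival
# in SCS vs ~94% in OCS), with an assumed ~60 patients per arm.
ub = upper_bound(58, 60, 56, 60)   # SCS survival minus OCS survival
print(f"Upper 95% bound on the survival difference: {ub:.1%}")
```

Noninferiority is declared when this upper bound falls below the 10% margin, which is exactly the comparison the authors reported (8.8% < 10%).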
How can I apply the results to my patient or clinical practice?
Were the study patients similar to my patient?
Based on the demographic evidence provided by Ardehali and colleagues15 in Table 1 of their study (not shown here), in which they reported the mean age, weight, height, BMI and types of cardiomyopathy their patients had as well as the donor characteristics, you conclude that the patients treated in your division would be similar and that, therefore, the study’s conclusions are applicable.
Were all patient-important outcomes considered?
The investigators included patient and graft 30-day survival as a primary outcome. A longer survival time horizon (e.g., 1- or 2-yr survival) would have been preferable. The investigators also included 30-day right and left ventricular function and length of stay in the ICU as secondary outcomes. The outcomes research movement in the last 20 years, however, expects clinical investigators to measure patients’ quality of life after medical interventions. The quality of life assessment requires a longer follow-up, and this trial was designed based on immediate and short-term outcome assessment. The authors might have suggested this for future investigation.
Are the likely advantages of the novel surgical treatment worth the potential harm and costs?
Although the authors concluded that the novel intervention was noninferior to the standard approach, you should not be rushed to adopt it. The investigators did not report the resource utilization associated with either approach. Many new innovations are costly. The ideal study, therefore, would be one in which resource utilization and costs are captured. Health-related quality of life can also be measured using a utility scale from which quality-adjusted life years (QALYs) can be calculated. The integration of costs and QALYs in a cost–utility analysis can help determine whether the new innovation is cost-effective or not.25
Resolution of the scenario
There are consequences for future patients and society if incorrect inferences from poorly designed and conducted noninferiority RCTs are accepted. It is important to determine whether this noninferiority study is really a failed superiority RCT. You can do this by determining whether the authors’ noninferiority threshold was appropriate. To do so, you review the literature for similar studies to determine the upper boundary for the CI of the primary outcome (30-d patient and heart transplant survival) and examine the extent to which it exceeds the chosen threshold. If the upper boundary differs substantially from the threshold chosen by the investigators (10%), you may choose not to adopt this new technology. This is, unfortunately, the case with the RCT by Ardehali and colleagues.15 Their noninferiority margin of 0.10 (10%) was chosen without supportive documentation.
Conclusion
Although, in general, you are happy with the designation of this study as an RCT, you are not persuaded that it met all the criteria for designation as a noninferiority RCT. Specifically, you are concerned that the noninferiority margin of 0.10 (10%) was chosen without supportive evidence. As a result, you advise your colleagues that the study has definite weaknesses. You then offer to review and critique a superiority RCT comparing these approaches to present at next week’s rounds.
Footnotes
Competing interests: None declared.
Contributors: All authors designed the study. D. Waltho acquired the data, which F. Farrokhyar, D. Waltho and C. Goldsmith analyzed. A. Thoma, F. Farrokhyar and D. Waltho wrote the article, which all authors reviewed and approved for publication.
- Accepted July 5, 2017.