Perspectives1 October 1996
    Author, Article, and Disclosure Information

    Abstract

    Phase 3 clinical trials, which evaluate the effect that new interventions have on the clinical outcomes of particular relevance to the patient (such as death, loss of vision, or other major symptomatic event), often require many participants to be followed for a long time.There has recently been great interest in using surrogate end points, such as tumor shrinkage or changes in cholesterol level, blood pressure, CD4 cell count, or other laboratory measures, to reduce the cost and duration of clinical trials. In theory, for a surrogate end point to be an effective substitute for the clinical outcome, effects of the intervention on the surrogate must reliably predict the overall effect on the clinical outcome. In practice, this requirement frequently fails. Among several explanations for this failure is the possibility that the disease process could affect the clinical outcome through several causal pathways that are not mediated through the surrogate, with the intervention's effect on these pathways differing from its effect on the surrogate. Even more likely, the intervention might also affect the clinical outcome by unintended, unanticipated, and unrecognized mechanisms of action that operate independently of the disease process. We use examples from several disease areas to illustrate how surrogate end points have been misleading about the actual effects that treatments have on the health of patients.

    Surrogate end points can be useful in phase 2 screening trials for identifying whether a new intervention is biologically active and for guiding decisions about whether the intervention is promising enough to justify a large definitive trial with clinically meaningful outcomes.In definitive phase 3 trials, except for rare circumstances in which the validity of the surrogate end point has already been rigorously established, the primary end point should be the true clinical outcome.

    References

    • 1. Fleming TR. Evaluating therapeutic interventions: some issues and experiences (with discussion and rejoinder). Statistical Science. 1992; 7:428-56. Google Scholar
    • 2. Fleming TR. Evaluation of active control trials in AIDS. J Acquir Immune Defic Syndr. 1990; 3(Suppl 2):S82-7. Google Scholar
    • 3. Johnson JR, Temple R. Food and Drug Administration requirements for approval of new anticancer drugs. Cancer Treat Rep. 1985; 69:1155-9. Google Scholar
    • 4. Ellenberg SS, Hamilton JM. Surrogate endpoints in clinical trials: cancer. Stat Med. 1989; 8:405-13. Google Scholar
    • 5. Fleming TR, Prentice RL, Pepe MS, Glidden D. Surrogate and auxiliary endpoints in clinical trials, with potential applications in cancer and AIDS research. Stat Med. 1994; 13:955-68. Google Scholar
    • 6. Herson J. The use of surrogate endpoints in clinical trials. Stat Med. 1989; 8:403-4. Google Scholar
    • 7. Hillis A, Seigel D. Surrogate endpoints in clinical trials: ophthalmologic disorders. Stat Med. 1989; 8:427-30. Google Scholar
    • 8. Kosorok MR, Fleming TR. Using surrogate failure time data to increase cost effectiveness in clinical trials. Biometrika. 1993; 80:823-33. Google Scholar
    • 9. Machado SG, Gail MH, Ellenberg SS. On the use of laboratory markers as surrogates for clinical endpoints in the evaluation of treatment for HIV infection. J Acquir Immune Defic Syndr. 1990; 3:1065-73. Google Scholar
    • 10. Pepe MS, Reilly M, Fleming TR. Auxiliary outcome data and the mean score method. Journal of Statistical Planning and Inference. 1994; 42:137-60. Google Scholar
    • 11. Prentice RL. Surrogate endpoints in clinical trials: definition and operational criteria. Stat Med. 1989; 8:431-40. Google Scholar
    • 12. Wittes J, Lakatos E, Probstfield J. Surrogate endpoints in clinical trials: cardiovascular diseases. Stat Med. 1989; 8:415-25. Google Scholar
    • 13. Temple RJ. A regulatory authority's opinion about surrogate endpoints. In: Nimmo WS, Tucker GT, eds. Clinical Measurement in Drug Evaluation. New York: J Wiley; 1995. Google Scholar
    • 14. Ellenberg SS. Surrogate end points in clinical trials [Editorial]. BMJ. 1991; 302:63-4. Google Scholar
    • 15. Fleming TR. Surrogate markers in AIDS and cancer trials. Stat Med. 1994; 13:1423-35. Google Scholar
    • 16. Lagakos SW, Hoth DF. Surrogate markers in AIDS: where are we? Ann Intern Med. 1992; 116:599-601. Google Scholar
    • 17. Lin DY, Fischl MA, Schoenfeld DA. Evaluating the role of CD4-lymphocyte counts as surrogate endpoints in human immunodeficiency virus clinical trials. Stat Med. 1993; 12:835-42. Google Scholar
    • 18. Aboulker JP, Swart AM: Preliminary analysis of the Concorde Trial. Concorde Coordinating Committee [Letter]. Lancet. 1993; 341:889-90. Google Scholar
    • 19. Choi S, Lagakos SW, Schooley TT, Volberding PA. CD4+ lymphocytes are an incomplete surrogate marker for clinical progression in persons with asymptomatic HIV infection taking zidovudine. Ann Intern Med. 1993; 118:674-80. Google Scholar
    • 20. De Gruttola V, Wulfsohn M, Fischl M, Tsiatis A. Modeling the relationship between survival and CD4 lymphocytes in patients with AIDS and AIDS-related complex. J Acquir Immune Defic Syndr. 1993; 6:359-65. Google Scholar
    • 21. Jacobson MA, Bacchetti P, Kolokathis A, Chaisson RE, Szabo S, Polsky B, et al. Surrogate markers for survival in patients with AIDS and AIDS related complex treated with zidovudine. BMJ. 1991; 302:73-8. Google Scholar
    • 22. Nowak R. In Concorde's wake: is the AIDS clinical-trials program flawed? Journal of NIH Research. 1993; 5:37-9. Google Scholar
    • 23. Volberding PA, Lagakos SW, Grimes J, Stein DS, Balfour HH, Reichman RC, et al. The duration of zidovudine benefit in persons with asymptomatic HIV infection. Prolonged evaluation of protocol 019 of the AIDS Clinical Trials Group. JAMA. 1994; 272:437-42. Google Scholar
    • 24. Volberding PA, Lagakos SW, Koch MA, Pettinelli C, Myers MW, Booth DK, et al. Zidovudine in asymptomatic human immunodeficiency virus infection. A controlled trial in persons with fewer than 500 CD4-positive cells per cubic millimeter. The AIDS Clinical Trials Group of the National Institute of Allergy and Infectious Diseases. N Engl J Med. 1990; 322:941-9. Google Scholar
    • 25. Boissel JP, Collet JP, Moleur P, Haugh M. Surrogate endpoints: a basis for a rationale approach. Eur J Clin Pharmacol. 1992; 43:235-44. Google Scholar
    • 26. Preliminary Report: effect of encainide and flecainide on mortality in a randomized trial of arrhythmia suppression after myocardial infarction. The Cardiac Arrhythmia Suppression Trial (CAST) Investigators. N Engl J Med. 1989; 321:406-12. Google Scholar
    • 27. Echt DS, Liebson PR, Mitchell LB, Peters RW, Obias-Manno D, Barker AH, et al. Mortality and morbidity in patients receiving encainide, flecainide, or placebo. The Cardiac Arrhythmia Suppression Trial. N Engl J Med. 1991; 324:781-8. Google Scholar
    • 28. Effect of the antiarrhythmic agent moricizine on survival after myocardial infarction. The Cardiac Arrhythmia Suppression Trial II Investigators. N Engl J Med. 1992; 327:227-33. Google Scholar
    • 29. Coplen SE, Antman EM, Berlin JA, Hewitt P, Chalmers TC. Efficacy and safety of quinidine therapy for maintenance of sinus rhythm after cardioversion. A meta-analysis of randomized control trials. Circulation. 1990; 82:1106-16. Google Scholar
    • 30. Hine LK, Laird N, Hewitt P, Chalmers TC. Meta-analytic evidence against prophylactic use of lidocaine in acute myocardial infarction. Arch Intern Med. 1989; 149:2694-8. Google Scholar
    • 31. MacMahon S, Collins R, Peto R, Koster RW, Yusuf S. Effects of prophylactic lidocaine in suspected acute myocardial infarction. An overview of results from the randomized, controlled trials. JAMA. 1988; 260:1910-6. Google Scholar
    • 32. Effects of enalapril on mortality in severe congestive heart failure. Results of the Cooperative North Scandinavian Enalapril Survival Study (CONSENSUS). The CONSENSUS Trial Study Group. N Engl J Med. 1987; 316:1429-35. Google Scholar
    • 33. Pfeffer MA, Braunwald E, Moye LA, Basta L, Brown EJ Jr, Cuddy TE, et al. Effect of captropril on mortality and morbidity in patients with left ventricular dysfunction after myocardial infarction. Results of the Survival and Ventricular Enlargement Trial. The SAVE Investigators. N Engl J Med. 1992; 327:669-77. Google Scholar
    • 34. Effects of enalapril on mortality in severe congestive heart failure. Results of the Cooperative North Scandinavian Enalapril Survival Study (CONSENSUS). The CONSENSUS Trial Study Group. N Engl J Med. 1987; 316:1429-35. Google Scholar
    • 35. Feldman AM, Bristow MR, Parmley WW, Carson PE, Pepine CJ, Gilbert EM, et al. Effects of vesnarinone on morbidity and mortality in patients with heart failure. Vesnarinone Study Group. N Engl J Med. 1993; 329:149-55. Google Scholar
    • 36. Rationale, design, implementation, and baseline characteristics of patients in the DIG trials: a large, simple, long-term trial to evaluate the effect of digitalis on mortality in heart failure. The Digitalis Investigation Group. Control Clin Trials. 1996; 17:77-97. Google Scholar
    • 37. Packer M, Carver JR, Rodehoffer RJ, Ivanhoe RJ, DiBianco R, Zeldis SM, et al. Effect of oral milrinone on mortality in severe chronic heart failure. The PROMISE Study Research Group. N Engl J Med. 1991; 325:1468-75. Google Scholar
    • 38. Packer M, Rouleau J, Swedberg K, Pitt B, Fisher L, Klepper M, et al. Effect of Flosequinan on survival in chronic heart failure: preliminary results of the PROFILE study [Abstract]. Circulation. 1993; 88(Suppl 1):I-301. Google Scholar
    • 39. Law MR, Thompson SG, Wald NJ. Assessing possible hazards of reducing serum cholesterol. BMJ. 1994; 308:373-9. Google Scholar
    • 40. Law MR, Wald NJ, Thompson SG. By how much and how quickly does reduction in serum cholesterol concentration lower risk of ischemic heart disease? BMJ. 1994; 308:367-72. Google Scholar
    • 41. Rossouw JE, Lewis B, Rifkind BM. The value of lowering cholesterol after myocardial infarction. N Engl J Med. 1990; 323:1112-9. Google Scholar
    • 42. Clofibrate and niacin in coronary heart disease. JAMA. 1975; 231:360-81. Google Scholar
    • 43. Muldoon MF, Manuck SB, Matthews KA. Lowering cholesterol concentrations and mortality: a quantitative review of primary prevention trials. BMJ. 1990; 301:309-14. Google Scholar
    • 44. Holme I. Relation of coronary heart disease incidence and total mortality to plasma cholesterol reduction in randomised trials: use of meta-analysis. Br Heart J. 1993; 69(1 Suppl):S42-7. Google Scholar
    • 45. Law MR, Wald NJ, Thompson SG. By how much and how quickly does reduction in serum cholesterol concentration lower risk of ischaemic heart disease? BMJ. 1994; 308:367-72. Google Scholar
    • 46. Law MR, Thompson SG, Wald NJ. Assessing possible hazards of reducing serum cholesterol. BMJ. 1994; 308:373-9. Google Scholar
    • 47. Gordon DJ. Cholesterol lowering and total mortality. In: Rifkind BM, ed. Contemporary Issues in Cholesterol Lowering: Clinical and Population Aspects. New York: Marcel Dekker; 1994. Google Scholar
    • 48. Randomised trial of cholesterol lowering in 4444 patients with coronary heart disease: the Scandinavian Simvastatin Survival Study (4S). Lancet. 1994; 344:1383-9. Google Scholar
    • 49. Collins R, Peto R, MacMahon S, Hebert P, Fiebach NH, Eberlein KA, et al. Blood pressure, stroke, and coronary heart disease. Part 2, Short-term reductions in blood pressure: overview of randomized drug trials in their epidemiological context. Lancet. 1990; 335:827-38. Google Scholar
    • 50. Five-year findings of the hypertension detection and follow-up program. I. Reduction in mortality of persons with high blood pressure, including mild hypertension. Hypertension Detection and Follow-up Program Cooperative Group. JAMA. 1979; 242:2562-71. Google Scholar
    • 51. Prevention of stroke by antihypertensive drug treatment in older persons with isolated systolic hypertension: final results of the Systolic Hypertension in the Elderly Program (SHEP). SHEP Cooperative Research Group. JAMA. 1991; 265:3255-64. Google Scholar
    • 52. Furberg CD, Berglund G, Manolio TA, Psaty BM. Overtreatment and undertreatment of hypertension. J Intern Med. 1994; 235:387-97. Google Scholar
    • 53. Psaty BM, Heckbert SR, Koepsell TD, Siscovick DS, Lemaitre R, Smith NL, et al. The risk of incident myocardial infarction associated with anti-hypertensive drug therapies [Abstract]. Circulation. 1996; 91:925. Google Scholar
    • 54. Held PH, Yusuf S, Furberg CD. Calcium channel blockers in acute myocardial infarction and unstable angina: an overview. BMJ. 1989; 299:1187-92. Google Scholar
    • 55. Moertel CG. Improving the efficiency of clinical trials: a medical perspective. Stat Med. 1984; 3:455-68. Google Scholar
    • 56. Modulation of fluorouracil by leucovorin in patients with advanced colorectal cancer: evidence in terms of response rate. Advanced Colorectal Cancer Meta-Analysis Project. J Clin Oncol. 1992; 10:896-903. Google Scholar
    • 57. Sande MA, Carpenter CC, Cobbs CG, Holmes KK, Sanford JD. Anti-retroviral therapy for adult HIV-infected patients. Recommendations from a state-of-the-art conference. National Institute of Allergy and Infectious Diseases State-of-the-Art Panel on Anti-Retroviral Therapy for Adult HIV-Infected Patients. JAMA. 1993; 270:2583-9. Google Scholar
    • 58. Riggs BL, Hodgson SF, O'Fallon WM, Chao EY, Wahner HW, Muhs JM, et al. Effect of fluoride treatment on the fracture rate in postmenopausal women with osteoporosis. N Engl J Med. 1990; 322:802-9. Google Scholar
    • 59. Riggs BL, Seeman E, Hodgson SF, Taves DR, O'Fallon WM. Effect of the fluoride/calcium regimen on vertebral fracture occurrence in postmenopausal osteoporosis. Comparison with conventional therapy. N Engl J Med. 1982; 306:446-50. Google Scholar
    • 60. A controlled trial of interferon γ to prevent infection in chronic granulomatous disease. The International Chronic Granulomatous Disease Cooperative Study Group. N Engl J Med. 1991; 324:509-16. Google Scholar
    • 61. Comstock GW. Identification of an effective vaccine against tuberculosis. Am Rev Respir Dis. 1988; 138:479-80. Google Scholar
    • 62. Urokinase-Streptokinase embolism trial. Phase 2 results. A cooperative study. JAMA. 1974; 229:1606-13. Google Scholar
    • 63. The Thrombolysis in Myocardial Infarction (TIMI) trial. Phase I findings. TIMI Study Group N Engl J Med. 1985; 312:932-6. Google Scholar
    • 64. Effectiveness of intravenous thrombolytic treatment in acute myocardial infarction. Gruppo Italiano per lo Studio Della Streptochinasi nell'Infarto Miocardico (GISSI). Lancet. 1986; 1:397-402. Google Scholar
    • 65. Randomised trial of intravenous streptokinase, oral aspirin, both, or neither among 17,187 cases of suspected myocardial infarction: ISIS-2. ISIS-2 (Second International Study of Infarct Survival) Collaborative Group. Lancet. 1988; 2:349-60. Google Scholar
    • 66. GISSI-2: a factorial randomised trial of alteplase versus streptokinase and heparin versus no heparin among 12,490 patients with acute myocardial infarction. Gruppo Italiano per lo Studio della Sopravvivenza nell'Infarto Miocardico. Lancet. 1990; 336:65-71. Google Scholar
    • 67. In-hospital mortality and clinical course of 20,891 patients with suspected acute myocardial infarction randomised between alteplase and streptokinase with or without heparin. The International Study Group. Lancet. 1990; 336:71-5. Google Scholar
    • 68. An international randomized trial comparing four thrombolytic strategies for acute myocardial infarction. The GUSTO Investigators. N Engl J Med. 1993; 329:673-82. Google Scholar
    • 69. Fontana RS, Sanderson DR, Taylor WF, Woolner LB, Miller WE, Muhm JR, et al. Early lung cancer detection: results of the initial (prevalence) radiologic and cytologic screening in the Mayo Clinic Study. Am Rev Respir Dis. 1984; 130:561-5. Google Scholar
    • 70. Fontana RS, Sanderson DR, Woolner LB, Miller WE, Muhm JR. Lung cancer screening: The Mayo program. J Occup Med. 1986; 28:746-50. Google Scholar
    • 71. Berson EL, Rosner B, Sandberg MA, Hayes KC, Nicholson BW, Wiegel-DiFranco C, et al. A randomized trial of vitamin A and vitamin E supplementation for retinitis pigmentosa. Arch Ophthalmol. 1993; 111:761-72. Google Scholar
    • 72. Massof RW, Finkelstein D. Supplemental vitamin A retards loss of ERG amplitude in retinitis pigmentosa [Editorial]. Arch Ophthalmol. 1993; 111:751-4. Google Scholar
    • 73. Continuous or nocturnal oxygen therapy in hypoxemic chronic obstructive lung disease: a clinical trial. Nocturnal Oxygen Therapy Trial Group. Ann Intern Med. 1980; 93:391-8. Google Scholar
    • 74. Hujoel PP, DeRouen TA. A survey of endpoint characteristics in periodontal clinical trials published 1988-1992, and implications for future studies. J Clin Periodontol. 1995; 22:397-407. Google Scholar
    • 75. Hujoel PP, DeRouen TA, Leroux B, Powell LV, Kiyak HA. The validity of probing attachment levels as a surrogate for tooth loss in the elderly [Abstract]. J Dent Res. 1996; 75:370. Google Scholar
    • 76. Coronary artery surgery study (CASS): a randomized trial of coronary artery bypass surgery. Survival data. Circulation. 1983; 68:939-50. Google Scholar
    • 77. Lin DY, Fleming TR, DeGruttola V. Estimating the proportion of treatment effect explained by a surrogate marker. Stat Med. 1996; [In press]. Google Scholar
    • 78. Freedman LS, Graubard BL, Schatzkin A. Statistical validation of intermediate endpoints for chronic diseases. Stat Med. 1992; 11:167-78. Google Scholar
    • 79. DeGruttola V, Fleming T, Lin DY, Coombs R. Validating surrogate markers—are we being naive? J Infect Dis. [In press]. Google Scholar