Discussion
The personal and public health consequences of lung cancer are enormous, and even a small benefit from screening could save many lives. This review found that in 1 large, good-quality trial that used 3 annual LDCTs to screen high-risk persons aged 55 to 74 years, lung cancer and all-cause mortality were reduced in the LDCT group compared with the annual chest radiography group by 20% and 7%, respectively (
Appendix Table 3)
(53). Twenty-five percent of the overall deaths in the control group were from lung cancer in this study, highlighting the large contribution of this disease to overall mortality in this age and risk strata of the population.
One fair-quality Italian trial involving men older than 60 years suggested that screening CT reduced lung cancer mortality, but this association was not significant
(39, 40). Two European trials (1 fair-quality
[60] and 1 poor-quality
[57]) in lower-risk and younger patients showed no benefit of LDCT screening in reducing lung cancer mortality. In the evidence report
(22), we found no data to support chest radiography for lung cancer screening, although data from the PLCO (Prostate, Lung, Colorectal, and Ovarian) Cancer Screening Trial suggested a benefit among high-risk persons and possibly high- and average-risk women
(22, 56). This suggests that, if there is any benefit of chest radiography screening, the benefit of lung cancer screening with LDCT shown in the NLST may be even greater if applied to an unscreened population.
Several factors may account for differential mortality among trials. First, the European studies enrolled fewer patients and had shorter follow-up than the NLST and had inadequate power to detect a difference in lung cancer mortality. In addition, the DANTE trial had reduced power to detect a difference, because 9 patients in the control group and 16 patients in the LDCT group were diagnosed with lung cancer at baseline with chest radiography and sputum cytology. This difference also suggests possible inadequate randomization or inadequate sample size, because the baseline risk for cancer diagnosed by sputum cytology or chest radiography was nearly 2-fold higher in the LDCT group.
Second, follow-up durations among randomized groups differed in the European trials. Adjustment for actual follow-up months by group in the MILD and DANTE studies markedly changed results, with the latter suggesting a benefit of screening rather than a neutral effect. However, this difference was not significant. In addition, the DLCST investigators noted that follow-up for lung cancer in the control group was less complete than in the intervention group, but they did not provide actual follow-up time.
Third, participants in the studies showing or suggesting reduced lung cancer mortality (NLST and the DANTE trial) were older with greater smoking exposure than those in studies not showing benefit (
Table). Of note, lung cancer incidence and mortality and overall mortality rates in the NLST and the DANTE trial were 2- to 4-fold higher than in the DLCST and the MILD study, suggesting that LDCT screening might be more beneficial in higher-risk populations. A recent modeling study supports this hypothesis, finding that the NNS to save 1 life from lung cancer over 6 years (3 years of annual screening) was 82 for high-risk participants compared with 3180 for minimally eligible NLST participants
(90).
Fourth, results from the MILD study should be interpreted with caution. The trial was rated poor-quality because of inadequate randomization with systematic differences between groups and differential follow-up. Finally, differential mortality among trials may be due to different population characteristics and systems of medical care in Europe than in the United States.
The potential benefits of lung cancer screening must be weighed against potential harms. Because of the low PPV of screening LDCT, subsequent procedures are often needed for diagnosis. These procedures are usually noninvasive, such as clinical examinations, repeated CT, and positron emission tomography; however, some may be invasive, such as biopsy and surgery. In the studies that we reviewed, most invasive procedures performed were for cancer, not benign disease, with a PPV ranging from 50% to 92% in included studies. This contrasts with the high number of false-positive findings requiring further evaluation with imaging or clinical follow-up, which were predominantly done for benign disease. Screening with LDCT did not seem to reduce overall quality of life or affect smoking rates. In addition, LDCT detected many incidental findings, such as emphysema and coronary artery calcifications, but the effect of these findings was not studied.
Overdiagnosis and consequent overtreatment is a concern in lung cancer screening. The 1% to 2.7% prevalence of unrecognized lung cancer suggests a preclinical pool of lung cancer in high-risk populations. The clinical significance of these tumors is uncertain, but patients with lung cancer typically receive treatment, resulting in harm to those with nonlethal cancer. Elderly smokers have high mortality rates from causes other than lung cancer, which also increases the risk for overdiagnosis.
In the future, biomarkers and CT variables, such as volume-doubling time and nodule size, may help discriminate among biologically aggressive and indolent tumors
(82, 89). Arguments against substantial overdiagnosis come from autopsy studies that report low rates of unsuspected lung cancer, as well as natural history studies showing high mortality rates among untreated patients with early-stage lung cancer
(63, 91–93). Overall, the reductions in lung cancer and all-cause mortality in the NLST, despite a higher incidence of lung cancer in the LDCT group (1040 vs. 941 cases), suggest that the benefit of screening outweighs the potential harm of overdiagnosis
(53).
Radiation exposure is a harm of LDCT lung cancer screening
(94). For context, LDCT is associated with radiation exposure near that of mammography. Radiation-induced cancer over 10 to 20 years is particularly concerning, although none of the studies reported on this potential outcome. Radiation dose varies by body weight, CT detector and manufacturer, and the number of images obtained. In many institutions, current practice involves following nodules with LDCT rather than high-resolution CT, which substantially reduces radiation exposure. If LDCT screening becomes routine, it will be important to measure the risk for radiation-associated harms and identify methods to lower the dose.
Our review differs from a recently published systematic review of LDCT screening
(95). First, our review is more comprehensive, because we identified 8215 citations compared with 591. For example, we identified 7 studies that reported psychosocial outcomes (3 of which reported quality of life), whereas the other review identified 1 study. Second, studies published since the other review provide new data. Third, we analyzed rates of lung cancer and mortality by using the actual person-years of follow-up, which affects the effect size observed in 2 of the trials.
Our review has limitations. The NLST results may not be generalizable, because participants were younger, better educated, and less likely to be current smokers than the general U.S. population that would be eligible for LDCT screening by NLST criteria
(96). The trial was conducted at mostly large academic centers. However, the large size of the trial, as well as the involvement of community health care providers in nodule management and treatment, may mitigate this concern. Furthermore, differences in population characteristics and systems of medical care and the small sample sizes used in the European studies may limit applicability to a U.S. population. Other limitations include a lack of specific information on LDCT screening in women and racial and ethnic groups. Studies of cost-effectiveness, modeling studies of radiation risk, and studies that evaluated patient perspectives on screening were not included because they were considered out of the scope of our review.
Future research to identify methods for focusing LDCT screening on persons at highest risk for disease, to improve discrimination between benign and malignant pulmonary nodules, and to find early indicators of aggressive disease is warranted. Studies have examined the role of biomarkers in these settings, and the NLST has collected biological specimens during enrollment; however, no results have yet been reported
(53). New studies of risk modeling that could apply to currently screened groups, such as the Bach and Liverpool risk models, may facilitate identification of patients at higher risk who might benefit differentially from screening with LDCT
(95, 97).
If LDCT screening becomes routine, the risk for harms should be measured and methods to limit them should be identified. It is also important to continue to evaluate the psychosocial consequences in patients who undergo screening, because psychological responses to screening and abnormal or normal results may differ between patients participating in research studies and the general population.
In conclusion, LDCT screening seemed to reduce lung cancer mortality. This result was driven by 1 large, good-quality study conducted in the United States in which the NNS to prevent 1 lung cancer death (among those who completed at least 1 screening) was 320 and the NNS to prevent 1 death overall was 219 over 6.5 years. These benefits compare with numbers needed to invite to screen to prevent 1 breast cancer death in mammography trials of 1339 for women aged 50 to 59 years after 11 to 20 years of follow-up
(98, 99). They also compare with an NNS with flexible sigmoidoscopy of 817 to prevent 1 colon cancer death
(100). Given the high number of current and former smokers in the population at risk for lung cancer, identifying and treating early-stage lung cancer with screening will hopefully clarify the balance of benefits and harms associated with screening. In addition, more work in public health to reduce smoking remains the most important approach to reducing morbidity and mortality from lung cancer.
Author's Response
Thank you for your letters regarding our recent systematic review of low-dose CT screening for lung cancer. We appreciate your interest and respond below:
1. Over-diagnosis--- as we note in our paper and the associated systematic review, over-diagnosis is a risk of lung cancer screening and of uncertain magnitude. We do not believe that a valid estimate of over-diagnosis comes from the Mayo Lung Project for many reasons, most outlined in our report [Humphrey, 2013 #16875] and the prior review [Humphrey, 2004 #61]). More research in this important area is needed.
2. We rated the quality of the DLCST as moderate. Limitations we identified included a lack of description of allocation concealment as you note. The paper states that patients were randomized at their first visit. Allocation concealment was not described, only that patients were randomized in block permutation form. [Pedersen, 2009 #3703] In general, we rely on information in publications to evaluate trial quality. In addition, on page 5 of the 2012 paper, unequal follow-up is noted. Specifically, the paper states that, “Information on lung cancer in the control group is not as up to date as in the screening group.”[Saghir, 2012 #8634]
3. As we note in our paper, we strongly agree that more research in the area of psychosocial outcomes will be very important to aid individual decision making for people considering low-dose CT screening for lung cancer.
Please note also that we do not make recommendations about screening. These are made by the Task Force based on the systematic review we prepare. We very much look forward to more data from current trials to better inform decision making in lung cancer screening. Thank you again for your interest.
Sincerely,
Linda Humphrey, MD, MPH
Mark Deffebach, MD
Miranda Pappas, MA
Bernadette Zakher, MBBS
Christopher Slatore, MD, MS
Recommendation about low-dose computed tomography screening is premature
The U.S. Preventive Services Task Force (USPSTF) has published a systematic review on screening for lung cancer with low-dose computed tomography (LDCT).(1) Seven randomized controlled trials (RCTs) on LDCT (one from the U.S., six from Europe) were identified but only four were included in the review because the other three have not reported results in the intervention and control groups yet. A reduction in lung-cancer specific mortality and a trend for reduced overall mortality was seen in the U.S. trial (which compared LDCT with chest X-ray, and had far more participants and years of follow-up than other trials), but not in any of the three European trials (which compared LDCT with no screening, have less participants, and have not reported final results). In addition, the USPSTF noted that the main harms were: radiation exposure, overdiagnosis, a high rate of false-positives and common incidental findings. In this comment we argue that the available evidence on screening harms is still too scarce to make definitive recommendations about low-dose computed tomography screening. We disagree with two arguments presented by the USPSTF: 1) that no studies formally reported overdiagnosis, and 2) that screening did not affect overall health-related quality of life or long-term anxiety. 1) Overdiagnosis The European trials, which compared LDCT with no screening, have not had enough follow-up time to report overdiagnosis. However, they all found more early-stage lung cancer, but not fewer cases of advanced lung cancer in the LDCT groups. (1) This is indirect evidence for overdiagnosis. In the NLST, which compared LDCT with chest X-ray, 119 more cases of lung cancer in the LDCT group were found (approximately 4 more lung cancers per 1000 participants).(2) Furthermore, overdiagnosis has previously been reported in chest X-ray screening.(3) Our interpretation is that LDCT-screening results in some degree of overdiagnosis, but the exact degree of overdiagnosis cannot be estimated before the European trials (which compared LDCT with no screening) have had sufficient follow-up time.(4) There was no evidence of overdiagnosis at 6 years of follow-up in the Prostate, Lung, Colorectal, and Ovarian (PLCO) study within the subgroup of participants that met the eligibility criteria of the NLST.(5) Nevertheless, in the Mayo Lung Project (MLP), lung cancer screening with chest X-ray was associated with approximately 10 overdiagnosed cancers per 1000 participants.(6) Thus, the reduction 3 lung-cancer deaths per 1000 participants found in the NLST(7) needs to be balanced against the plausible estimates of overdiagnosis: from 4/1000 participants (NLST + PLCO estimate) to 14/1000 participants (NLST + MLP estimate). 2) Psychosocial consequences, If psychosocial consequences of cancer screening are to be measured there is a need for questionnaires with high content validity and adequate psychometric properties.(8) In a qualitative study none of the interviewees reported pathological levels on anxiety and depression that needed therapeutic and/or pharmaceutical treatment.(9) The use of anxiolytic or antidepressant medication might therefore be an insufficient surrogate outcome for psychosocial consequences of LDCT-screening.(10) So far, RCTs on LDCT-screening have not provided evidence that the questionnaires used in their surveys have adequate measurement qualities and properties. Therefore, the results of these surveys are questionable. In the DLCST an instrument has been developed using qualitative group interviews and item response theory modeling: the Consequences of Screening in Lung Cancer (COS-LC) questionnaire.(9) The COS-LC has been used trough out the DLCST.(11) In the DLCST we have found that participation in lung cancer screening has negative psychosocial consequences for the apparently healthy participants one year after randomization.(12) In another study we have revealed substantial socio-demographic and psychosocial participation bias in DLCST(13) that most likely has underestimated the negative psychosocial consequences in the first study.(12) Before it can be assessed if a medical screening modality results in more good than harm – or vice versa – high quality evidence about the potential intended beneficial effects and the potential unintended harmful effects is needed. The USPSTF has 30 July on its website (http://www.uspreventiveservicestaskforce.org/draftrec.htm) published a draft of their recommendation for LDCT-screening based on their review that be commented until 26 August. The USPSTF recommends annual screening for lung cancer with low-dose computed tomography (LDCT) in persons at high risk for lung cancer based on age and smoking history. We do not think there is high quality evidence to support this recommendation because of the lack of high quality evidence about overdiagnosis and psychosocial consequences of LDCT-screening. Without such information we cannot understand how there is either “high certainty that the net benefit is moderate or there is moderate certainty that the net benefit is moderate to substantial”.The European RCTs will hopefully be able to raise sufficient evidence about the degree of overdiagnosis of LDCT-screening of which the true effect can only be determined in comparison to a non-screening control group.(4) Furthermore, in the DLCST we are working on our data on costs and psychosocial consequences and hope to report our main results soon. In addition, cost-effectiveness analysis can only be conducted if evidence about all relevant benefits and harms exists. Without high quality evidence about overdiagnosis, psychosocial consequences and cost effectiveness we find that any recommendation about LDCT-screening is premature.
Reference List
(1) Humphrey LL, Deffebach M, Pappas M, Baumann C, Artis K, Mitchell JP, et al. Screening for Lung Cancer With Low-Dose Computed Tomography: A Systematic Review to Update the U.S. Preventive Services Task Force Recommendation. Ann Intern Med 2013 Jul 30;N/A(N/A):N/A.
(2) Aberle DR, Adams AM, Berg CD, Black WC, Clapp JD, Fagerstrom RM, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med 2011 Aug 4;365(5):395-409.
(3) Manser R, Lethaby A, Irving LB, Stone C, Byrnes G, Abramson MJ, et al. Screening for lung cancer. Cochrane Database Syst Rev 2013;6:CD001991.
(4) Field JK, van Klaveren R, Pedersen JH, Pastorino U, Paci E, Becker N, et al. European randomized lung cancer screening trials: Post NLST. J Surg Oncol 2013 Jul 1;n/a.
(5) Oken MM, Hocking WG, Kvale PA, Andriole GL, Buys SS, Church TR, et al. Screening by Chest Radiograph and Lung Cancer Mortality: The Prostate, Lung, Colorectal, and Ovarian (PLCO) Randomized Trial. JAMA 2011 Oct 26.
(6) Marcus PM, Bergstralh EJ, Fagerstrom RM, Williams DE, Fontana R, Taylor WF, et al. Lung cancer mortality in the Mayo Lung Project: impact of extended follow-up. J Natl Cancer Inst 2000 Aug 16;92(16):1308-16.
(7) Heleno B, Rasmussen JF, Brodersen J. Reduced Lung-Cancer Mortality with CT Screening. N Engl J Med 2011 Nov 24;365(21):2035-8.
(8) Brodersen J, McKenna SP, Doward LC, Thorsen H. Measuring the psychosocial consequences of screening. Health Qual Life Outcomes 2007 Jan 8;5(1):3.
(9) Brodersen J, Thorsen H, Kreiner S. Consequences Of Screening in Lung Cancer: Development and Dimensionality of a Questionnaire. Value in Health 2010 Aug 18;13(5):601-12.
(10) Kaerlev L, Iachina M, Pedersen JH, Green A, Norgard BM. CT-Screening for lung cancer does not increase the use of anxiolytic or antidepressant medication. BMC Cancer 2012;12:188.
(11) Pedersen JH, Ashraf H, Dirksen A, Bach K, Hansen H, Tønnesen P, et al. The Danish Randomized Lung Cancer CT Screening Trial – Overall Design and Results of the Prevalence Round. Journal of Thoracic Oncology 2009 May;4(5):608-14.
(12) Aggestrup LM, Hestbech MS, Siersma V, Pedersen JH, Brodersen J. Psychosocial Consequences of Allocation to Lung Cancer Screening – a Randomised Controlled Trial. BMJ Open 2012;10.1136/bmjopen-2011-000663.
(13) Hestbech MS, Siersma V, Dirksen A, Pedersen JH, Brodersen J. Participation bias in a randomised trial of screening for lung cancer. Lung Cancer 2011 Sep;73(3):325-31.
Clarifying information about the Danish Lung Cancer Screening Trial
The U.S. Preventive Services Task Force (USPSTF) has published a systematic review on screening for lung cancer with low-dose computed tomography (LDCT).[1] In the review the USPSTF stated 1) that there were insufficient and unequal follow-up in three European randomized controlled trials to evaluate the degree of overdiagnosis and 2) that the allocation in the Danish Lung Cancer Screening Trial (DLCST) was unclear. In this comment we (as representatives of the Danish Lung Cancer Screening Trial (DLCST)) present clarifying information about the DLCST. 1) Insufficient and unequal follow-up. We agree with the USPSTF that the follow-up of the DLCST (and the two other European trials) is insufficient to assess the impact on mortality and the level of overdiagnosis. However, for mortality and cancer incidence the follow-up in the screening and control group was equal as stated in our status-paper after five annual screening rounds: “…the latest follow-up for both groups was set as end of screening, 31 March 2010.”[2] 2) Unclear allocation in the DLCST Allocation in the DLCST was described for the first time in our paper reporting on the prevalence round: "Participants were randomized by a computer program (random permuted blocks of 10 participants) to either annual screening by low-dose CT (the screening group) or the control group who were not offered CT screening."[3] Later, in our paper including all five screening rounds we wrote: "After inclusion, the participants were randomized to the screening group (n=2052) or the control group (n=2052). The screening group received five annual low-dose chest CT scans (one baseline scan and four subsequent incidence scans)."[2]We would have been happy to clarify any matter about allocation concealment if any of the authors of the USPSTF had contacted us. The participants in the DLCST were randomised by a centrally-held, computer-generated list (random permuted blocks of 10 participants). Allocation concealment was ensured, as the computer application did not release the randomization code until the participant had been recruited into the trial, which took place after eligibility was checked and informed consent was provided. During the entire recruitment period (October 2004 to March 2006), the researchers involved in participant recruitment were unaware that the randomised list contained random permuted blocks, and thus could not guess the next allocation in the sequence.
Reference List
[1] Humphrey LL, Deffebach M, Pappas M, Baumann C, Artis K, Mitchell JP et al. Screening for Lung Cancer With Low-Dose Computed Tomography: A Systematic Review to Update the U.S. Preventive Services Task Force Recommendation. Ann Intern Med 2013; N/A(N/A):N/A.
[2] Saghir Z, Dirksen A, Ashraf H, Bach KS, Brodersen J, Clementsen PF et al. CT screening for lung cancer brings forward early disease. The randomised Danish Lung Cancer Screening Trial: status after five annual screening rounds with low-dose CT. Thorax 2012; 67(4):296-301.
[3] Pedersen JH, Ashraf H, Dirksen A, Bach K, Hansen H, Tønnesen P et al. The Danish Randomized Lung Cancer CT Screening Trial – Overall Design and Results of the Prevalence Round. Journal of Thoracic Oncology 2009; 4(5):608-614.