|
|||||
|
|
||||||
Originally published as JCO Early Release 10.1200/JCO.2005.11.136 on March 21 2005 © 2005 American Society of Clinical Oncology. Improved Detection of Prostate Cancer Using Classification and Regression Tree AnalysisFrom the Division of Urology, Portland Veterans Administration Medical Center; Divisions of Urology and Hematology and Medical Oncology, Oregon Health and Science University; and Biostatistics Shared Resource, Oregon Health and Science University Cancer Institute, Portland, OR Address reprint requests to Mark Garzotto, MD, Urology Section, Portland Veterans Administration Medical Center, 3710 SW US Veterans Hospital Rd, Portland, OR 97239; e-mail: garzotto{at}ohsu.edu
PURPOSE: To build a decision tree for patients suspected of having prostate cancer using classification and regression tree (CART) analysis.
PATIENTS AND METHODS: Data were uniformly collected on 1,433 referred men with a serum prostate-specific antigen (PSA) levels of
RESULTS: CART analysis selected a PSA cutoff of more than 1.55 ng/mL for further work-up, regardless of DRE findings. CART then selected the following subgroups at risk for a positive biopsy: (1) PSAD more than 0.165 ng/mL/cc; (2) PSAD CONCLUSION: Application of CART analysis to the prostate biopsy decision results in a significant reduction in unnecessary biopsies while retaining a high degree of sensitivity when compared with the standard of performing a biopsy of all patients with an abnormal PSA or DRE.
Prostate cancer is the most common cancer in men and the second leading cause of male cancer deaths in the United States. In the absence of effective treatment options for advanced prostate cancer, intensive efforts to detect low-stage, curable cancers may help to improve prostate cancerspecific survival.1 Recent data suggest that prostate-specific antigen (PSA) screening reduces the clinical stage at presentation and prostate cancer mortality2; however, validation of these results awaits the completion of ongoing randomized screening trials.3 Optimal strategies for selecting appropriate patients for prostate biopsy have yet to be defined. Screening for prostate cancer detects the majority of prostate cancer patients; however, their effectiveness is severely hampered by a lack of specificity. Among men who are screened, 22% to 26% of men will have an abnormal serum PSA and/or digital rectal examination (DRE) and are likely to receive a recommendation for a prostate biopsy.4,5 Furthermore, 20% to 25% of these screen-positive men will be found to have cancer detected on biopsy. Thus, the majority of patients who are biopsied because of screening abnormalities undergo biopsy unnecessarily. To reduce the rate of unnecessary biopsies, efforts have focused on characterizing patient groups with an abnormal PSA and/or DRE who have a low likelihood of a positive biopsy. Multiple factors that have been associated with the detection of prostate cancer include age, race, family history, hypoechoic lesions on transrectal ultrasound (TRUS), PSA density (PSAD), PSA velocity, transition-zone PSAD, and percentage of free PSA.6 Prior work has demonstrated that the number of positive biopsies can be reduced by a modest degree with the use of percentage of free PSA, nomograms, predictive indices, and artificial neural networks.7-11 Instruments that accurately predict the presence of cancer have the potential to reduce the number of unnecessary biopsies along with their accompanying pain, morbidity, and cost.12 Classification and regression tree (CART) analysis belongs to a family of nonparametric regression methods and is based on the recursive partitioning method. The CART builds a decision tree structure and classifies subjects into high- and low-risk groups.13 It can be used simply to explore the data, identify possible high-risk subgroups, and uncover interactions or effect modifications among prognostic factors. Unlike the commonly used logistic regression method, CART analysis does not assume a multiplicative risk model or a specific parametric probability model, does not require a specification of the risk function (eg, a linear or quadratic effect of age), and is not affected by outlying observations. Most importantly, the results of CART analysis are presented as a decision tree, which is intuitive and easier to understand than the results of many other statistical methods. In this study, CART analysis was used to create a decision tree that can assist clinicians in making a prostate biopsy decision.
Study Population and Biopsy Procedure From 1993 to 2002, data were uniformly collected for the purposes of clinical care on 1,433 consecutive referred patients with a serum PSA level of 10 ng/mL who underwent an initial prostate biopsy procedure. Patient referrals were obtained in the course of routine clinical care and not as part of a population-based screening trial. The primary outcome in this study was the detection of prostate cancer on biopsy. It should be noted, however, that a negative biopsy is not synonymous with the absence of prostate cancer. Variables recorded included age, a family history of prostate cancer in a first-degree relative, race, indication for patient referral, and history of vasectomy. At presentation, the serum PSA measurement (Abbott Diagnostics, Abbott Park, IL) was repeated, and a member of the urology team performed a DRE on all patients. The DRE was classified as normal, asymmetric, suspicious, or cancer-likely.7 Before the biopsy procedure, all patients received a cleansing enema and prophylactic broad-spectrum antibiotics. Patients with overt evidence of prostatitis or urinary tract infection by urine dipstick test were excluded. Complete data were available on 1,433 consecutive patients with a serum PSA level of 10 ng/mL or less, and all analyses were carried out on this group. All procedures were carried out using a Bruel and Kjær system 3535 (Marlboro, MA) device with a model 8551 7.0-MHz probe. Prostate volumes were obtained by measuring the gland in three dimensions, and volume was estimated using the following formula: 0.52 [length (cm) x width (cm) x height (cm)]. The PSAD was calculated by dividing the serum PSA by the calculated prostate volume. At the same setting, all patients underwent ultrasound-guided prostate biopsies performed using an 18-gauge biopsy instrument (Bard, Covington, GA). A minimum of six biopsy cores was obtained from all patients (range, six to 13 cores). Additional lesional biopsies were obtained when a TRUS-detected lesion was detected outside of the planned biopsy template. This study was approved by the Portland Veterans Affairs Research Service Institutional Review Board and was granted exempt status from need for informed consent.
Statistical Methods This analysis was designed to identify predictors of biopsy outcome and not of true disease status. Current prostate diagnosis techniques do not allow for the complete elimination of false-negative biopsies; therefore, true disease status (presence or absence of prostate cancer) cannot be completely determined. Because the presence of false-negative biopsies introduces an asymmetric bias, the results should not be extrapolated to definitively predict true disease status.
Patient Characteristics The median age of the study group was 65.1 years (Table 1). Most of the patients were classified as white (93.3%). A family history of prostate cancer was reported in 17.6% of patients. The median PSA level in this group was 5.0 ng/mL (mean PSA, 4.8 ng/mL). The DRE was normal in 48.8% of patients, asymmetric in 5.5% of patients, suspicious in 41.7% of patients, and cancer-likely in 4.0% of patients.
Ultrasound and Biopsy Data The median prostate volume was 34.2 cc (range, 4.9 to 205 cc; Table 2). The median PSAD was 0.12 ng/mL/mL. Hypoechoic lesions were demonstrated in 44.9% of patients. Cancer was detected 24.4% of patients. The majority of tumors (51.8%) was determined to be Gleason score 6.
CART Analysis The initial CART procedure was carried out on the model building set (n = 1,173) using DRE and PSA data only to determine value of these primary factors in the prostate decision process. The initial CART selected a PSA cutoff level of more than 1.55 ng/mL alone for the identification of patients at risk for a positive prostate biopsy (Fig 1). Using this cutoff, 96.6% (sensitivity) of cancer patients (281 of 291 patients) were identified for further analysis, whereas 193 (21.9%) of 882 subjects without cancer were correctly identified (specificity). Using this PSA cutoff alone, the percent overall reduction in prostate biopsies was 17.3% in the model building set (203 of 1,173 procedures).
In the group with PSA more than 1.55 ng/mL (n = 970), prostate cancer was detected in 29.0% of patients (281 of 970 patients). The second CART analysis was carried out using all remaining variables, and the resulting decision tree was merged with the initial CART-derived tree. This supplementary decision algorithm identified the following four groups for a biopsy of the prostate: (1) PSAD more than 0.165 ng/mL/cc; (2) PSAD 0.165 ng/mL/cc and TRUS hypoechoic lesion; (3) PSAD 0.165 ng/mL/cc, absence of a hypoechoic TRUS lesion, age older than 55.5 years, and prostate volume 44.0 mL; and (4) PSAD 0.165 ng/mL/cc, absence of a hypoechoic TRUS lesion, age older than 55.5 years, and 50.25 cc less than prostate volume 80.8 cc. The incidences of cancer detection in these groups were 48.8%, 26.4%, 21.6%, and 16.5%, respectively. For subjects with a PSA more than 1.55 ng/mL, CART detected 98.9% of the remaining prostate cancers (278 of 281 cancers) and correctly identified patients without cancer in 20.5% of the remaining subjects (141 of 698 subjects). The complete model was found to have an overall sensitivity of 95.5% (278 of 291 cancer patients) and a specificity of 37.9% (334 of 882 noncancer patients) in the model building set. The positive predictive value was 33.7%, and the test carried a negative predictive value of 96.3%. CART analysis was then carried out using the randomly selected validation set (n = 260). In this study, the sensitivity was 96.6% (57 of 59 patients), and the specificity was 31.3% (63 of 201 patients). Assuming the prostate cancer prevalence of 24% in this population, the positive and negative predictive values from the validation set were 29.2% and 96.9%, respectively. The results generated by the CART were then compared with a logistic regression model created using the same factors. These analyses were performed using the validation data set. The CART decision tree had a receiver operator characteristic curve AUC of 0.74. When the same variables were entered into the logistic regression model, the AUC was 0.72. Thus, both CART- and logistic regressionbased models are comparable in their effectiveness in discriminating biopsy-positive cases from biopy-negative cases.
Generalized Additives Models
There was a monotone increasing relationship between the risk of prostate cancer and both patient age and PSAD. Serum PSA was found to have a biphasic distribution, with an initial linear increase in the low PSA range that was followed by a plateau phase in the intermediate PSA range from 4 to 10 ng/mL. Prostate volume demonstrated a sinusoidal pattern relative to prostate cancer risk.
Biopsy Gleason Scores
The optimal strategy to screen patients for prostate cancer has yet to be defined. Screening for prostate cancer with the combination of PSA and DRE has been thought to be a sensitive means of detecting prostate cancer15,16; however, recent data suggest these tests have diagnostic limitations. A study of men who had a normal DRE and a serum PSA level less than 3.0 ng/mL demonstrated that standard prostate screening practice is able to detect 85% of prostate cancers.17 The biologic significance of these tumors is not known; although these subclinical cancers may belong to a pool that are destined for a more indolent course.
Using PSA and DRE, up to 25% of men who present for prostate screening will have abnormalities that could lead to a biopsy of the prostate.5 Efforts have been made to reduce the number of unnecessary biopsy procedures through the use of PSA derivatives, PSA subspecies analysis, and multivariable models based on either logistic regression or artificial neural networks.8-11,18 Many of these models have not gained wide acceptance because of a lack of sensitivity, specificity, or both. Recently, we reported on the development of a prebiopsy nomogram, which incorporates the results of patient age and DRE and TRUS findings to determine the relative likelihood of a positive biopsy for patients with a serum PSA of In this study, a serum PSA level of greater than 1.55 ng/mL alone was identified as an indication for further testing in a population referred for prostate evaluation. The DRE lost predictive value when the PSA test was coanalyzed as a continuous variable. It is possible that, at this threshold, tumors may be too small for prostate cancer detection by DRE as previously suggested by Vis et al.19 Schroder et al20 found that the positive predictive value of the DRE was negatively associated with increased PSA levels. The positive predictive value for the DRE was 45% when the PSA level was between 4.0 and 10.0 ng/mL; however, it was reduced to 10% when the PSA level was between 1.0 and 1.9 ng/mL and was reduced to 4% when the PSA level was between 0.0 and 0.9 ng/mL. These investigators calculated that, for patients with DRE abnormalities and PSA values between 1.0 and 1.9 ng/mL, 38 biopsy procedures would be necessary to discover one cancer. For patients with PSA values from 0.0 to 0.9 ng/mL, 46 biopsies were required to detect a single cancer, which is a rate they determined to be "unacceptably high."4 Others have similarly reported the DRE to have a low positive predictive yield in patients with a low PSA.21,22
Recently, it has been proposed that the normal PSA cutoff level be decreased to 2.5 or 3.0 ng/mL.23,24 A higher rate of organ-confined disease and the potential for curability have been cited as justification to changing the current screening recommendations.20,25,26 Additional evidence lending support to a lowering of the PSA cutoff level comes from the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial.27 In this trial, men with a PSA level of less than 2.0 ng/mL had a 97.4% chance of maintaining a PSA
In the intermediate PSA range, the most important disease to discriminate from prostate cancer is benign prostatic hyperplasia.30 Many studies have shown a reduction in prostate cancer risk with increasing prostate size. In the current study, an overall reduction in prostate cancer risk was observed with increase in gland size. However, a trend towards increased risk was observed on the low and high end of the volume scale (Fig 2D). Because there are relatively small numbers of patients in these ranges, it is unclear what their significance is. Additional studies with greater numbers of patients in these categories would be required to make assumptions about prostate cancer risk in these areas. The CART model chose select patients with smaller prostate glands (volume There were limitations to the current study that warrant mention. Because percentage of free PSA was not available at the onset of this study, we were unable to assess its utility in the current model. The CART model did include PSAD, which has been shown to have equivalent or improved predictive capacity compared with percentage of free PSA.31,32 Future studies that incorporate percentage of free PSA or its subtypes into multivariable models will likely result in further improvements to such models. Similar to other reports,5,9 this study included a low proportion of black men; thus, it is not known how well the CART model would perform in this population or other racial or ethnic groups. Our model was designed to predict the presence of cancer in men on initial prostate biopsy only and not whether cancer would be detected on subsequent biopsies. Therefore, similar to those patients who have biopsy-negative results, patients who forego a biopsy based on the results of the CART should continue to be monitored. Recent studies have suggested that extended biopsy schemes taking more cores improves the rate of cancer detection.33 Then again, a randomized controlled trial comparing a six-cores with 12-cores biopsy procedure showed no difference in cancer detection rates.34 Although it is possible that extended biopsy schemes may find cancer more frequently, it may also contribute disproportionately to the overdiagnosis of prostate cancer.35 Nonetheless, similar to all studies of prostate cancer diagnosis, this analysis is limited by the bias introduced by false-negative biopsies. This limitation of the biopsy procedure precludes the definitive determination of the true disease status. Therefore, the results of this analysis should be used to aid in predicting the outcome of a biopsy procedure, which is the primary goal of the analysis, and should not be used to predict true disease status.
In summary, CART analysis chose a PSA cutoff level of
The authors indicated no potential conflicts of interest.
Supported by Veterans Affairs Career Development Award (M.G.) and National Institutes of Health grant No. P30 CA 69533 to the Biostatistics Shared Resource of the Oregon Health and Science University Cancer Institute. Authors' disclosures of potential conflicts of interest are found at the end of this article.
1. Polascik TJ, Oesterling JE, Partin AW: Prostate specific antigen: A decade of discoveryWhat we have learned and where we are going. J Urol 162:293-306, 1999[CrossRef][Medline] 2. Berger AP, Spranger R, Kofler K, et al: Early detection of prostate cancer with low PSA cut-off values leads to significant stage migration in radical prostatectomy specimens. Prostate 57:93-98, 2003[CrossRef][Medline] 3. de Koning HJ, Auvinen A, Berenguer Sanchez A, et al: Large-scale randomized prostate cancer screening trials: Program performances in the European Randomized Screening for Prostate Cancer trial and the Prostate, Lung, Colorectal and Ovary cancer trial. Int J Cancer 97:237-244, 2002[CrossRef][Medline] 4. Kranse R, Beemsterboer P, Rietbergen J, et al: Predictors for biopsy outcome in the European Randomized Study of Screening for Prostate Cancer (Rotterdam region). Prostate 39:316-322, 1999[CrossRef][Medline] 5. Catalona WJ, Richie JP, Ahmann FR, et al: Comparison of digital rectal examination and serum prostate specific antigen in the early detection of prostate cancer: Results of a multicenter clinical trial of 6,630 men. J Urol 151:1283-1290, 1994[Medline] 6. Kamoi K, Babaian RJ: Advances in the application of prostate-specific antigen in the detection of early-stage prostate cancer. Semin Oncol 26:140-149, 1999[Medline] 7. Garzotto M, Hudson RG, Peters L, et al: Predictive modeling for the presence of prostate cancer using clinical, laboratory, and ultrasound parameters in patients with PSA levels < or = 10 ng/ml. Cancer 98:1417-1422, 2003[CrossRef][Medline]
8. Catalona WJ, Partin AW, Slawin KM, et al: Use of the percentage of free prostate-specific antigen to enhance differentiation of prostate cancer from benign prostatic disease: A prospective multicenter clinical trial. JAMA 279:1542-1547, 1998
9. Djavan B, Remzi M, Zlotta A, et al: Novel artificial neural network for early detection of prostate cancer. J Clin Oncol 20:921-929, 2002 10. Eastham JA, May R, Robertson JL, et al: Development of a nomogram that predicts the probability of a positive prostate biopsy in men with an abnormal digital rectal examination and a prostate-specific antigen between 0 and 4 ng/mL. Urology 54:709-713, 1999[CrossRef][Medline] 11. Carlson GD, Calvanese CB, Partin AW: An algorithm combining age, total prostate-specific antigen (PSA), and percent free PSA to predict prostate cancer: Results on 4298 cases. Urology 52:455-461, 1998[CrossRef][Medline] 12. Rodriguez LV, Terris MK: Risks and complications of transrectal ultrasound guided prostate needle biopsy: A prospective study and review of the literature. J Urol 160:2115-2120, 1998[CrossRef][Medline] 13. Breiman L, Friedman J, Olshen R, et al: Classification and Regression Trees. Belmont, CA, Wadsworth, 1984 14. Hastie T, Tisbshirani R: Generalized Linear Models. London, United Kingdom, Chapman & Hall, 1990 15. Cooner WH, Mosley BR, Rutherford CL Jr, et al: Prostate cancer detection in a clinical urological practice by ultrasonography, digital rectal examination and prostate specific antigen. J Urol 143:1146-1152, 1990[Medline] 16. Ellis WJ, Chetner MP, Preston SD, et al: Diagnosis of prostatic carcinoma: The yield of serum prostate specific antigen, digital rectal examination and transrectal ultrasonography. J Urol 152:1520-1525, 1994[Medline]
17. Thompson IM, Goodman PJ, Tangen CM, et al: The influence of finasteride on the development of prostate cancer. N Engl J Med 349:215-224, 2003 18. Carter HB, Pearson JD: Prostate-specific antigen velocity and repeated measures of prostate-specific antigen. Urol Clin North Am 24:333-338, 1997[CrossRef][Medline] 19. Vis AN, Kranse R, Roobol M, et al: Serendipity in detecting disease in low prostate-specific antigen ranges. BJU Int 89:384-389, 2002[CrossRef][Medline]
20. Schroder FH, van der Maas P, Beemsterboer P, et al: Evaluation of the digital rectal examination as a screening test for prostate cancer: Rotterdam section of the European Randomized Study of Screening for Prostate Cancer. J Natl Cancer Inst 90:1817-1823, 1998 21. Flanigan RC, Catalona WJ, Richie JP, et al: Accuracy of digital rectal examination and transrectal ultrasonography in localizing prostate cancer. J Urol 152:1506-1509, 1994[Medline] 22. Carvalhal GF, Smith DS, Mager DE, et al: Digital rectal examination for detecting prostate cancer at prostate specific antigen levels of 4 ng./ml. or less. J Urol 161:835-839, 1999[CrossRef][Medline]
23. Punglia RS, D'Amico AV, Catalona WJ, et al: Effect of verification bias on screening for prostate cancer by measurement of prostate-specific antigen. N Engl J Med 349:335-342, 2003 24. Schroder FH, van der Cruijsen-Koeter I, de Koning HJ, et al: Prostate cancer detection at low prostate specific antigen. J Urol 163:806-812, 2000[CrossRef][Medline] 25. Catalona WJ, Smith DS, Ratliff TL, et al: Detection of organ-confined prostate cancer is increased through prostate-specific antigen-based screening. JAMA 270:948-954, 1993[Abstract] 26. Carter HB, Epstein JI, Chan DW, et al: Recommended prostate-specific antigen testing intervals for the detection of curable prostate cancer. JAMA 277:1456-1460, 1997[Abstract] 27. Crawford ED, Chia D, Andriole GL, et al: PSA testing interval reduction in screening intervals: Data from the prostate, lung, colorectal, and ovarian screening trial. Proc Am Soc Clin Oncol 21:2a, 2002 (abstr 4)
28. Lee F, Gray JM, McLeary RD, et al: Prostatic evaluation by transrectal sonography: Criteria for diagnosis of early carcinoma. Radiology 158:91-95, 1986 29. Shinohara K, Wheeler TM, Scardino PT: The appearance of prostate cancer on transrectal ultrasonography: Correlation of imaging and pathological examinations. J Urol 142:76-82, 1989[Medline] 30. Meigs JB, Barry MJ, Oesterling JE, et al: Interpreting results of prostate-specific antigen testing for early detection of prostate cancer. J Gen Intern Med 11:505-512, 1996[Medline] 31. Catalona WJ, Southwick PC, Slawin KM, et al: Comparison of percent free PSA, PSA density, and age-specific PSA cutoffs for prostate cancer detection and staging. Urology 56:255-260, 2000[CrossRef][Medline] 32. Berger AP, Pelzer A, Niecher M, et al: Diagnostic utility of PSA, free PSA and PS density at serum levels below 4.0 compared to total serum levels between 4.1 and 10 ng/ml in an early prostate cancer detection programme. J Urol 169:277a, 2003 (abstr 1077) 33. Eskew LA, Bare RL, McCullough DL: Systematic 5 region prostate biopsy is superior to sextant method for diagnosing carcinoma of the prostate. J Urol 157:199-202, 1997[CrossRef][Medline] 34. Naughton CK, Miller DC, Mager DE, et al: A prospective randomized trial comparing 6 versus 12 prostate biopsy cores: Impact on cancer detection. J Urol 164:388-392, 2000[CrossRef][Medline]
35. Etzioni R, Penson DF, Legler JM, et al: Overdiagnosis due to prostate-specific antigen screening: Lessons from U.S. prostate cancer incidence trends. J Natl Cancer Inst 94:981-990, 2002 Submitted November 21, 2003; accepted October 6, 2004. Related Editorial
This article has been cited by other articles:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|||||||||||
|
Copyright © 2005 by the American Society of Clinical Oncology, Online ISSN: 1527-7755. Print ISSN: 0732-183X
|