|
|||||
|
|
||||||
Journal of Clinical Oncology, Vol 26, No 2 (January 10), 2008: pp. 258-263 © 2008 American Society of Clinical Oncology. DOI: 10.1200/JCO.2007.13.0179 Nodal Stage Classification for Breast Carcinoma: Improving Interobserver Reproducibility Through Standardized Histologic Criteria and Image-Based Training
From the John Wayne Cancer Institute at Saint John's Health Center, Santa Monica; David Geffen School of Medicine, University of California at Los Angeles, Los Angeles; St Jude Medical Center, Fullerton; Orange Coast Memorial Medical Center, Fountain Valley, CA; University of Vermont College of Medicine, Burlington, VT; Bacs-Kiskun County Teaching Hospital, Kecskemet, Hungary; Brigham and Women's Hospital; Beth Israel Deaconess Medical Center, Boston, MA; and European Institute of Oncology and University of Milan, Milan, Italy Corresponding author: Roderick R. Turner, MD, Department of Pathology, Saint John's Health Center, 1328 Twenty-Second St, Santa Monica, CA 90404; e-mail: Roderick.Turner{at}stjohns.org
Purpose Reliable pathologic stage classification of axillary lymph nodes is an important determinant of prognosis and therapeutic decision making for patients with invasive breast cancer. Pathologists' distinction between micrometastasis (pN1mi) and isolated tumor cells [ITC; pN0(i+)] is variable using the American Joint Committee on Cancer (AJCC) Staging Manual (Sixth Edition). We sought to determine whether a set of clearly defined histologic criteria could lead to reproducible nodal classification by pathologists. Patients and Methods Digital images of sentinel lymph node biopsies from 56 patients with small-volume nodal metastases were examined by six experienced breast pathologists (MDs), first as a pre-test, and again as a post-test after studying a training program that outlined and illustrated the classification criteria.
Results Post-test results, after study of the training program, were significantly improved. Compared with the reference MD, agreement improved from 76.2% (pre-test Conclusion Application of current definitions for classification of small-volume nodal metastases are inconsistent, leading to variable classification of ITC and micrometastases. Reproducibility of pathologic nodal stage classification is achievable through study of a training set to clarify the AJCC criteria.
Reliable pathologic staging of the primary tumor and regional lymph nodes is an essential component of treatment planning for patients with invasive breast carcinoma. Axillary sentinel lymph nodes (SLNs) commonly demonstrate small amounts of metastatic carcinoma1 that require distinction between micrometastasis (pN1mi) and isolated tumor cells [ITC; pN0(i+)] in the TNM staging system.2 This distinction influences individual treatment recommendations and population-based studies of surgical and adjuvant therapies. Diagnostic variation in the pathologic stage classification of nodal micrometastasis and ITC is recognized as a problem area because American Joint Committee on Cancer (AJCC) guidelines are complex and unclear.3,4 Practical application has been complicated by the use of different wording in the International Union Against Cancer (UICC) system.5 The most challenging issues pertain to measurements of multiple tumor cell clusters or single cells; their microanatomic location in nodal parenchyma, subcapsular sinus, or perinodal tissues; and the dispersed pattern of lobular carcinoma metastases. A clarification of the AJCC Sixth Edition emphasized that the (i+) designation refers to ITC detected by either hematoxylin-eosin (H-E) or cytokeratin immunohistochemistry (IHC), rather than metastases detected exclusively by IHC.6 However, clear guidelines on how to measure multiple clusters or single cells have not been provided, which has contributed to the wide variation observed in clinical practice. Improvements in diagnostic reproducibility have previously been achieved in several areas of breast pathology through the use of standardized histologic criteria. Successful studies have been reported for ductal proliferative lesions,7 histologic type8 and grade9 of carcinoma, and flat epithelial atypia.10 We hypothesized that a uniform set of histologic criteria and definitions, combined with a well-illustrated training program, could significantly improve interobserver agreement in SLN pathologic stage classification.
Selection of Digital Images A test set of 56 cases with small amounts of metastatic mammary carcinoma in SLNs was compiled as digital images and placed onto a password-protected Web site at the John Wayne Cancer Institute. Thirty-seven cases were obtained from the Pathology Department files at Saint John's Health Center; 19 cases were utilized from an online site (http://breast-pathology.uni-muenster.de/) of a previously published study4 (case numbers 1, 3 to 5, 7, 9, 13, 16, 23, 26, 31 to 33, 35, 37, 39, 43, 45, and 50 in this study). Case selection included both common cases and difficult, challenge cases likely to elicit differences in interpretation, and spanned the spectrum of ITC (single cell to multiple clusters up to 0.2 mm) and micrometastases as single or multiple clusters (0.21 to 2.0 mm). These included a few cases that some observers may interpret as macrometastasis and others that could be interpreted as negative/benign cells only. A calibrated measurement bar was present in each image. The cases included seven lobular carcinomas (cases17, 23, 30, 41, 42, 51, and 56), 17 metastases with ITC in nodal parenchyma (cases 4, 7, 10, 14, 19, 22, 30, 31, 34, 36, 41, 42, 46, 48, 51, 55, and 56), six metastases very close to the 0.20-mm threshold (cases 8, 24, 25, 31, 39, and 40), and two cases with perinodal metastases (cases 5 and 52). The cases had one (n = 34), two (n = 18) or three (n = 4) images. Twenty-nine cases utilized cytokeratin IHC; the other 27 were H-E stains. Forty-one cases had multiple clusters or single cells.
Participant Pathologists and Study Format The same 56 cases were utilized in the post-test after random reordering by the study statistician (D.A.E.). Before the post-test, the MDs were asked to study a training program that included definitions, guidelines, example images, and instructions for classification, then to apply those criteria in the post-test despite any potential disagreement with the criteria. The study coordinator received, tabulated, and submitted the results to the statistician for analysis.
Training Program and Study Criteria
Statistical Analysis To assess the agreement of individual raters with the reference pathologist, we computed the percentage of agreement and the statistic. To assess the agreement among the set of six raters, we computed the multirater agreement (defined as the number of pairwise agreements/number of possible pairs) and the multirater statistic12 before and after training (computed with the MKAPPASC Macro in SPSS [SPSS Inc, Chicago, IL]). The Wilcoxon signed rank test and McNemars test were used to compare the level of agreement ( or proportion) between the pre- and post-tests. Statistical analyses were carried out with S-plus (Version 3.3; Statistical Sciences, Seattle, WA) version 6 (Insightful, Seattle, WA) and SPSS version 13.
Pre-Test Results Agreement between the MDs and the reference pathologist was only moderate in the pre-test, with a group average agreement of 76.2% (range, 57.1 to 96.4, ± 14.8) and a statistic of 0.575 (± 0.25; Table 2). In multirater analysis, the overall percentage of agreement among the six MDs before training was 71.5% (601 of 840), with a multirater of 0.487 (ASE, 0.039). This diagnostic variability was comparable with previously published studies,3,4 and indicated that the test images would provide a rigorous trial for the study criteria.
The pre-test identified discordances in the following areas: ITC in nodal parenchyma; ITC in perinodal lymphatics; metastatic lobular carcinoma with single-cell dispersed pattern; and clusters at or near the 0.20-mm threshold. The seven cases of lobular carcinoma included two with micrometastases (cases 17 and 23) and five with ITC, as determined by the reference MD. The dispersed pattern of lobular carcinoma with micrometastatic clusters (two cases) was frequently diagnosed as macrometastasis (10 of 12; Fig 1), and multiple single cells with ITC clusters were commonly diagnosed as micrometastases (eight of 30; Fig 2). The results for metastatic lobular carcinomas were significantly worse than those for ductal carcinoma cases (P = .03).
Post-Test Results Post-test agreement was greatly improved with average post-test pairwise agreement of 97.3% (± 2.4%) and a statistic of 0.947 (± 0.049). The improvement in percentage of agreement and were both statistically significant (P = .03). Multirater overall percentage of agreement between the six MDs post-training was 95.7% (804 of 840). Multirater was 0.915 (ASE, 0.037), a significant improvement from pre-test by Wilcoxon signed rank test (P < .001). Fifty-one cases (91%) had complete agreement among the six MDs and reference MD on post-test, compared with 20 cases (36%) with complete agreement on the pre-test. The training program criteria led to significant improvement with classification of lobular carcinoma metastases. Post-test agreement between the six MDs and reference MD was 100% (42 of 42), compared with 55% (23 of 42) in the pre-test (P < .001). The dispersed pattern with multiple single cells and small clusters demonstrated excellent concordance (100%) on post-test (Figs 1 and 2). The training program criteria led to significant improvement in agreement for ITC in nodal parenchyma (Fig 3). Post-test agreement between the six MDs and reference MD, for ITC in nodal parenchyma (17 cases), was 100 of 102 (98.0%) compared with 69 of 102 (67.6%) pre-test (P < .001).
One case (case 52) with a few minute clusters of tumor cells in perinodal afferent lymphatic spaces caused substantial discordance on the pre-test (six MDs: three other, one micrometastasis, two ITC) but was uniformly classified as ITC on the post-test. Tumor cell clusters very close to the 0.20-mm threshold were identified as a problem area on pre-test and also showed poor reproducibility on post-test. These were unusually difficult, borderline lesions. Six cases (cases 8, 24, 25, 31, 39, and 40) with clusters from 0.18 to 0.23 mm were discordantly classified compared with the reference MD on both pre-test (31%; 11 of 36) and post-test (25%; eight of 36). Case 31 demonstrated a small cluster difficult to visualize on H-E stain (Fig 4A). It was measured by the reference MD as 0.20 mm and considered an ITC cluster. On pre-test, four MDs favored micrometastasis and two MDs chose ITC. On post-test, two MDs chose micrometastasis and four chose ITC. For case 25 (Fig 4B), the reference MD measured the cluster at 0.205 mm, considered it indeterminate or borderline, and chose the lower category, ITC. On pre-test, four MDs classified it as micrometastasis and two chose ITC. On post-test, three MDs chose micrometastasis and three chose ITC.
Previous studies have demonstrated poor diagnostic reproducibility in the categoric pathologic classification of small-volume nodal metastases as outlined in the AJCC Sixth Edition.3,4 Diagnostic variation is apparent at major institutions and community hospital settings.13 We undertook the current study to determine whether a simple set of uniform histologic criteria presented as a training program could solve the problems with interobserver variation and provide data useful for the AJCC Seventh Edition revisions. We have demonstrated that pathologists provided with a well-illustrated training program can achieve excellent diagnostic agreement in a challenging set of 56 cases. The six MDs showed excellent agreement in the post-test, both in comparison with the reference MD and in multirater analysis, with significant improvement compared with their pre-test results (Wilcoxon signed rank P < .001). Two of the most problematic areas on pre-test, lobular carcinoma metastases and ITC in nodal parenchyma, also showed outstanding concordance on post-test (P < .001). The few persistent discordant classifications observed in the post-test occurred almost exclusively with tumor cell clusters of borderline size near the 0.20-mm threshold. Two cases interpreted by the reference MD as ITC had clusters that measured 0.20 mm (case 31) and 0.205 mm (case 25), respectively, and two cases interpreted as micrometastasis had clusters that measured 0.22 mm (case 8) and 0.23 mm (case 39), respectively. A useful guideline in the AJCC system and the current study training program to prevent overstaging of minimal nodal disease is that a pathologist faced with a borderline finding should select the lower category. Further refinements in these proposed criteria may help to achieve better reproducibility with borderline-size metastases. However, no training program would be expected to resolve minor variations in microscopic measurements,13 and it would not be expected that practicing pathologists be obligated to measure ITC as precisely as was done for this study. The concept of benign transport of epithelial cells to axillary lymph nodes, albeit of genuine concern,14,15 has contributed to diagnostic variation in the pathologic staging of breast cancer. The position taken with this study is that epithelial cells and minute clusters with atypia comparable to the patient's primary invasive carcinoma found in the SLN should be regarded as ITC [pN0(i+)], because histologic features of minimal nodal disease do not permit determination of metastatic potential.16,17 Cytokeratin-positive debris should be excluded and classified as pN0(i–). The current study has focused on interpretive issues for small-volume nodal metastases, but it should be noted that there are other causes of diagnostic variability for SLN biopsies, which include differences in intraoperative pathologic techniques, the number and depth of paraffin section levels examined, and the use of cytokeratin IHC or reverse transcriptase polymerase chain reaction methods.18-23 The widespread use of multiple level sections and IHC in current practice frequently detects small-volume metastases that are of uncertain clinical significance. This study utilized selected digital images to control for all variables except the interpretive component. Actual results in clinical practice with microscopic glass slides may not be quite as good, because the pathologist would first need to detect the small metastases.24
Axillary SLN status, for patients with early-stage invasive breast carcinoma, remains an important determinant of prognosis and a basis for treatment recommendations. At many institutions, patients with negative SLN results, including those with ITC [pN0(i+)], are spared further axillary node dissection or axillary radiation therapy, whereas node-positive patients, including those with micrometastasis (pN1mi), commonly receive further axillary treatment.25 Additionally, the choices for chemotherapy and hormonal therapy may be adjusted on the basis of nodal stage classification. The distinction between ITC and micrometastasis is important in its impact on the total number of positive lymph nodes with respect to categoric nodal classification. Lymph nodes containing ITC only are not included in the decision to classify a breast carcinoma as pN1 (one to three nodes positive), pN2 (four to nine nodes positive), or pN3 ( Arbitrarily set criteria, such as those used in the current study and the pTNM system,2,5 do not necessarily provide the best prognostic separation; but, without diagnostic reproducibility, meaningful prognostic separation is not possible. Although excellent agreement can be reached with clearly defined criteria, pathologists, including some in this study, are reluctant to classify a patient's SLN as negative [ie, pN0(i+)] when it contains numerous dispersed single tumor cells with a collective volume that would exceed a few dozen cohesive tumor cells classified as micrometastasis (pN1mi). This situation requires clarification in future staging manuals, while recognizing that reproducibility is a precondition for the clinical utility of a test result. As proven by our pre-test results and by prior studies3,4 reproducibility in breast cancer nodal stage classification cannot be achieved solely on the basis of the available AJCC/UICC text definitions. We have provided a method to improve on those definitions by including visual examples for a process that requires interpretation of visual input. Reproducibility of pathologic nodal stage classification is achievable through a set of simple and uniform diagnostic criteria presented in a visual training program and reported in this study. In an era with widespread access to electronic archives, illustrative examples would be a logical extension to the AJCC/UICC staging manual.
The author(s) indicated no potential conflicts of interest.
Conception and design: Roderick R. Turner, Susan C. Lester, David A. Elashoff, Stuart J. Schnitt Financial support: Armando E. Giuliano Administrative support: Karen Hirsch Provision of study materials or patients: Roderick R. Turner, Gabor Cserni Collection and assembly of data: Donald L. Weaver, Gabor Cserni, Susan C. Lester, Karen Hirsch, Patrick L. Fitzgibbons, Giuseppe Viale, Giovanni Mazzarol, Julio A. Ibarra Data analysis and interpretation: Roderick R. Turner, Gabor Cserni, Susan C. Lester, David A. Elashoff, Armando E. Giuliano Manuscript writing: Roderick R. Turner, Donald L. Weaver, Gabor Cserni, David A. Elashoff Final approval of manuscript: Roderick R. Turner, Donald L. Weaver, Gabor Cserni, Susan C. Lester, David A. Elashoff, Patrick L. Fitzgibbons, Giuseppe Viale, Giovanni Mazzarol, Julio A. Ibarra, Stuart J. Schnitt, Armando E. Giuliano
Supported by funding from the Ben B. and Joyce E. Eisenberg Foundation (Los Angeles, CA), the Fashion Footwear Association of New York Charitable Foundation (New York, NY), and the Associates for Breast and Prostate Cancer Studies (Santa Monica, CA). Supplemental data are available online at www.jco.org. Authors' disclosures of potential conflicts of interest and author contributions are found at the end of this article.
1. Giuliano AE, Dale PS, Turner RR, et al: Improved axillary staging of breast cancer with sentinel lymphadenectomy. Ann Surg 222:394-401, 1995[Medline] 2. Breast, in Greene FL, Page DL, Fleming ID (eds): AJCC Cancer Staging Manual (ed 6). New York, NY, Springer, 2002, pp 223-240 3. Roberts CA, Beitsch PD, Litz CE, et al: Interpretive disparity among pathologists in breast sentinel lymph node evaluation. Am J Surg 186:324-329, 2003[CrossRef][Medline] 4. Cserni G, Bianchi S, Boecker W, et al: Improving the reproducibility of diagnosing micrometastases and isolated tumor cells. Cancer 103:358-367, 2005[CrossRef][Medline] 5. Sobin LH, Wittekind C International Union Against Cancer TNM Classification of Malignant Tumours (ed 6). New York, New York, Wiley-Liss, 2002 6. Singletary SE, Greene FL, Sobin LH: Classification of isolated tumor cells: Clarification of the 6th edition of the American Joint Committee on Cancer Staging Manual. Cancer 98:2740-2741, 2003[CrossRef][Medline] 7. Schnitt SJ, Connolly JL, Tavassoli FA, et al: Interobserver reproducibility in the diagnosis of ductal proliferative breast lesions using standardized criteria. Am J Surg Pathol 16:1133-1143, 1992[Medline] 8. Longacre TA, Ennis M, Quenneville LA, et al: Interobserver agreement and reproducibility in classification of invasive breast carcinoma: An NCI breast cancer family registry study. Mod Pathol 19:195-207, 2006[CrossRef][Medline] 9. Dalton LW, Page DL, Dupont WD: Histologic grading of breast carcinoma: A reproducibility study. Cancer 73:2765-2770, 1994[CrossRef][Medline] 10. O'Malley FP, Mohsin SK, Badve S, et al: Interobserver reproducibility in the diagnosis of flat epithelial atypia of the breast. Mod Pathol 19:172-179, 2006[CrossRef][Medline] 11. Lester SC: Measuring with the microscope, in Lester SC (ed): Manual of Surgical Pathology (ed 2). China, Elsevier, 2006, pp 186-188 12. Siegel S, Castellan JN: Nonparametric Statistics for the Behavioral Sciences (ed 2). New York, NY, McGraw Hill, 1988 13. Cserni G, Sapino A, Decker T: Discriminating between micrometastases and isolated tumor cells in a regional and institutional setting. The Breast 15:347-354, 2006[CrossRef][Medline] 14. Bleiweiss IJ, Nagi CS, Jaffer S: Axillary sentinel lymph nodes can be falsely positive due to iatrogenic displacement and transport of benign epithelial cells in patients with breast carcinoma. J Clin Oncol 24:2013-2018, 2006 15. Carter BA, Jensen RA, Simpson JF, et al: Benign transport of breast epithelium in axillary lymph nodes after biopsy. Am J Clin Pathol 113:259-265, 2000 16. Schrenk P, Konstantiniuk P, Wolfl S, et al: Prediction of non-sentinel lymph node status in breast cancer with a micrometastatic sentinel node. Br J Surg 92:707-713, 2005[CrossRef][Medline] 17. Rampaul RS, Miremadi A, Pinder SE, et al: Pathological validation and significance of micrometastasis in sentinel nodes in primary breast cancer. Breast Cancer Res 3:113-116, 2001[CrossRef][Medline] 18. Bolster MJ, Bult P, Schapers RFM, et al: Differences in sentinel lymph node pathology protocols lead to differences in surgical strategy in breast cancer patients. Ann Surg Oncol 13:1466-1473, 2006 19. Turner RR, Ollila DW, Stern S, et al: Optimal histopathologic examination of the sentinel lymph node for breast carcinoma staging. Am J Surg Pathol 23:263-267, 1999[CrossRef][Medline] 20. Colleoni M, Rotmensz N, Peruzzotti G, et al: Size of breast cancer metastases in axillary lymph nodes: Clinical relevance of minimal lymph node involvement. J Clin Oncol 23:1379-1389, 2005 21. Cserni G, Gregori D, Merletti F: Meta-analysis of non-sentinel node metastases associated with micrometastatic sentinel nodes in breast cancer. Br J Surg 91:1245-1252, 2004[CrossRef][Medline] 22. Viale G, Mastropasqua MG, Maiorano E, et al: Pathologic examination of the axillary sentinel nodes in patients with early-stage breast carcinoma: Current and resolving controversies on the basis of the European Institute of Oncology experience. Virchows Arch 448:241-247, 2006[CrossRef][Medline] 23. Cserni G, Amendoeira I, Apostolikas N, et al: Discrepancies in current practice of pathological evaluation of sentinel lymph nodes in breast cancer: Results of a questionnaire based survey by the European Working Group for Breast Screening Pathology. J Clin Pathol 57:695-701, 2004 24. Weaver DL, Krag DN, Manna EA, et al: Detection of occult sentinel lymph node micrometastases by immunohistochemistry in breast cancer: An NSABP protocol B-32 quality assurance study. Cancer 107:661-667, 2006[CrossRef][Medline] 25. van Rijk MC, Peterse JL, Nieweg OE, et al: Additional axillary metastases and stage migration in breast cancer patients with micrometastases or submicrometastases in sentinel lymph nodes. Cancer 107:467-471, 2006[CrossRef][Medline] Submitted June 21, 2007; accepted October 11, 2007.
This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|||||||||||
|
Copyright © 2008 by the American Society of Clinical Oncology, Online ISSN: 1527-7755. Print ISSN: 0732-183X
|