Skip Navigation
Skip to contents

JPMPH : Journal of Preventive Medicine and Public Health

OPEN ACCESS
SEARCH
Search

Articles

Page Path
HOME > J Prev Med Public Health > Volume 58(4); 2025 > Article
Original Article
Enhancing TNM Stage Completeness Using the SEER Summary Stage: A Nationwide Study From Korea
Chang Kyun Choi1orcid, Mina Suh2,3orcid, Kyu-Won Jung4,5orcid, E Hwa Yun4,5orcid
Journal of Preventive Medicine and Public Health 2025;58(4):415-421.
DOI: https://doi.org/10.3961/jpmph.25.099
Published online: April 23, 2025
  • 5,653 Views
  • 258 Download

1Division of Cancer Early Detection, National Cancer Control Institute, National Cancer Center, Goyang, Korea

2National Cancer Control Institute, National Cancer Center, Goyang, Korea

3Graduate School of Cancer Science and Policy, National Cancer Center, Goyang, Korea

4Korea Central Cancer Registry, National Cancer Center, Goyang, Korea

5Division of Cancer Registration and Surveillance, National Cancer Control Institute, National Cancer Center, Goyang, Korea

Corresponding author: E Hwa Yun, Korea Central Cancer Registry, National Cancer Center, 323 Ilsan-ro, Ilsandong-gu, Goyang 10408, Korea E-mail: ehwayun@ncc.re.kr
• Received: February 6, 2025   • Revised: March 20, 2025   • Accepted: March 27, 2025

Copyright © 2025 The Korean Society for Preventive Medicine

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

prev next
  • Objectives:
    Accurate and complete cancer staging is essential for effective prognosis and treatment planning. This study investigated the feasibility of enhancing the completeness of tumor‐node‐metastasis (TNM) staging by integrating Surveillance, Epidemiology, and End Results (SEER) Summary Stage data.
  • Methods:
    We analyzed data from 5 cancer types (stomach, colorectum, liver, lung, and breast) in Korea (2012-2017). The study assessed the impact of supplementing missing TNM information with SEER Summary Stage data on both staging completeness and 5‐year relative survival rates.
  • Results:
    The study included 173 061 stomach cancer, 159 199 colorectal cancer, 89 639 liver cancer, 137 103 lung cancer, and 110 286 breast cancer patients. The percentage of missing TNM stage data varied by cancer type, ranging from 65.1% (breast cancer) to 93.0% (liver cancer). Supplementation significantly reduced missing values—most notably in stomach cancer, where missing data dropped by 50.6 percentage points (%p), followed by liver (21.5%p) and breast cancers (13.6%p). For stomach cancer, supplementation led to a 3.6%p decrease in stage I survival rates, whereas liver cancer exhibited the most pronounced changes, with stage IV survival rates declining from 17.7% to 7.9%.
  • Conclusions:
    Integrating SEER Summary Stage data enhances TNM staging completeness. However, further evaluation incorporating treatment information is essential.
Cancer registries serve as indispensable resources for evaluating cancer burden, formulating control policies, and assessing the impact of healthcare advancements and policy interventions. Their utility hinges on the reliability and completeness of the data. In Korea, comprehensive nationwide cancer statistics have been reported since 1999 [1], achieving high levels of completeness in documenting cancer severity information, such as the Surveillance, Epidemiology, and End Results (SEER) Summary Stage.
The tumor-node-metastasis (TNM) staging system, the most extensively utilized clinical tool for cancer staging [2], provides detailed information on tumor size, lymph node involvement, and metastasis and is a critical prognostic indicator. Integrating TNM staging into cancer registries could substantially improve the assessment of cancer screening effectiveness and treatment outcomes, supplying essential data for policy evaluation in screening, treatment, and care. In contrast, the SEER Summary Stage facilitates comparisons of disease severity across different cancer types based on anatomical features [3], but lacks the detailed resolution offered by TNM staging. Although periodic revisions of the TNM system enhance its clinical utility by incorporating diagnostic and treatment advances, these updates also present challenges. Meanwhile, the SEER Summary Stage offers a more consistent framework yet does not provide the depth of insight achieved with TNM staging.
Multiple methods have been employed to enhance the completeness of cancer severity data. For global comparisons [4-6], TNM variables are often used to generate SEER Summary Stage data. However, the inverse process of imputing TNM variables based on SEER Summary Stage information has been underexplored, despite some investigations into imputation strategies using other clinical variables and data sources [7-9].
To bridge this gap, we evaluated the impact of augmenting TNM variables with SEER Summary Stage data from the Korean Cancer Registry. We focused on 5 major cancer types—stomach, colorectal, liver, lung, and breast—and analyzed how these supplementation strategies influenced 5‐year relative survival rates.
Data Source
Our study used data from the Korea National Cancer Incidence Database (KNCID) for 5 major cancer types (liver, colon, stomach, breast, and lung) diagnosed in Korea between 2012 and 2017 [1]. The KNCID, a population‐based cancer registry, collects data from the Korea Central Cancer Registry (KCCR) and 11 regional cancer registries. By incorporating cases not captured in hospital‐based registries, the KNCID achieved an impressive completeness rate of 98.2% in 2021 [1].
Initially, the database contained 173 084 stomach, 159 245 colorectal, 89 693 liver, 137 169 lung, and 110 292 female breast cancer cases. After excluding patients with uncertain diagnosis dates or in situ cancer, the final study population included 173 061 stomach, 159 199 colorectal, 89 639 liver, 137 103 lung, and 110 286 female breast cancer patients. Collection of SEER Summary Stage data is mandatory, whereas TNM information is gathered on an optional basis.
Tumor-node-metastasis Stage Classification
Using the seventh edition of the TNM manual [2], we converted the T, N, and M classifications into TNM stages (I, II, III, and IV) following site‐specific guidelines. For cases lacking TNM information, we directly assigned a predetermined stage using reverse logic that began with the most advanced tumor progression. Cases without classifiable TNM data were designated as “missing.”
Surveillance, Epidemiology, and End Results Summary Stage
The SEER Summary Stage categorizes cancer severity into several stages: in situ, localized, regional by direct extension only, regional with lymph node involvement only, regional with both direct extension and lymph node involvement, regional not otherwise specified (NOS), distant, and unknown.
Supplementation of Missing Tumor-node-metastasis Stage Information
We applied assumptions for missing T, N, and M values and devised two methods to determine the TNM stage. The first method defined the stage solely using available TNM data. The second method supplemented missing TNM components with SEER Summary Stage data, following approaches adopted by the International Cancer Benchmarking Partnership to improve stage data comparability and completeness across international registries [4-6]. Detailed assumptions for each cancer type and the corresponding supplementation strategies are provided in Supplemental Material 1.
Statistical Analysis
We calculated the case counts and proportions with available TNM stage data for the approaches. For each cancer site, we estimated 5‐year relative survival rates for all cases diagnosed during the study period up to 2022, including those with missing stages. Relative survival rates and their 95% confidence intervals (CIs) were computed using the Ederer II method via the ‘relsurv’ package in R [10]. The survival calculations incorporated age (in single-year increments), sex, and year of cancer registration as covariates. To evaluate the effect of supplementation on survival estimates, we compared 5‐year relative survival rates before and after supplementation for each TNM stage and cancer type. Statistical analyses were performed using SAS version 9.3 (SAS Institute Inc., Cary, NC, USA) and R version 4.4.1 (R Foundation for Statistical Computing, Vienna, Austria).
Ethics Statement
This study was reviewed and approved by the Institutional Review Board (IRB) of the National Cancer Center (IRB No. NCC 2024-0142). The data were deidentified, and the consent requirements were waived by the IRB.
Table 1 presents the distribution of TNM staging variables and SEER Summary Stage data across the 5 cancer types. Missing data varied considerably among cancer types, with liver cancer exhibiting the highest rates of missing TNM data (90.7% for T stage, 93.8% for N stage, and 93.5% for M stage). In contrast, breast cancer had the lowest missing rates (55.3% for T stage, 57.0% for N stage, and 65.5% for M stage). SEER Summary Stage data were substantially more complete, with unknown stages ranging from 3.0% in breast cancer to 14.1% in liver cancer—significantly lower than the missing rates for TNM variables.
Table 2 shows the effects of supplementing missing TNM staging variables using SEER Summary Stage data. The improvement in TNM completeness varied considerably by cancer type. Stomach cancer exhibited the greatest reduction in missing TNM data, decreasing from 73.8% to 23.2% after supplementation, with most previously missing cases reassigned to stage I (a 47.0 percentage point [%p] increase). Liver cancer also showed notable improvement, with missing proportions declining from 93.0% to 71.5%, primarily due to reassignment to stage IV (an 18.3%p increase). Breast cancer experienced a moderate reduction, with missing proportions decreasing from 65.1% to 51.5%. In contrast, lung and colorectal cancers showed relatively modest improvements, with reductions of 5.1%p and 4.5%p, respectively.
Table 3 compares the 5‐year relative survival rates across cancer types before and after supplementation of missing TNM stage variables using SEER Summary Stage data. Despite the substantial reassignment of missing TNM cases in stomach cancer, the changes in survival rates were modest, with the largest being a 3.6%p decrease for stage I (from 106.1 to 102.5%). Liver cancer demonstrated significant changes, particularly for stage IV, where survival rates dropped markedly from 17.7% to 7.9%, and for stage III, which increased by approximately 4%p. Additionally, survival rates for cases with missing TNM stages in liver cancer increased by 6.6%p, from 38.5% to 45.1%. Breast cancer exhibited a notable decrease in stage IV survival rates (from 47.8 to 43.5%) and a slight increase for missing stages (from 94.2 to 96.9%). Lung and colorectal cancers, which underwent minimal TNM reassignment, exhibited minor survival rate changes of less than 1%p across all stages.
This study represents the first nationwide analysis of TNM stage-specific survival rates using a Korean Cancer Registry. Our findings underscore that supplementing TNM data with SEER Summary Stage information improves data completeness while introducing only minimal changes to survival estimates.
These results align with previous studies suggesting that supplemented stage data can approximate the survival trends observed in directly staged cohorts [7-9]. However, it is important to acknowledge that those studies focused predominantly on prostate cancer, thereby limiting their generalizability. This narrow focus underscores the need for cancer-specific supplementation strategies, as the impact of supplementation varies significantly among cancer types. Expanding the range of variables collected by cancer registries to improve supplementation accuracy poses a considerable challenge; the additional burden on registry resources and personnel must be weighed against the benefits of obtaining more comprehensive data. Therefore, it is crucial to develop targeted approaches for acquiring missing information [11-13], that prioritize the most critical variables for each cancer type while minimizing the burden on registry systems.
Our evaluation of TNM supplementation using SEER Summary Stage data is situated within broader efforts to enhance cancer registry data quality. A recent study by Engholm et al. [14] examined how various assumptions for individual TNM components affected overall TNM staging using Nordic cancer registry data. Although our focus was on supplementing missing TNM information to improve data completeness, Engholm et al. [14] investigated the impact of different hierarchical rules and assumptions on TNM stage distribution. Their findings, which generally showed minor differences in stage distribution, are consistent with our observations that supplementation resulted in relatively modest changes in stage-specific survival estimates for several cancer types. Our approach—leveraging SEER Summary Stage data to supplement missing TNM information—offers a pragmatic solution when direct TNM data are unavailable, a common challenge in many registries.
Despite the improvements from supplementation, some survival estimates remain unstable or potentially biased. For instance, relative survival rates for stage I stomach, colorectal, and breast cancers exceeding 100% may reflect limitations of the Ederer II method used for estimation. A previous study in Sweden indicated that achieving stable relative survival estimates requires follow-up periods of at least 7 years [15]. Additionally, differences in patient characteristics and socioeconomic status, which can influence early cancer detection, might also affect outcomes [16-18]. Further research is necessary to determine the significance of these factors.
Liver cancer emerged as a notable exception, with supplementation-induced changes in 5‐year survival rates being considerably larger—particularly for stage IV, where there was a 9.8%p difference. This discrepancy is accentuated by the fact that only 7% of liver cancer cases had sufficient TNM data for staging before supplementation, compared with over 25% for other cancers. The extreme scarcity of TNM data in liver cancer suggests that relative survival rates derived solely from available data may be inaccurate. The low collection rate of TNM variables for liver cancer appears to be linked to its treatment landscape. Other registries have similarly reported high rates of missing data for liver cancer [19,20]. Unlike many cancers where surgical resection is common, liver cancer management often involves non-surgical treatments such as radiofrequency ablation or transarterial chemoembolization [21]. This reduced reliance on surgical procedures leads to fewer pathological reports, which are critical for staging in cancer registries. Moreover, the complex relationship between liver cancer and underlying conditions such as cirrhosis further complicates staging and treatment decisions [21-23].
Our study had several limitations that merit attention. First, our supplementation process did not distinguish between clinical and pathological TNM staging. Although this limitation may not have significantly influenced our results—a sensitivity analysis by Engholm et al. [14] demonstrated minimal differences when comparing the prioritization of pTNM and the inclusion of cTNM—the high completeness in reporting histologic types by the KCCR [1], suggests that pathology report information substantially contributed to determining both TNM and SEER Summary Stages in our study. Second, our analysis did not account for variations across histological types. This limitation arises from our primary focus on assessing the feasibility of supplementing TNM information, rather than exploring differences among histological variants. As such, our goal was to demonstrate the general potential of this approach, not to develop cancer-specific strategies. Future research should aim to develop and validate supplementation strategies tailored to specific cancer types. We anticipate that subsequent studies employing comprehensive staging surveys [24-26], which will include TNM information, SEER Summary Stage, and treatment data, will facilitate a more detailed evaluation of supplementation strategies for individual cancers. Third, we were unable to statistically evaluate the equivalence of relative survival rates before and after supplementation due to the lack of established statistical methods and benchmarks in the literature. Instead, consistent with previous studies [4-6,14], we assessed only the magnitude of the observed changes in survival rates. Fourth, our study has an additional limitation regarding the reliability and consistency of directly collected TNM stages compared to those derived via supplementation. Specifically, TNM stage data lacking detailed T, N, or M components were used only when supplementation through SEER Summary Stage or other sources was not feasible. However, we prioritized SEER Summary Stage data, as it is mandatorily collected by KCCR, assuming it to be more reliable than directly collected TNM stages. Consequently, further validation studies are needed to systematically evaluate the reliability of TNM stages generated through reverse logic based on TNM variables relative to those collected directly.
In conclusion, enhancing TNM stage completeness using the SEER Summary Stage appears feasible. Improved completeness of TNM staging and the resulting clinically relevant statistics may contribute to more effective treatment planning and informed policy formulation. Future research should consider incorporating additional data sources to evaluate broader impacts on cancer outcomes.
Supplemental material is available at https://doi.org/10.3961/jpmph.25.099.

Conflict of Interest

The authors have no conflicts of interest associated with the material presented in this paper.

Funding

This study was supported by the Intramural Research Program of the National Cancer Center Korea (NCC-2332570), and other research grants from the National Cancer Center Korea (NCC-2211820 and NCC-2410700).

Acknowledgements

None.

Author Contributions

Conceptualization: Choi CK, Yun EH. Data curation: Jung KW, Yun EH. Formal analysis: Choi CK, Yun EH. Funding acquisition: Choi CK, Yun EH. Writing – original draft: Choi CK, Yun EH. Writing – review & editing: Choi CK, Suh M, Jung KW, Yun EH.

Table 1.
Distribution of TNM and SEER variables across five cancer types in Korea (2012-2017)
Variables Stomach cancer Colorectal cancer Liver cancer Lung cancer Breast cancer
Total (n) 173 061 159 199 89 639 137 103 110 286
Death 59 928 (34.6) 58 623 (36.8) 63 661 (71.0) 102 621 (74.8) 12 081 (11.0)
Duration off follow-up (y) 5.3±3.0 5.3±3.0 3.1±3.0 2.8±2.9 6.5±2.1
Age (y) 63.9±12.6 64.9±12.8 63.3±12.4 68.7±11.1 52.3±11.5
Sex
 Male 116 525 (67.3) 95 182 (59.8) 67 252 (75.0) 95 872 (69.9) 0 (0)
 Female 56 536 (32.7) 64 017 (40.2) 22 387 (25.0) 41 231 (30.1) 110 286 (100)
T stage
 0 261 (0.2) 745 (0.5) 243 (0.3) 604 (0.4) 236 (0.2)
 1 42 826 (24.7) 9705 (6.1) 2679 (3.0) 13 956 (10.2) 27 205 (24.7)
 2 6310 (3.6) 7687 (4.8) 3250 (3.6) 17 164 (12.5) 18 162 (16.5)
 3 8358 (4.8) 33 923 (21.3) 1564 (1.7) 8431 (6.1) 2667 (2.4)
 4 10 453 (6.0) 10 523 (6.6) 596 (0.7) 8935 (6.5) 991 (0.9)
 Any 73 (0) 108 (0.1) 47 (0.1) 251 (0.2) 20 (0)
 Not available 104 780 (60.5) 96 508 (60.6) 81 260 (90.7) 87 762 (64.0) 61 005 (55.3)
N stage
 0 39 144 (22.6) 29 149 (18.3) 4798 (5.4) 20 208 (14.7) 29 288 (26.6)
 1 6280 (3.6) 16 682 (10.5) 699 (0.8) 5257 (3.8) 11 476 (10.4)
 2 5566 (3.2) 12 021 (7.6) 35 (0) 10 439 (7.6) 4120 (3.7)
 3 8062 (4.7) 144 (0.1) 17 (0) 11 778 (8.6) 2554 (2.3)
 Any 78 (0) 109 (0.1) 49 (0.1) 270 (0.2) 17 (0)
 Not available 113 931 (65.8) 101 094 (63.5) 84 041 (93.8) 89 151 (65.0) 62 831 (57.0)
M stage
 0 39 968 (23.1) 34 164 (21.5) 4567 (5.1) 24 357 (17.8) 36 814 (33.4)
 1 4969 (2.9) 7841 (4.9) 1216 (1.4) 18 932 (13.8) 1232 (1.1)
 Not available 128 124 (74.0) 117 194 (73.6) 83 856 (93.5) 93 814 (68.4) 72 240 (65.5)
SEER Summary Stage
 Localized 108 698 (62.8) 58 866 (37.0) 41 221 (46.0) 30 796 (22.5) 64 403 (58.4)
 Regional by direct extension only 5931 (3.4) 24 840 (15.6) 17 286 (19.3) 10 517 (7.7) 2018 (1.8)
 Regional lymph nodes involved only 15 353 (8.9) 11 042 (6.9) 1697 (1.9) 13 424 (9.8) 30 829 (28.0)
 Regional by both direct extension and lymph node involvement 14 948 (8.6) 29 703 (18.7) 2658 (3.0) 11 790 (8.6) 4124 (3.7)
 Regional, NOS 129 (0.1) 112 (0.1) 83 (0.1) 115 (0.1) 65 (0.1)
 Distant 19 059 (11.0) 25 675 (16.1) 14 084 (15.7) 59 389 (43.3) 5561 (5.0)
 Unknown 8943 (5.2) 8961 (5.6) 12 610 (14.1) 11 072 (8.1) 3286 (3.0)

Values are presented as mean±standard deviation or number (%).

TNM, tumor-node-metastasis; SEER, Surveillance, Epidemiology, and End Results; T, tumor size; N, nodal spread; M, distant metastasis; NOS, not otherwise specified.

Table 2.
Impact of supplementing the missing TNM stage using the SEER Summary Stage across different cancer sites
Cancer sites TNM stage TNM TNM+SEER Δ%p
Stomach cancer 0 17 (0) 37 (0) 0
I 28 650 (16.6) 110 025 (63.6) 47.0
II 5134 (3.0) 7827 (4.5) 1.5
III 6329 (3.7) 9790 (5.7) 2.0
IV 5199 (3.0) 5194 (3.0) 0
Missing 127 732 (73.8) 40 188 (23.2) -50.6
Colorectal cancer 0 10 (0) 0 (0) 0
I 9536 (6.0) 14 866 (9.3) 3.3
II 10 781 (6.8) 12 563 (7.9) 1.1
III 14 361 (9.0) 14 440 (9.1) 0.1
IV 8027 (5.0) 8027 (5.0) 0
Missing 116 484 (73.2) 109 303 (68.7) -4.5
Liver cancer 0 9 (0) 11 (0) 0
I 1555 (1.7) 2617 (2.9) 1.2
II 1842 (2.1) 2929 (3.3) 1.2
III 896 (1.0) 1588 (1.8) 0.8
IV 1966 (2.2) 18 417 (20.5) 18.3
Missing 83 371 (93.0) 64 077 (71.5) -21.5
Lung cancer 0 22 (0) 28 (0) 0
I 10 322 (7.5) 14 233 (10.4) 2.9
II 3826 (2.8) 5021 (3.7) 0.9
III 8953 (6.5) 10 931 (8.0) 1.5
IV 19 473 (14.2) 19 472 (14.2) 0
Missing 94 507 (68.9) 87 418 (63.8) -5.1
Breast cancer 0 68 (0.1) 94 (0.1) 0
I 16 455 (14.9) 21 554 (19.5) 4.6
II 14 882 (13.5) 19 373 (17.6) 4.1
III 5795 (5.3) 7169 (6.5) 1.2
IV 1328 (1.2) 5267 (4.8) 3.6
Missing 71 758 (65.1) 56 829 (51.5) -13.6

Values are presented as number (%).

TNM, tumor-node-metastasis; SEER, Surveillance, Epidemiology, and End Results; %p, percentage point.

Table 3.
Comparison of 5-year relative survival rates before and after TNM supplementation using the SEER Summary Stage
Cancer sites TNM stage TNM TNM+SEER
Stomach cancer I 106.1 (105.7, 106.5) 102.5 (102.2, 102.7)
II 86.8 (85.4, 88.4) 87.3 (86.1, 88.5)
III 50.8 (49.4, 52.3) 50.3 (49.1, 51.5)
IV 15.2 (14.1, 16.3) 15.2 (14.1, 16.3)
Missing 79.9 (79.6, 80.3) 38.9 (38.4, 39.5)
Colorectal cancer I 106.7 (105.9, 107.5) 106.6 (105.9, 107.2)
II 100.1 (99.1, 101.1) 100 (99.0, 100.9)
III 84.0 (83.1, 85.0) 84.0 (83.0, 84.9)
IV 28.2 (27.2, 29.3) 28.2 (27.2, 29.3)
Missing 93.9 (93.4, 94.4) 93.2 (92.7, 93.7)
Liver cancer I 86.9 (84.5, 89.4) 88.6 (86.7, 90.5)
II 77.9 (75.5, 80.3) 76.9 (74.9, 78.9)
III 39.7 (36.3, 43.5) 43.8 (41.1, 46.7)
IV 17.7 (16.0, 19.6) 7.9 (7.5, 8.3)
Missing 38.5 (38.2, 38.9) 45.1 (44.6, 45.5)
Lung cancer I 91.2 (90.2, 92.2) 92.0 (91.2, 92.9)
II 61.8 (59.9, 63.9) 61.3 (59.6, 63.1)
III 33.4 (32.3, 34.6) 32.7 (31.7, 33.7)
IV 15.0 (14.5, 15.6) 15.0 (14.5, 15.6)
Missing 32.2 (31.8, 32.5) 29.1 (28.7, 29.4)
Breast cancer I 101.4 (101.2, 101.7) 101.5 (101.3, 101.7)
II 98.6 (98.3, 99.0) 98.6 (98.2, 98.9)
III 87.4 (86.4, 88.4) 87.9 (87.0, 88.8)
IV 47.8 (45.1, 50.7) 43.5 (42.1, 44.9)
Missing 94.2 (94.0, 94.4) 96.9 (96.7, 97.1)

Values are presented as % (95% confidence interval).

TNM, tumor-node-metastasis; SEER, Surveillance, Epidemiology, and End Results.

Figure & Data

References

    Citations

    Citations to this article as recorded by  

      Enhancing TNM Stage Completeness Using the SEER Summary Stage: A Nationwide Study From Korea
      Enhancing TNM Stage Completeness Using the SEER Summary Stage: A Nationwide Study From Korea
      Variables Stomach cancer Colorectal cancer Liver cancer Lung cancer Breast cancer
      Total (n) 173 061 159 199 89 639 137 103 110 286
      Death 59 928 (34.6) 58 623 (36.8) 63 661 (71.0) 102 621 (74.8) 12 081 (11.0)
      Duration off follow-up (y) 5.3±3.0 5.3±3.0 3.1±3.0 2.8±2.9 6.5±2.1
      Age (y) 63.9±12.6 64.9±12.8 63.3±12.4 68.7±11.1 52.3±11.5
      Sex
       Male 116 525 (67.3) 95 182 (59.8) 67 252 (75.0) 95 872 (69.9) 0 (0)
       Female 56 536 (32.7) 64 017 (40.2) 22 387 (25.0) 41 231 (30.1) 110 286 (100)
      T stage
       0 261 (0.2) 745 (0.5) 243 (0.3) 604 (0.4) 236 (0.2)
       1 42 826 (24.7) 9705 (6.1) 2679 (3.0) 13 956 (10.2) 27 205 (24.7)
       2 6310 (3.6) 7687 (4.8) 3250 (3.6) 17 164 (12.5) 18 162 (16.5)
       3 8358 (4.8) 33 923 (21.3) 1564 (1.7) 8431 (6.1) 2667 (2.4)
       4 10 453 (6.0) 10 523 (6.6) 596 (0.7) 8935 (6.5) 991 (0.9)
       Any 73 (0) 108 (0.1) 47 (0.1) 251 (0.2) 20 (0)
       Not available 104 780 (60.5) 96 508 (60.6) 81 260 (90.7) 87 762 (64.0) 61 005 (55.3)
      N stage
       0 39 144 (22.6) 29 149 (18.3) 4798 (5.4) 20 208 (14.7) 29 288 (26.6)
       1 6280 (3.6) 16 682 (10.5) 699 (0.8) 5257 (3.8) 11 476 (10.4)
       2 5566 (3.2) 12 021 (7.6) 35 (0) 10 439 (7.6) 4120 (3.7)
       3 8062 (4.7) 144 (0.1) 17 (0) 11 778 (8.6) 2554 (2.3)
       Any 78 (0) 109 (0.1) 49 (0.1) 270 (0.2) 17 (0)
       Not available 113 931 (65.8) 101 094 (63.5) 84 041 (93.8) 89 151 (65.0) 62 831 (57.0)
      M stage
       0 39 968 (23.1) 34 164 (21.5) 4567 (5.1) 24 357 (17.8) 36 814 (33.4)
       1 4969 (2.9) 7841 (4.9) 1216 (1.4) 18 932 (13.8) 1232 (1.1)
       Not available 128 124 (74.0) 117 194 (73.6) 83 856 (93.5) 93 814 (68.4) 72 240 (65.5)
      SEER Summary Stage
       Localized 108 698 (62.8) 58 866 (37.0) 41 221 (46.0) 30 796 (22.5) 64 403 (58.4)
       Regional by direct extension only 5931 (3.4) 24 840 (15.6) 17 286 (19.3) 10 517 (7.7) 2018 (1.8)
       Regional lymph nodes involved only 15 353 (8.9) 11 042 (6.9) 1697 (1.9) 13 424 (9.8) 30 829 (28.0)
       Regional by both direct extension and lymph node involvement 14 948 (8.6) 29 703 (18.7) 2658 (3.0) 11 790 (8.6) 4124 (3.7)
       Regional, NOS 129 (0.1) 112 (0.1) 83 (0.1) 115 (0.1) 65 (0.1)
       Distant 19 059 (11.0) 25 675 (16.1) 14 084 (15.7) 59 389 (43.3) 5561 (5.0)
       Unknown 8943 (5.2) 8961 (5.6) 12 610 (14.1) 11 072 (8.1) 3286 (3.0)
      Cancer sites TNM stage TNM TNM+SEER Δ%p
      Stomach cancer 0 17 (0) 37 (0) 0
      I 28 650 (16.6) 110 025 (63.6) 47.0
      II 5134 (3.0) 7827 (4.5) 1.5
      III 6329 (3.7) 9790 (5.7) 2.0
      IV 5199 (3.0) 5194 (3.0) 0
      Missing 127 732 (73.8) 40 188 (23.2) -50.6
      Colorectal cancer 0 10 (0) 0 (0) 0
      I 9536 (6.0) 14 866 (9.3) 3.3
      II 10 781 (6.8) 12 563 (7.9) 1.1
      III 14 361 (9.0) 14 440 (9.1) 0.1
      IV 8027 (5.0) 8027 (5.0) 0
      Missing 116 484 (73.2) 109 303 (68.7) -4.5
      Liver cancer 0 9 (0) 11 (0) 0
      I 1555 (1.7) 2617 (2.9) 1.2
      II 1842 (2.1) 2929 (3.3) 1.2
      III 896 (1.0) 1588 (1.8) 0.8
      IV 1966 (2.2) 18 417 (20.5) 18.3
      Missing 83 371 (93.0) 64 077 (71.5) -21.5
      Lung cancer 0 22 (0) 28 (0) 0
      I 10 322 (7.5) 14 233 (10.4) 2.9
      II 3826 (2.8) 5021 (3.7) 0.9
      III 8953 (6.5) 10 931 (8.0) 1.5
      IV 19 473 (14.2) 19 472 (14.2) 0
      Missing 94 507 (68.9) 87 418 (63.8) -5.1
      Breast cancer 0 68 (0.1) 94 (0.1) 0
      I 16 455 (14.9) 21 554 (19.5) 4.6
      II 14 882 (13.5) 19 373 (17.6) 4.1
      III 5795 (5.3) 7169 (6.5) 1.2
      IV 1328 (1.2) 5267 (4.8) 3.6
      Missing 71 758 (65.1) 56 829 (51.5) -13.6
      Cancer sites TNM stage TNM TNM+SEER
      Stomach cancer I 106.1 (105.7, 106.5) 102.5 (102.2, 102.7)
      II 86.8 (85.4, 88.4) 87.3 (86.1, 88.5)
      III 50.8 (49.4, 52.3) 50.3 (49.1, 51.5)
      IV 15.2 (14.1, 16.3) 15.2 (14.1, 16.3)
      Missing 79.9 (79.6, 80.3) 38.9 (38.4, 39.5)
      Colorectal cancer I 106.7 (105.9, 107.5) 106.6 (105.9, 107.2)
      II 100.1 (99.1, 101.1) 100 (99.0, 100.9)
      III 84.0 (83.1, 85.0) 84.0 (83.0, 84.9)
      IV 28.2 (27.2, 29.3) 28.2 (27.2, 29.3)
      Missing 93.9 (93.4, 94.4) 93.2 (92.7, 93.7)
      Liver cancer I 86.9 (84.5, 89.4) 88.6 (86.7, 90.5)
      II 77.9 (75.5, 80.3) 76.9 (74.9, 78.9)
      III 39.7 (36.3, 43.5) 43.8 (41.1, 46.7)
      IV 17.7 (16.0, 19.6) 7.9 (7.5, 8.3)
      Missing 38.5 (38.2, 38.9) 45.1 (44.6, 45.5)
      Lung cancer I 91.2 (90.2, 92.2) 92.0 (91.2, 92.9)
      II 61.8 (59.9, 63.9) 61.3 (59.6, 63.1)
      III 33.4 (32.3, 34.6) 32.7 (31.7, 33.7)
      IV 15.0 (14.5, 15.6) 15.0 (14.5, 15.6)
      Missing 32.2 (31.8, 32.5) 29.1 (28.7, 29.4)
      Breast cancer I 101.4 (101.2, 101.7) 101.5 (101.3, 101.7)
      II 98.6 (98.3, 99.0) 98.6 (98.2, 98.9)
      III 87.4 (86.4, 88.4) 87.9 (87.0, 88.8)
      IV 47.8 (45.1, 50.7) 43.5 (42.1, 44.9)
      Missing 94.2 (94.0, 94.4) 96.9 (96.7, 97.1)
      Table 1. Distribution of TNM and SEER variables across five cancer types in Korea (2012-2017)

      Values are presented as mean±standard deviation or number (%).

      TNM, tumor-node-metastasis; SEER, Surveillance, Epidemiology, and End Results; T, tumor size; N, nodal spread; M, distant metastasis; NOS, not otherwise specified.

      Table 2. Impact of supplementing the missing TNM stage using the SEER Summary Stage across different cancer sites

      Values are presented as number (%).

      TNM, tumor-node-metastasis; SEER, Surveillance, Epidemiology, and End Results; %p, percentage point.

      Table 3. Comparison of 5-year relative survival rates before and after TNM supplementation using the SEER Summary Stage

      Values are presented as % (95% confidence interval).

      TNM, tumor-node-metastasis; SEER, Surveillance, Epidemiology, and End Results.


      JPMPH : Journal of Preventive Medicine and Public Health
      TOP