Bray, F. et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 74, 229–263 (2024).
Google Scholar
Han, B. et al. Cancer incidence and mortality in China, 2022. J. Natl Cancer Cent. 4, 47–53 (2024).
Google Scholar
Liu, C. et al. Population-level economic burden of lung cancer in China: provisional prevalence-based estimations, 2017-2030. Chin. J. Cancer Res. 33, 79–92 (2021).
Google Scholar
Zeng, H. et al. Disparities in stage at diagnosis for five common cancers in China: a multicentre, hospital-based, observational study. Lancet Public Health 6, e877–e887 (2021).
Google Scholar
Slatore, C. G. & Wiener, R. S. Pulmonary nodules: a small problem for many, severe distress for some, and how to communicate about it. Chest 153, 1004–1015 (2018).
Google Scholar
Gould, M. et al. Recent trends in the identification of incidental pulmonary nodules. Am. J. Respir. Crit. Care Med. 192, 1208–1214 (2015).
Google Scholar
Li, N. et al. One-off low-dose CT for lung cancer screening in China: a multicentre, population-based, prospective cohort study. Lancet Respir. Med. 10, 378–391 (2022).
Google Scholar
Edelman Saul, E. et al. The challenges of implementing low-dose computed tomography for lung cancer screening in low- and middle-income countries. Nat. Cancer 1, 1140–1152 (2020).
Google Scholar
Alexander, R. et al. Mandating limits on workload, duty, and speed in radiology. Radiology 304, 274–282 (2022).
Google Scholar
Mazzone, P. J. & Lam, L. Evaluating the patient with a pulmonary nodule: a review. JAMA 327, 264–273 (2022).
Google Scholar
van Riel, S. J. et al. Observer variability for classification of pulmonary nodules on low-dose CT images and its effect on nodule management. Radiology 277, 863–871 (2015).
Google Scholar
Nair, A. et al. Variable radiological lung nodule evaluation leads to divergent management recommendations. Eur. Respir. J. 52, 1801359 (2018).
Yuan, J., Xu, F., Ren, H., Chen, M. & Feng, S. Distress and its influencing factors among Chinese patients with incidental pulmonary nodules: a cross-sectional study. Sci. Rep. 14, 1189 (2024).
Google Scholar
Cui, X. et al. Comparison of Veterans Affairs, Mayo, Brock classification models and radiologist diagnosis for classifying the malignancy of pulmonary nodules in Chinese clinical population. Transl. Lung Cancer Res. 8, 605–613 (2019).
Google Scholar
Vachani, A. et al. The probability of lung cancer in patients with incidentally detected pulmonary nodules: clinical characteristics and accuracy of prediction models. Chest 161, 562–571 (2022).
Google Scholar
Lv, W. et al. Development and validation of a clinically applicable deep learning strategy (HONORS) for pulmonary nodule classification at CT: a retrospective multicentre study. Lung Cancer 155, 78–86 (2021).
Google Scholar
Venkadesh, K. V. et al. Deep learning for malignancy risk estimation of pulmonary nodules detected at low-dose screening CT. Radiology 300, 438–447 (2021).
Google Scholar
Massion, P. P. et al. Assessing the accuracy of a deep learning method to risk stratify indeterminate pulmonary nodules. Am. J. Respir. Crit. Care Med. 202, 241–249 (2020).
Google Scholar
Zhang, R. et al. Deep learning for malignancy risk estimation of incidental sub-centimeter pulmonary nodules on CT images. Eur. Radiol. 34, 4218–4229 (2024).
Google Scholar
Schreuder, A., Scholten, E. T., van Ginneken, B. & Jacobs, C. Artificial intelligence for detection and characterization of pulmonary nodules in lung cancer CT screening: ready for practice? Transl. Lung Cancer Res. 10, 2378–2388 (2021).
Google Scholar
Ardila, D. et al. End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat. Med. 25, 954–961 (2019).
Google Scholar
Wang, C. et al. Data-driven risk stratification and precision management of pulmonary nodules detected on chest computed tomography. Nat. Med. 30, 3184–3195 (2024).
Li, D. et al. The performance of deep learning algorithms on automatic pulmonary nodule detection and classification tested on different datasets that are not derived from LIDC-IDRI: a systematic review. Diagnostics 9, 207 (2019).
Chang, T.G., Park, S., Schaffer, A.A., Jiang, P. & Ruppin, E. Hallmarks of artificial intelligence contributions to precision oncology. Nat. Cancer 6, 417–431 (2025).
Dosovitskiy, A. et al. An image is worth 16×16 words: transformers for image recognition at scale. In International Conference on Learning Representations (ICLR 2021) https://openreview.net/pdf?id=YicbFdNTTy (ICLR, 2021).
Zhao, G., Feng, Q., Chen, C., Zhou, Z. & Yu, Y. Diagnose like a radiologist: hybrid neuro-probabilistic reasoning for attribute-based medical image diagnosis. IEEE Trans. Pattern Anal. Mach. Intell. 44, 7400–7416 (2022).
Google Scholar
Obuchowski, N. A. & Bullen, J. Multireader diagnostic accuracy imaging studies: fundamentals of design and analysis. Radiology 303, 26–34 (2022).
Google Scholar
Brady, A. P. et al. Developing, purchasing, implementing and monitoring AI tools in radiology: practical considerations. A multi-society statement from the ACR, CAR, ESR, RANZCR and RSNA. Radiol. Artif. Intell. 6, e230513 (2024).
Google Scholar
Seah, J. C. Y. et al. Effect of a comprehensive deep-learning model on the accuracy of chest x-ray interpretation by radiologists: a retrospective, multireader multicase study. Lancet Digit. Health 3, e496–e506 (2021).
Google Scholar
Kim, R. Y. et al. Artificial intelligence tool for assessment of indeterminate pulmonary nodules detected with CT. Radiology 304, 683–691 (2022).
Google Scholar
Lee, J. H., Hong, H., Nam, G., Hwang, E. J. & Park, C. M. Effect of human-AI interaction on detection of malignant lung nodules on chest radiographs. Radiology 307, e222976 (2023).
Google Scholar
Yu, F. et al. Heterogeneity and predictors of the effects of AI assistance on radiologists. Nat. Med. 30, 837–849 (2024).
Google Scholar
Gaube, S. et al. Do as AI say: susceptibility in deployment of clinical decision-aids. NPJ Digit. Med. 4, 31 (2021).
Google Scholar
Choe, J., Lee, S. & Shim, H. Attention-based dropout layer for weakly supervised single object localization and semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 43, 4256–4271 (2021).
Google Scholar
Rao, Y., Chen, G., Lu, J. & Zhou, J. Counterfactual attention learning for fine-grained visual categorization and re-identification. In IEEE International Conference on Computer Vision (ICCV) 1005–1014 (IEEE, 2021).
Tammemagi, M. C. et al. Selection criteria for lung-cancer screening. N. Engl. J. Med. 368, 728–736 (2013).
Google Scholar
Swensen, S., Silverstein, M., Ilstrup, D., Schleck, C. & Edell, E. The probability of malignancy in solitary pulmonary nodules. Application to small radiologically indeterminate nodules. Arch. Intern. Med. 157, 849–155 (1997).
Google Scholar
Selvaraju, R. R. et al. Grad-CAM: visual explanations from deep networks via gradient-based localization. Int. J. Comput. Vis. 128, 336–359 (2020).
Google Scholar
Wang, T. W. et al. Standalone deep learning versus experts for diagnosis lung cancer on chest computed tomography: a systematic review. Eur. Radiol. 34, 7397–7407 (2024).
Pan, Z. et al. Predicting invasiveness of lung adenocarcinoma at chest CT with deep learning ternary classification models. Radiology 311, e232057 (2024).
Google Scholar
Wulaningsih, W. et al. Deep learning models for predicting malignancy risk in CT-detected pulmonary nodules: a systematic review and meta-analysis. Lung 202, 625–636 (2024).
Ridge, C. A. et al. Differentiating between subsolid and solid pulmonary nodules at CT: inter- and intraobserver agreement between experienced thoracic radiologists. Radiology 278, 888–896 (2016).
Google Scholar
Wiener, R. S. et al. Resource use and guideline concordance in evaluation of pulmonary nodules for cancer: too much and too little care. JAMA Intern. Med. 174, 871–880 (2014).
Google Scholar
Rosenkrantz, A. B., Xue, X., Gyftopoulos, S., Kim, D. C. & Nicola, G. N. Downstream costs associated with incidental pulmonary nodules detected on CT. Acad. Radiol. 26, 798–802 (2019).
Google Scholar
Gierada, D. S., Rydzak, C. E., Zei, M. & Rhea, L. Improved interobserver agreement on Lung-RADS classification of solid nodules using semiautomated CT volumetry. Radiology 297, 675–684 (2020).
Google Scholar
Shu, J. et al. Improved interobserver agreement on nodule type and Lung-RADS classification of subsolid nodules using computer-aided solid component measurement. Eur. J. Radiol. 152, 110339 (2022).
Google Scholar
Office of the Leading Group for the Seventh National Population Census of the State Council. Major Figures on 2020 Population Census of China (China Statistics Press, 2021); https://www.stats.gov.cn/sj/pcsj/rkpc/d7c/202303/P020230301403217959330.pdf (in Chinese).
Wang, Z., Yang, G. & Guo, Y. Harnessing the opportunity to achieve health equity in China. Lancet Public Health 6, e867–e868 (2021).
Google Scholar
Alcaraz, K. I. et al. Understanding and addressing social determinants to advance cancer health equity in the United States: a blueprint for practice, research, and policy. CA Cancer J. Clin. 70, 31–46 (2020).
Google Scholar
Jabbour, S. et al. Measuring the impact of AI in the diagnosis of hospitalized patients: a randomized clinical vignette survey study. JAMA 330, 2275–2284 (2023).
Google Scholar
Prinster, D. et al. Care to explain? AI explanation types differentially impact chest radiograph diagnostic performance and physician trust in AI. Radiology 313, e233261 (2024).
Google Scholar
Lang, K. et al. Artificial intelligence-supported screen reading versus standard double reading in the Mammography Screening with Artificial Intelligence trial (MASAI): a clinical safety analysis of a randomised, controlled, non-inferiority, single-blinded, screening accuracy study. Lancet Oncol. 24, 936–944 (2023).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2770–2778 (IEEE, 2016).
Choe, J. & Shim, H. Attention-based dropout layer for weakly supervised object localization. In 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2214–2223 (IEEE, 2019).
Hu, T., Qi, H., Huang, Q. & Lu, Y. See better before looking closer: weakly supervised data augmentation network for fine-grained visual classification. Preprint at https://arxiv.org/abs/1901.09891 (2019).
Li, G., Müller, M., Thabet, A. & Ghanem, B. DeepGCNs: can GCNs go as deep as CNNs? In IEEE International Conference on Computer Vision (ICCV) 9267–9276 (IEEE, 2019).
Paszke, A. et al. PyTorch: an imperative style, high-performance deep learning library. In Proc. 33rd International Conference on Neural Information Processing Systems 8026–8037 (2019).
Hillis, S. L. & Schartz, K. M. Multireader sample size program for diagnostic studies: demonstration and methodology. J. Med. Imaging 5, 045503 (2018).
Google Scholar
Sounderajah, V. et al. The STARD-AI reporting guideline for diagnostic accuracy studies using artificial intelligence. Nat. Med. 31, 3283–3289.

