BI Lingfeng 1,2 , LI Jiayi 1,2 , WANG Xin 1,2 , HOU Wang 1,2,3,4 , WANG Zhoufeng 1,2 , WANG Jing 5 , LI Weimin 1,2,3,4
  • 1. Department of Respiratory and Critical Care Medicine, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Respiratory Health and Multimorbidity, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, P.R.China;
  • 2. Institute of Respiratory Health, State Key Laboratory of Respiratory Health and Multimorbidity, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, P.R.China;
  • 3. Precision Medicine Center, Precision Medicine Key Laboratory of Sichuan Province, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, P.R.China;
  • 4. The Research Units of West China, Chinese Academy of Medical Sciences, West China Hospital, Chengdu, Sichuan 610041, P.R.China;
  • 5. Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100730, P.R.China;
LI Weimin, Email: weimi003@scu.edu.cn
Export PDF Favorites Scan Get Citation

Objective To develop a multimodal recurrence prediction model for stage I NSCLC by integrating radiomics, genomics, and clinical data through hierarchical feature engineering and cross-omics interaction algorithms.Methods This study pioneered a hierarchical multimodal integration framework, synergizing radiomics (2 265 nodule features), genomics (116 whole-exome sequencing profiles), and clinical data from 323 stage I NSCLC patients. A two-stage feature engineering pipeline (correlation analysis + Random Forest selection) optimized discriminative features, and a cross-omics interaction algorithm dynamically quantified spatial associations between imaging phenotypes and genomic alterations. Results Single-modality model performance: The radiomics model (XGBoost, AUC=0.896±0.088) demonstrated the best predictive efficacy among single-modality models, significantly outperforming the genomics-based model (Random Forest, AUC=0.644±0.196) and the clinical model (Random Forest, AUC=0.742±0.160). Dual-modality model performance: The radiomics-genomics integrated model achieved the best performance (XGBoost, AUC=0.942±0.077) among dual-modality models, superior to the radiomics-clinical model (XGBoost, AUC=0.929±0.086) and the genomics-clinical model (Random Forest, AUC=0.733±0.164).Trimodal integration performance: The trimodal model integrating radiomics, genomics, and clinical data achieved the peak performance (XGBoost, AUC=0.971±0.082), which was significantly better than all single-modality and dual-modality benchmarks.Conclusions This work establishes a methodological framework for predicting postoperative recurrence in stage I NSCLC through multimodal feature integration guided by domain-specific machine learning. Its superior performance highlights the advantage of multimodal integration, offering actionable insights for personalized surveillance strategies.

Citation: BI Lingfeng, LI Jiayi, WANG Xin, HOU Wang, WANG Zhoufeng, WANG Jing, LI Weimin. Multimodal Integration of Radiomics, Genomics, and Clinical Data Predicts Postoperative Recurrence in Stage I Non-Small Cell Lung Cancer. Chinese Journal of Respiratory and Critical Care Medicine, 2025, 24(12): 850-858. doi: 10.7507/1671-6205.202510027 Copy

Copyright ? the editorial department of Chinese Journal of Respiratory and Critical Care Medicine of West China Medical Publisher. All rights reserved

  • Previous Article

    Study on the safety and efficacy of bronchial thermoplasty and omalizumab after bronchial thermoplasty in severe asthma
  • Next Article

    Risk factors and predictive model for fibrosis progression in patients with CT-defined usual interstitial pneumonia pattern