Background. For gastric cancer(GC), accurately identifying lymphovascular invasion (LVI) and perineural invasion (PNI) is critical for guiding treatment decisions. In recent years, researchers have explored machine learning (ML) for preoperative detection of LVI and PNI in GC; however, systematic evidence on its diagnostic accuracy remains scarce. To address this gap, this study was undertaken to comprehensively review ML's performance for detecting LVI and PNI, thereby providing empirical evidence to enhance or update intelligent diagnostic techniques.
Methods. Multiple databases, PubMed, the Cochrane Library, Embase, and Web of Science, were retrieved for ML articles on the prediction of LVI and PNI in GC, covering publications from inception through January 13, 2025. Researchers employed PROBAST to ascertain the potential risk of bias in the selected studies. Subgroup analyses were executed by modeling variables. All meta-analyses were executed in Stata 15.0.
Results. Twenty-four eligible studies were ultimately incorporated, most of which employed logistic regression models (LRMs). For LVI diagnosis in the validation sets, models using clinical features (CFs) alone demonstrated a sensitivity (SEN) of 0.71 (95% CI: 0.61-0.80) and a specificity (SPC) of 0.77 (95% CI: 0.67-0.85). Models utilizing radiomics features (RFs) alone yielded a SEN of 0.80 (95% CI: 0.71-0.86) and a SPC of 0.66 (95% CI: 0.56-0.75). The models combining CFs and RFs outperformed others, with a SEN of 0.81 (95% CI: 0.75-0.86) and a SPC of 0.73 (95% CI: 0.65-0.80). Regarding the detection of PNI in validation sets, CF-based models yielded a SEN of 0.71 (95% CI: 0.60-0.80) with a corresponding SPC of 0.62 (95% CI: 0.52-0.71). RF-based models demonstrated a SEN of 0.75 (95% CI: 0.71-0.79) and a SPC of 0.76 (95% CI: 0.72-0.79); while the models based on both CFs and RFs exhibited superior performance with a SEN of 0.81 (95% CI: 0.76-0.84) and a SPC of 0.82 (95% CI: 0.79-0.85).
Conclusions. ML models combining radiomics with CFs appear to be a feasible approach for identifying LVI and PNI in GC. However, the number of included studies was small, and these studies used a random split approach for internal validation. Future research should incorporate more multicenter studies to develop more robust prediction models for clinical implementation.
If you have any questions about submitting your review, please email us at [email protected].