Skip to main navigation Skip to search Skip to main content

Identifying past-year self-reported suicidality in outpatients with somatic symptom disorder using an interpretable machine-learning model: a multicenter study with an online calculator

  • Xing Wang
  • , Shuixiu Lai
  • , Peng Wang
  • , Yibo Li
  • , Yunhui Zhong*
  • , Tieshi Zhu*
  • *Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Background: Somatic symptom disorder (SSD) is associated with an elevated risk of suicidality. However, clinically implementable tools to identify outpatients with SSD who may warrant prioritized suicidality assessment remain limited. We therefore aimed to develop an interpretable model using routinely available outpatient data to stratify the likelihood of past-year self-reported suicidality. Methods: We analyzed a multicenter cross-sectional registry from 3 hospitals in Ganzhou including adults aged 18–60 years with DSM-5–defined SSD. Past-year self-reported suicidality was assessed using a prespecified binary (yes/no) item. Data were split 70/30 into training/test sets. Candidate predictors were selected by the intersection of least absolute shrinkage and selection operator and Boruta. Eight algorithms were trained with repeated 5-fold cross-validation and compared primarily by area under the receiver operating characteristic curve (AUC) and Brier score; the top model underwent calibration and decision-curve analysis. Shapley additive explanations (SHAP) provided model explanations; a Shiny web calculator was implemented. Results: Of 899 participants (median age, 33 years; 64.4% female), 19.9% reported past-year suicidality. All models showed high discrimination in the test set (AUCs > 0.900). The random forest (RANGER implementation) performed best (AUC, 0.978; 95% CI, 0.955–1.000; area under the precision–recall curve, 0.960; Brier, 0.028; accuracy, 0.967; sensitivity, 0.927; specificity, 0.977), with good calibration and favorable net clinical benefit on DCA. SHAP ranked insomnia severity index as the leading contributor, followed by the five facet mindfulness questionnaire and the repeatable battery for the assessment of neuropsychological status. Conclusions: In SSD outpatients, an interpretable RANGER-based model showed strong internal performance for classifying participants who reported past-year self-reported suicidality, and yielded favorable clinical net benefit across relevant decision thresholds. A web-based calculator illustrates potential usability in outpatient settings; external validation and prospective implementation studies are warranted before routine adoption.

Original languageEnglish
Article number255
JournalBMC Psychiatry
Volume26
Issue number1
DOIs
Publication statusPublished - Dec 2026

Bibliographical note

Publisher Copyright:
© The Author(s) 2026.

Keywords

  • Machine learning
  • Risk prediction
  • Somatic symptom disorder
  • Suicidality
  • Web-based calculator

Fingerprint

Dive into the research topics of 'Identifying past-year self-reported suicidality in outpatients with somatic symptom disorder using an interpretable machine-learning model: a multicenter study with an online calculator'. Together they form a unique fingerprint.

Cite this