Speech Detection via Respiratory Inductance Plethysmography, Thoracic Impedance, Accelerometers, and Gyroscopes: A Machine Learning-Informed Comparative Study

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Speech production interferes with the measurement of changes in cardiac vagal activity during acute stress by attenuating the expected drop in heart rate variability. Speech also induces cardiac sympathetic changes similar to those induced by psychological stress. In the laboratory, confounding of physiological stress reactivity by speech may be controlled experimentally. In ambulatory assessments, however, detection of speech episodes would be necessary to separate the physiological effects of psychosocial stress from those of speech. Using machine learning (https://osf.io/bk9nf), we trained and tested speech classification models on data from 56 participants (ages 18-39) under controlled laboratory conditions. They were equipped with privacy-secure wearables measuring thoracoabdominal respiratory inductance plethysmography (RIP from a single and a dual-band set-up), thoracic impedance pneumography, and an upper sternum positioned unit with triaxial accelerometers and gyroscopes. Following an 80/20 train-test split, nested cross-validations were run with the machine learning algorithms XGBoost, gradient boosting, random forest, and logistic regression on the training set to get generalized performance estimates. Speech classification by the best model per method was then validated in the test set. Speech versus no-speech classification performance (AUC) for both nested cross-validation and test set predictions was excellent for thorax-abdomen RIP (nested cross-validation: 96.6%, test set prediction: 98.5%), thorax-only RIP (97.5%, 99.1%), impedance (97.0%, 97.8%), and accelerometry (99.3%, 99.6%). The sternal accelerometer method outperformed others. These open-access models leveraging biosignals have the potential to also work in daily life settings. This could enhance the trustworthiness of ambulatory psychophysiology, by enabling detection of speech and controlling for its confounding effects on physiology.

Original languageEnglish
Article numbere70021
Pages (from-to)e70021
JournalPsychophysiology
Volume62
Issue number2
DOIs
Publication statusPublished - Feb 2025

Bibliographical note

© 2025 The Author(s). Psychophysiology published by Wiley Periodicals LLC on behalf of Society for Psychophysiological Research.

Funding

Funding: This work was supported by Nederlandse Organisatie voor Wetenschappelijk Onderzoek (Grant 024.005.010). This work is funded by Stress in Action. The research project \u201CStress in Action\u201D: www.stress-in-action.nl is financially supported by the Dutch Research Council and the Dutch Ministry of Education, Culture and Science (NWO gravitation grant number 024.005.010). This work is funded by Stress in Action. The research project \u201CStress in Action\u201D: www.stress\u2010in\u2010action.nl is financially supported by the Dutch Research Council and the Dutch Ministry of Education, Culture and Science (NWO gravitation grant number 024.005.010).

FundersFunder number
Ministerie van Onderwijs, Cultuur en Wetenschap
Nederlandse Organisatie voor Wetenschappelijk Onderzoek024.005.010

    Keywords

    • Humans
    • Machine Learning
    • Adult
    • Male
    • Young Adult
    • Female
    • Adolescent
    • Speech/physiology
    • Accelerometry/instrumentation
    • Plethysmography
    • Cardiography, Impedance
    • Wearable Electronic Devices

    Fingerprint

    Dive into the research topics of 'Speech Detection via Respiratory Inductance Plethysmography, Thoracic Impedance, Accelerometers, and Gyroscopes: A Machine Learning-Informed Comparative Study'. Together they form a unique fingerprint.

    Cite this