Iterative Machine Learning for Classification and Discovery of Single-Molecule Unfolding Trajectories from Force Spectroscopy Data

Vanni Doffini, Haipei Liu, Zhaowei Liu, Michael A. Nash

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

We report the application of machine learning techniques to expedite classification and analysis of protein unfolding trajectories from force spectroscopy data. Using kernel methods, logistic regression, and triplet loss, we developed a workflow called Forced Unfolding and Supervised Iterative Online (FUSION) learning where a user classifies a small number of repeatable unfolding patterns encoded as images, and a machine is tasked with identifying similar images to classify the remaining data. We tested the workflow using two case studies on a multidomain XMod-Dockerin/Cohesin complex, validating the approach first using synthetic data generated with a Monte Carlo algorithm and then deploying the method on experimental atomic force spectroscopy data. FUSION efficiently separated traces that passed quality filters from unusable ones, classified curves with high accuracy, and identified unfolding pathways that were undetected by the user. This study demonstrates the potential of machine learning to accelerate data analysis and generate new insights in protein biophysics.
Original languageEnglish
Pages (from-to)10406-10413
JournalNano Letters
Volume23
Issue number22
DOIs
Publication statusPublished - 22 Nov 2023
Externally publishedYes

Funding

This work was supported by the University of Basel, the Swiss Federal Institute of Technology in Zürich (ETH Zürich), and the Swiss Nanoscience Institute (SNI, project P1802). The authors declare that some text was edited using ChatGPT ( https://chat.openai.com ).

FundersFunder number
Swiss Federal Institute of Technology in ZürichETH Zürich
Universität Basel
Swiss Nanoscience InstituteP1802
Swiss Nanoscience Institute

    Fingerprint

    Dive into the research topics of 'Iterative Machine Learning for Classification and Discovery of Single-Molecule Unfolding Trajectories from Force Spectroscopy Data'. Together they form a unique fingerprint.

    Cite this