Identification of Polymers with a Small Data Set of Mid-infrared Spectra: A Comparison between Machine Learning and Deep Learning Models

Xin Tian*, Frederic Beén, Yiqun Sun, Peter van Thienen, Patrick S. Bäuerlein

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Identifying environmental polymers and microplastics is crucial for the scientific world, environmental agencies, and water authorities to estimate their environmental impact and increase efforts to decrease emissions. On the basis of different spectroscopy techniques, e.g., laser-directed infrared imaging and Raman spectroscopy, polymers can be observed and represented as spectroscopic signals. The latter can be further analyzed and classified by data science, in particular, machine learning (ML). Past studies applied a variety of ML models to identify polymers from small or large data sets. However, a comprehensive comparison of multiple models across different data set sizes is still needed, which is presented in this study. Furthermore, we also provide a practical data augmentation technique to generate synthetic samples when only a limited number of samples are available. Our results show that the ensemble ML model, compared to neural network models, takes the least training time to achieve the best performance, i.e., a classification accuracy of 99.5%. This study provides a generic framework for selecting ML models and boosting model performance to accurately identify polymers.

Original languageEnglish
Pages (from-to)1030-1035
Number of pages6
JournalEnvironmental Science and Technology Letters
Volume10
Issue number11
Early online date11 Jan 2023
DOIs
Publication statusPublished - 14 Nov 2023

Bibliographical note

Funding Information:
This research was funded by the joint research program of the Dutch and Flemish water utilities (BTO 402045/228). The authors thank three anonymous reviewers, who provided helpful comments to improve the quality of this paper.

Publisher Copyright:
© 2023 The Authors. Published by American Chemical Society.

Funding

This research was funded by the joint research program of the Dutch and Flemish water utilities (BTO 402045/228). The authors thank three anonymous reviewers, who provided helpful comments to improve the quality of this paper.

Keywords

  • data science
  • deep learning
  • ensemble-supervised learning
  • LDIR
  • microplastics
  • polymers

Fingerprint

Dive into the research topics of 'Identification of Polymers with a Small Data Set of Mid-infrared Spectra: A Comparison between Machine Learning and Deep Learning Models'. Together they form a unique fingerprint.

Cite this