Towards an automatic requirements classification in a new Spanish dataset

Maria Isabel Limaylla-Lunarejo, Nelly Condori-Fernandez, Miguel R. Luaces

Research output: Chapter in Book / Report / Conference proceeding › Conference contribution › Academic › peer-review

Abstract

Machine Learning (ML) algorithms have become a powerful instrument in software requirements classification. Nevertheless, most research on requirements classification targets English, with less attention paid to other languages. Given the lack of datasets in Spanish, we created a new dataset from a collection of requirements drawn from final degree projects at the University of A Coruña. In this paper, we investigate which combinations of text vectorization techniques and ML algorithms perform best for requirements classification on a Spanish dataset. We found that SVM with TF-IDF gives the highest F1-score (0.95 and 0.79 for functional and non-functional classification, respectively).

Original language: English
Title of host publication: 2022 IEEE 30th International Requirements Engineering Conference (RE)
Subtitle of host publication: [Proceedings]
Editors: Eric Knauss, Gunter Mussbacher, Chetan Arora, Muneera Bano, Jean-Guy Schneider
Publisher: IEEE Computer Society
Pages: 270-271
Number of pages: 2
ISBN (Electronic): 9781665470001
ISBN (Print): 9781665470018
DOIs
Publication status: Published - 19 Oct 2022
Event: 30th IEEE International Requirements Engineering Conference, RE 2022 - Virtual, Online, Australia
Duration: 15 Aug 2022 to 19 Aug 2022

Publication series

Name: Proceedings of the IEEE International Conference on Requirements Engineering
Number: August
Volume: 2022
ISSN (Print): 1090-705X
ISSN (Electronic): 2332-6441

Conference

Conference: 30th IEEE International Requirements Engineering Conference, RE 2022
Country/Territory: Australia
City: Virtual, Online
Period: 15/08/22 to 19/08/22

Bibliographical note

Funding Information:
Partially funded by Xunta de Galicia/FEDER-UE GRC: ED431C 2021/53.

Publisher Copyright:
© 2022 IEEE.

Keywords

  • automatic classification requirements
  • machine learning algorithms
  • natural language processing
  • Spanish requirements
