Abstract
Context and motivation: Machine Learning (ML) algorithms and Natural Language Processing (NLP) techniques have effectively supported the automatic software requirements classification. The emergence of pre-trained language models, like BERT, provides promising results in several downstream NLP tasks, such as text classification. Question/problem: Most ML/DL approaches on requirements classification show a lack of analysis for requirements written in the Spanish language. Moreover, there has not been much research on pre-trained language models, like fastText and BETO (BERT for the Spanish language), neither in the validation of the generalization of the models. Principal ideas/results: We aim to investigate the classification performance and generalization of fastText and BETO classifiers in comparison with other ML/DL algorithms. The findings show that Shallow ML algorithms outperformed fastText and BETO when training and testing in the same dataset, but BETO outperformed other classifiers on prediction performance in a dataset with different origins. Contribution: Our evaluation provides a quantitative analysis of the classification performance of fastTest and BETO in comparison with ML/DL algorithms, the external validity of trained models on another Spanish dataset, and the translation of the PROMISE NFR dataset in Spanish.
Original language | English |
---|---|
Title of host publication | Requirements Engineering: Foundation for Software Quality |
Subtitle of host publication | 29th International Working Conference, REFSQ 2023, Barcelona, Spain, April 17–20, 2023, Proceedings |
Editors | Alessio Ferrari, Birgit Penzenstadler |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 159-176 |
Number of pages | 18 |
ISBN (Electronic) | 9783031297861 |
ISBN (Print) | 9783031297854 |
DOIs | |
Publication status | Published - 2023 |
Event | 29th International Working Conference on Requirements Engineering: Foundation for Software Quality, REFSQ 2023 - Barcelona, Spain Duration: 17 Apr 2023 → 20 Apr 2023 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Publisher | Springer |
Volume | 13975 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 29th International Working Conference on Requirements Engineering: Foundation for Software Quality, REFSQ 2023 |
---|---|
Country/Territory | Spain |
City | Barcelona |
Period | 17/04/23 → 20/04/23 |
Bibliographical note
Funding Information:Acknowledgement. This research was partially funded by Xunta de Galicia/ FEDER-UE ED413C 2021/53 (Database Lab, UDC) and Galician Ministry of Culture, Education, Professional Training, and University (grants ED431G2019/04, ED431C2022/19).
Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Funding
Acknowledgement. This research was partially funded by Xunta de Galicia/ FEDER-UE ED413C 2021/53 (Database Lab, UDC) and Galician Ministry of Culture, Education, Professional Training, and University (grants ED431G2019/04, ED431C2022/19).
Funders | Funder number |
---|---|
Galician Ministry of Culture | |
University | ED431G2019/04, ED431C2022/19 |
Xunta de Galicia | ED413C 2021/53 |
Xunta de Galicia |
Keywords
- Automatic classification requirements
- BETO
- fastText
- Spanish requirements