Requirements Classification Using FastText and BETO in Spanish Documents

María Isabel Limaylla-Lunarejo, Nelly Condori-Fernandez*, Miguel R. Luaces

*Corresponding author for this work

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

28 Downloads (Pure)

Abstract

Context and motivation: Machine Learning (ML) algorithms and Natural Language Processing (NLP) techniques have effectively supported the automatic software requirements classification. The emergence of pre-trained language models, like BERT, provides promising results in several downstream NLP tasks, such as text classification. Question/problem: Most ML/DL approaches on requirements classification show a lack of analysis for requirements written in the Spanish language. Moreover, there has not been much research on pre-trained language models, like fastText and BETO (BERT for the Spanish language), neither in the validation of the generalization of the models. Principal ideas/results: We aim to investigate the classification performance and generalization of fastText and BETO classifiers in comparison with other ML/DL algorithms. The findings show that Shallow ML algorithms outperformed fastText and BETO when training and testing in the same dataset, but BETO outperformed other classifiers on prediction performance in a dataset with different origins. Contribution: Our evaluation provides a quantitative analysis of the classification performance of fastTest and BETO in comparison with ML/DL algorithms, the external validity of trained models on another Spanish dataset, and the translation of the PROMISE NFR dataset in Spanish.

Original languageEnglish
Title of host publicationRequirements Engineering: Foundation for Software Quality
Subtitle of host publication29th International Working Conference, REFSQ 2023, Barcelona, Spain, April 17–20, 2023, Proceedings
EditorsAlessio Ferrari, Birgit Penzenstadler
PublisherSpringer Science and Business Media Deutschland GmbH
Pages159-176
Number of pages18
ISBN (Electronic)9783031297861
ISBN (Print)9783031297854
DOIs
Publication statusPublished - 2023
Event29th International Working Conference on Requirements Engineering: Foundation for Software Quality, REFSQ 2023 - Barcelona, Spain
Duration: 17 Apr 202320 Apr 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer
Volume13975 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference29th International Working Conference on Requirements Engineering: Foundation for Software Quality, REFSQ 2023
Country/TerritorySpain
CityBarcelona
Period17/04/2320/04/23

Bibliographical note

Funding Information:
Acknowledgement. This research was partially funded by Xunta de Galicia/ FEDER-UE ED413C 2021/53 (Database Lab, UDC) and Galician Ministry of Culture, Education, Professional Training, and University (grants ED431G2019/04, ED431C2022/19).

Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Funding

Acknowledgement. This research was partially funded by Xunta de Galicia/ FEDER-UE ED413C 2021/53 (Database Lab, UDC) and Galician Ministry of Culture, Education, Professional Training, and University (grants ED431G2019/04, ED431C2022/19).

FundersFunder number
Galician Ministry of Culture
UniversityED431G2019/04, ED431C2022/19
Xunta de GaliciaED413C 2021/53
Xunta de Galicia

    Keywords

    • Automatic classification requirements
    • BETO
    • fastText
    • Spanish requirements

    Fingerprint

    Dive into the research topics of 'Requirements Classification Using FastText and BETO in Spanish Documents'. Together they form a unique fingerprint.

    Cite this