Abstract
Machine Learning (ML) algorithms have become a powerful instrument in software requirements classification. Nevertheless, most of the research focusing on requirements is in English, with less attention to other languages. Given a lack of datasets in Spanish, we created a new dataset from a collection of requirements from final degree projects from the University of A Coruña. In this paper, we investigate which combinations of text vectorization techniques with ML algorithms perform best for requirements classification in a Spanish dataset. We found that SVM with TF-IDF gives the highest f1-score (0.95 and 0.79 for functional and non-functional classification).
| Original language | English |
|---|---|
| Title of host publication | 2022 IEEE 30th International Requirements Engineering Conference (RE) |
| Subtitle of host publication | [Proceedings] |
| Editors | Eric Knauss, Gunter Mussbacher, Chetan Arora, Muneera Bano, Jean-Guy Schneider |
| Publisher | IEEE Computer Society |
| Pages | 270-271 |
| Number of pages | 2 |
| ISBN (Electronic) | 9781665470001 |
| ISBN (Print) | 9781665470018 |
| DOIs | |
| Publication status | Published - 19 Oct 2022 |
| Event | 30th IEEE International Requirements Engineering Conference, RE 2022 - Virtual, Online, Australia Duration: 15 Aug 2022 → 19 Aug 2022 |
Publication series
| Name | Proceedings of the IEEE International Conference on Requirements Engineering |
|---|---|
| Number | August |
| Volume | 2022 |
| ISSN (Print) | 1090-705X |
| ISSN (Electronic) | 2332-6441 |
Conference
| Conference | 30th IEEE International Requirements Engineering Conference, RE 2022 |
|---|---|
| Country/Territory | Australia |
| City | Virtual, Online |
| Period | 15/08/22 → 19/08/22 |
Bibliographical note
Funding Information:Acknowledgments Partially funded by Xunta de Galicia/FEDER-UE GRC: ED431C 2021/53.
Publisher Copyright:
© 2022 IEEE.
Funding
Acknowledgments Partially funded by Xunta de Galicia/FEDER-UE GRC: ED431C 2021/53.
Keywords
- automatic classification requirements
- machine learning algorithms
- natural language processing
- Spanish requirements