ARAÚJO, Victor Eduardo Borges.
Abstract:
The software development process generates several textual artifacts, most of which are written in natural language. Establishing connections between these artifacts can benefit a variety of tasks, including code comprehension and maintenance. Information Retrieval (IR) and Machine Learning (ML) techniques have already been proposed for recovering traceability between bug reports and test cases; however, the results indicated a need for improvement, especially in handling differences in vocabulary. In this work, we created a Vocabulary Unifier that uses a thesaurus to expand the vocabulary found in bug reports, aiming to align their terms with the terms used in the test cases. We evaluated the technique by comparing its recall, precision, and F2-score with those reported in previous work, and observed slight improvements in these values.
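
The two ideas summarized above, thesaurus-based term expansion and the F2-score, can be illustrated with a minimal sketch. It is not the implementation evaluated in this work: the function names, the toy thesaurus, and the rule of keeping only synonyms that occur in the test-case vocabulary are assumptions made for illustration. The F-beta formula itself is the standard one, with beta = 2 weighting recall more heavily than precision.

```python
def unify_vocabulary(bug_tokens, test_vocabulary, thesaurus):
    """Expand bug-report tokens with synonyms that appear in the test-case vocabulary.

    `thesaurus` is a hypothetical mapping from a term to a list of synonyms.
    """
    expanded = list(bug_tokens)
    for token in bug_tokens:
        for synonym in thesaurus.get(token, ()):
            # Keep only synonyms the test cases actually use (illustrative rule).
            if synonym in test_vocabulary and synonym not in expanded:
                expanded.append(synonym)
    return expanded


def f_beta(precision, recall, beta=2.0):
    """Standard F-beta score; beta=2 gives the F2-score."""
    if precision == 0 and recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Toy data, invented for this example only.
toy_thesaurus = {"crash": ["freeze", "hang"]}
toy_test_vocabulary = {"browser", "tab", "freeze"}
expanded_report = unify_vocabulary(["browser", "crash"], toy_test_vocabulary, toy_thesaurus)
```

In this toy case, "freeze" is added to the bug report because the test cases use it, while "hang" is discarded because they do not.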