TY - JOUR
T1 - Sentiment analysis for software quality assessment
AU - Liang, Feng
AU - Hou, Fang
AU - Farshidi, Siamak
AU - Jansen, Slinger
N1 - Publisher Copyright:
© 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
PY - 2023
Y1 - 2023
N2 - During the software selection process, software engineers often rely on text reviews from repository platforms, communities, or forums to collect software quality information. As these reviews offer direct insight into users' experience and perception of the software components. However, text reviews are often formulated implicitly, and the process of gathering user feedback from multiple sources can be a time-intensive endeavor, posing challenges in the collection and analysis of substantial volumes of data. We conducted a systematic literature review to explore the state-of-the-art solutions in sentiment analysis. By leveraging the knowledge derived from the literature review, we developed a sentiment analysis tool to measure software component quality by analyzing the sentiment of reviews from experienced users. Our goal is to provide a channel to help software stakeholders gain insight into the software quality attributes, thus enhancing the overall health of software and the software ecosystem. This tool consists of TextRank, which extracts keywords related to software quality attributes from raw data, an Aho-Corasick automaton used to search for these keywords in reviews and map them to software quality attributes, and a sentiment analysis model to perform sentiment analysis. We compare four widely mentioned models in the literature review, namely BERT, BERT-BiLSTM, BERT-BiLSTM-Attention, and RoBERTa, in terms of performance metrics such as accuracy, F1, precision, and recall. BERT-BiLSTM-Attention is selected as the sentiment analysis model due to its superior performance in both training and test datasets. In addition, we integrated a decision algorithm that computes the fuzzy group consensus sentiment for the relevant quality attributes of each software component and visualizes it through a sentiment quality matrix.
AB - During the software selection process, software engineers often rely on text reviews from repository platforms, communities, or forums to collect software quality information. As these reviews offer direct insight into users' experience and perception of the software components. However, text reviews are often formulated implicitly, and the process of gathering user feedback from multiple sources can be a time-intensive endeavor, posing challenges in the collection and analysis of substantial volumes of data. We conducted a systematic literature review to explore the state-of-the-art solutions in sentiment analysis. By leveraging the knowledge derived from the literature review, we developed a sentiment analysis tool to measure software component quality by analyzing the sentiment of reviews from experienced users. Our goal is to provide a channel to help software stakeholders gain insight into the software quality attributes, thus enhancing the overall health of software and the software ecosystem. This tool consists of TextRank, which extracts keywords related to software quality attributes from raw data, an Aho-Corasick automaton used to search for these keywords in reviews and map them to software quality attributes, and a sentiment analysis model to perform sentiment analysis. We compare four widely mentioned models in the literature review, namely BERT, BERT-BiLSTM, BERT-BiLSTM-Attention, and RoBERTa, in terms of performance metrics such as accuracy, F1, precision, and recall. BERT-BiLSTM-Attention is selected as the sentiment analysis model due to its superior performance in both training and test datasets. In addition, we integrated a decision algorithm that computes the fuzzy group consensus sentiment for the relevant quality attributes of each software component and visualizes it through a sentiment quality matrix.
KW - Sentiment analysis
KW - software engineering
KW - software quality
UR - http://www.scopus.com/inward/record.url?scp=85179840348&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85179840348
SN - 1613-0073
VL - 3567
SP - 17
EP - 24
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
T2 - 22nd Belgium-Netherlands Software Evolution Workshop, BENEVOL 2023
Y2 - 27 November 2023 through 28 November 2023
ER -