Requirements Classification with Interpretable Machine Learning and Dependency Parsing

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Requirements classification is a traditional application of machine learning (ML) to RE that helps handle large requirements datasets. A prime example of an RE classification problem is the distinction between functional and non-functional (quality) requirements. State-of-the-art classifiers build their effectiveness on a large set of word features like text n-grams or POS n-grams, which do not fully capture the essence of a requirement. As a result, it is arduous for human analysts to interpret the classification results by exploring the classifier's inner workings. We propose the use of more general linguistic features, such as dependency types, for the construction of interpretable ML classifiers for RE. Through a feature engineering effort, in which we are assisted by modern introspection tools that reveal the hidden inner workings of ML classifiers, we derive a set of 17 linguistic features. While classifiers that use our proposed features fit the training set slightly worse than those that use high-dimensional feature sets, our approach performs generally better on validation datasets and it is more interpretable.
Original languageEnglish
Title of host publicationProceedings of the 27th IEEE International Requirements Engineering Conference (RE'19)
PublisherIEEE
DOIs
Publication statusPublished - 2019

Fingerprint

Dive into the research topics of 'Requirements Classification with Interpretable Machine Learning and Dependency Parsing'. Together they form a unique fingerprint.

Cite this