Improving the Accuracy of Vulnerability Report Classification Using Term Frequency-Inverse Gravity Moment

Loading...
Thumbnail Image

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE

Abstract

Software vulnerability analysis is one of the critical issues in the software industry, and vulnerability classification plays a major role in this analysis. A typical vulnerability classification model usually involves a stage of term selection, in which the relevant terms are identified via feature selection. It also involves a stage of term weighting, in which document weights for the selected terms are computed, and a stage for classifier learning. Generally, the term frequency-inverse document frequency (TF-IDF) is the most widely used term-weighting method. However, empirical evidence shows that the TF-IDF is plagued with issues pertaining to its effectiveness. This paper introduces a new approach for vulnerability classification, which is based on term frequency and inverse gravity moment (TF-IGM). The proposed method is validated by empirical experiments using three machine learning algorithms on ten publicly available vulnerability datasets. The result shows that TF-IGM outperforms the benchmark method across the applications studied.

Description

Research Article

Citation

P. K. Kudjo, J. Chen, M. Zhou, S. Mensah and R. Huang, "Improving the Accuracy of Vulnerability Report Classification Using Term Frequency-Inverse Gravity Moment," 2019 IEEE 19th International Conference on Software Quality, Reliability and Security (QRS), Sofia, Bulgaria, 2019, pp. 248-259. doi: 10.1109/QRS.2019.00041

Endorsement

Review

Supplemented By

Referenced By