An automatic software vulnerability classification framework using term frequency-inverse gravity moment and feature selection
| dc.contributor.author | Mensah, S. | |
| dc.contributor.author | Chen, J. | |
| dc.contributor.author | Kudjo, P.K. | |
| dc.contributor.author | Brown, S.A. | |
| dc.contributor.author | Akorfu, G. | |
| dc.date.accessioned | 2020-07-02T13:51:04Z | |
| dc.date.available | 2020-07-02T13:51:04Z | |
| dc.date.issued | 2020-05-15 | |
| dc.description | Research Article | en_US |
| dc.description.abstract | Vulnerability classification is an important activity in software development and software quality main- tenance. A typical vulnerability classification model usually involves a stage of term selection, in which the relevant terms are identified via feature selection. It also involves a stage of term-weighting, in which the document weights for the selected terms are computed, and a stage for classifier learning. Generally, the term frequency-inverse document frequency (TF-IDF) model is the most widely used term-weighting metric for vulnerability classification. However, several issues hinder the effectiveness of the TF-IDF model for document classification. To address this problem, we propose and evaluate a general framework for vulnerability severity classification using the term frequency-inverse gravity moment (TF-IGM). Specifi- cally, we extensively compare the term frequency-inverse gravity moment, term frequency-inverse doc- ument frequency, and information gain feature selection using five machine learning algorithms on ten vulnerable software applications containing a total number of 27,248 security vulnerabilities . The exper- imental result shows that: (i) the TF-IGM model is a promising term weighting metric for vulnerability classification compared to the classical term-weighting metric, (ii) the effectiveness of feature selection on vulnerability classification varies significantly across the studied datasets and (iii) feature selection improves vulnerability classification. | en_US |
| dc.description.sponsorship | National Natural Science Foundation of China (NSFC U1836116 , 6170022430 and61872167 ), the Project of Jiangsu Provincial Six Talent Peaks (Grant num- ber: XXJS-016 ), The Postdoctoral Science Foundation of China 1112019T120399 and theGraduateResearchInnovation Projectof Jiangsu Province (Grant numbers KYCX17 1807 ). | en_US |
| dc.identifier.uri | http://ugspace.ug.edu.gh/handle/123456789/35446 | |
| dc.language.iso | en | en_US |
| dc.publisher | Journal of Systems and Software | en_US |
| dc.relation.ispartofseries | 167; | |
| dc.subject | Software vulnerability | en_US |
| dc.subject | Classification | en_US |
| dc.subject | Feature selection | en_US |
| dc.subject | Machine learning algorithms | en_US |
| dc.subject | Severity | en_US |
| dc.subject | Term-weighting | en_US |
| dc.title | An automatic software vulnerability classification framework using term frequency-inverse gravity moment and feature selection | en_US |
| dc.type | Article | en_US |
Files
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.6 KB
- Format:
- Item-specific license agreed upon to submission
- Description:
