An Optimal Spacing Approach for Sampling Small-sized Datasets for Software Effort Estimation

dc.contributor.authorAbedu, S.
dc.contributor.authorMensah, S.
dc.contributor.authorBoafo, F.
dc.date.accessioned2023-09-26T09:07:52Z
dc.date.available2023-09-26T09:07:52Z
dc.date.issued2023
dc.descriptionResearch Articleen_US
dc.description.abstractContext: There has been a growing research focus in conventional machine learning techniques for software effort estimation (SEE). However, there is a limited number of studies that seek to assess the performance of deep learning approaches in SEE. This is because the sizes of SEE datasets are relatively small. Purpose: This study seeks to define a threshold for small-sized datasets in SEE, and investigates the performance of selected conventional machine learning and deep learning models on small-sized datasets. Method: Plausible SEE datasets with their number of project instances and features are extracted from existing literature and ranked. Eubank’s optimal spacing theory is used to discretize the ranking of the project instances into three classes (small, medium and large). Five conventional machine learning models and two deep learning models are trained on each dataset classified as small-sized using the leave-one-out cross-validation. The mean absolute error is used to assess the prediction performance of each model. Result: Findings from the study contradicts existing knowledge by demonstrating that deep learning models provide improved prediction performance as compared to the conventional machine learning models on small-sized datasets. Conclusion: Deep learning can be adopted for SEE with the application of regularisation techniques.en_US
dc.identifier.other10.18293/SEKE2023-176
dc.identifier.urihttp://ugspace.ug.edu.gh:8080/handle/123456789/40111
dc.language.isoenen_US
dc.subjectDeep learningen_US
dc.subjectConventional Machine learningen_US
dc.subjectSoftware effort estimationen_US
dc.subjectSmall-sizeden_US
dc.subjectOptimal spacing theoryen_US
dc.titleAn Optimal Spacing Approach for Sampling Small-sized Datasets for Software Effort Estimationen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
An-Optimal-Spacing-Approach-for-Sampling-Smallsized-Datasets-for-Software-Effort-Estimation.pdf
Size:
519.43 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: