Statistical Analysis of Public Sentiment on the Ghanaian Government: A Machine Learning Approach

dc.contributor.authorAndoh, J.
dc.contributor.authorAsiedu, L.
dc.contributor.authorLotsi, A.
dc.contributor.authorChapman-Wardy, C.
dc.date.accessioned2022-01-14T16:14:20Z
dc.date.available2022-01-14T16:14:20Z
dc.date.issued2021
dc.descriptionResearch Articleen_US
dc.description.abstractGathering public opinions on the Internet and Internet-based applications like Twitter has become popular in recent times, as it provides decision-makers with uncensored public views on products, government policies, and programs. Through natural language processing and machine learning techniques, unstructured data forms from these sources can be analyzed using traditional statistical learning. The challenge encountered in machine learning method-based sentiment classification still remains the abundant amount of data available, which makes it difficult to train the learning algorithms in feasible time. This eventually degrades the classification accuracy of the algorithms. From this assertion, the effect of training data sizes in classification tasks cannot be overemphasized. This study statistically assessed the performance of Naive Bayes, support vector machine (SVM), and random forest algorithms on sentiment text classification task. The research also investigated the optimal conditions such as varying data sizes, trees, and kernel types under which each of the respective algorithms performed best. The study collected Twitter data from Ghanaian users which contained sentiments about the Ghanaian Government. The data was preprocessed, manually labeled by the researcher, and then trained using the aforementioned algorithms. These algorithms are three of the most popular learning algorithms which have had lots of success in diverse fields. 'e Naive Bayes classifier was adjudged the best algorithm for the task as it outperformed the other two machine learning algorithms with an accuracy of 99%, F1 score of 86.51%, and Matthews correlation coefficient of 0.9906. The algorithm also performed well with increasing data sizes. 'e Naive Bayes classifier is recommended as viable for sentiment text classification, especially for text classification systems which work with Big Data.en_US
dc.identifier.otherhttps://doi.org/10.1155/2021/5561204
dc.identifier.urihttp://ugspace.ug.edu.gh/handle/123456789/37646
dc.language.isoenen_US
dc.publisherHindawien_US
dc.titleStatistical Analysis of Public Sentiment on the Ghanaian Government: A Machine Learning Approachen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Statistical-Analysis-of-Public-Sentiment-on-the-Ghanaian-Government-A-Machine-Learning-ApproachAdvances-in-HumanComputer-Interaction.pdf
Size:
1.78 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.6 KB
Format:
Item-specific license agreed upon to submission
Description: