Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification

Vivian, Lee Lay Shan (2022) Incorporating Informative Score For Instance Selection In Semi-supervised Sentiment Classification. Masters thesis, Universiti Sains Malaysia.


Abstract

Sentiment classification is a useful tool for classifying reviews, which contain a wealth of information about sentiments and attitudes towards a product or service. Existing studies rely heavily on sentiment classification methods that require fully annotated input. However, labelled text is limited, which makes acquiring fully annotated input costly and labour intensive. In recent years, semi-supervised methods have been recommended because they require only partially labelled input and perform comparably to the currently preferred methods. At the same time, some works have reported that the performance of a semi-supervised model degraded after unlabelled instances were added to training. This contrast in the current literature suggests that not all unlabelled instances are equally useful; thus, identifying the informative unlabelled instances is beneficial in training a semi-supervised model. To achieve this, an informative score is proposed and incorporated into semi-supervised sentiment classification. The experiment compared the accuracy and loss of a supervised method, a semi-supervised method without the informative score and a semi-supervised method with the informative score. With the informative score identifying informative unlabelled instances, semi-supervised models perform better than semi-supervised models that do not incorporate the informative score into their training. Although the semi-supervised models that incorporate the informative score do not surpass the supervised models, the results are still promising, as the differences in performance are small and the number of labelled instances used is greatly reduced.
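
The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not define the informative score, so the code below substitutes a simple stand-in, the margin between the two highest predicted class probabilities; this is an assumption for illustration, not the thesis's formula. The names `margin_score` and `select_instances`, and the `threshold` parameter, are hypothetical.

```python
from typing import List, Sequence, Tuple


def margin_score(probs: Sequence[float]) -> float:
    """Stand-in score (an assumption, not the thesis's definition):
    the gap between the two highest predicted class probabilities."""
    top, second = sorted(probs, reverse=True)[:2]
    return top - second


def select_instances(
    unlabelled_probs: Sequence[Sequence[float]], threshold: float
) -> List[Tuple[int, int]]:
    """Return (instance index, pseudo-label) pairs for the unlabelled
    instances whose stand-in score is at least `threshold`."""
    selected = []
    for i, probs in enumerate(unlabelled_probs):
        if margin_score(probs) >= threshold:
            # Pseudo-label is the class with the highest predicted probability.
            pseudo_label = max(range(len(probs)), key=lambda k: probs[k])
            selected.append((i, pseudo_label))
    return selected


# Hypothetical class probabilities predicted for three unlabelled reviews.
predictions = [[0.9, 0.1], [0.55, 0.45], [0.2, 0.8]]
chosen = select_instances(predictions, threshold=0.5)
```

In a self-training loop, only the selected instances and their pseudo-labels would be added to the training set before the model is retrained; the second review, whose prediction is nearly a coin flip, would be left out under this stand-in score.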

Item Type: Thesis (Masters)
Subjects: Q Science > QA Mathematics > QA76.9.M35 Computer science -- Mathematics
Divisions: Pusat Pengajian Sains Komputer (School of Computer Sciences) > Thesis
Depositing User: Mr Noor Azizan Abu Hashim
Date Deposited: 12 Mar 2024 03:51
Last Modified: 12 Mar 2024 03:51
URI: http://eprints.usm.my/id/eprint/60138
