Aspect and Opinion Term Extraction for Hotel Reviews using Transfer Learning and Auxiliary Labels
Abstract
Transfer learning with BERT and CRF improves token and entity-level sentiment analysis in informal bahasa Indonesia hotel reviews.
Aspect and opinion term extraction is a critical step in Aspect-Based Sentiment Analysis (ABSA). Our study focuses on evaluating transfer learning using pre-trained BERT (Devlin et al., 2018) to classify tokens from hotel reviews in bahasa Indonesia. The primary challenge is the language informality of the review texts. By utilizing transfer learning from a multilingual model, we achieved up to 2% difference on token level F1-score compared to the state-of-the-art Bi-LSTM model with fewer training epochs (3 vs. 200 epochs). The fine-tuned model clearly outperforms the Bi-LSTM model on the entity level. Furthermore, we propose a method to include CRF with auxiliary labels as an output layer for the BERT-based models. The CRF addition further improves the F1-score for both token and entity level.
Models citing this paper 0
No model linking this paper
Datasets citing this paper 2
Spaces citing this paper 1
Collections including this paper 0
No Collection including this paper