Authors: | Žunić, Anastazia; Corcoran, Padraig; Spasić, Irena |
Title: | Improving the performance of sentiment analysis in health and wellbeing using domain knowledge |
Conference: | Third UK Healthcare Text Analytics Conference (HealTAC), London, UK, 22-24 April 2020 |
Issue Date: | 2020 |
Abstract: | Sentiment analysis is a natural language processing task that aims to automatically classify the sentiment expressed in text. In this study, we compare the performance of five publicly available sentiment analysis tools along with an ensemble method that combines them. Their performance was evaluated on two datasets, both of which represent user-generated content. The first, drug reviews, is related to health and wellbeing. The second, movie reviews, is used for cross-domain comparison of sentiment analysis. Explicit domain knowledge formally modelled by the Unified Medical Language System was used for semantic enrichment, to investigate whether it can improve the performance of the sentiment analysis tools considered by reducing their bias towards negative sentiment. Our experiments demonstrated an improvement in F-score of 7 percentage points. |