Topic and sentiment analysis of Slovene news media using natural language processing
DOI:
https://doi.org/10.31449/upinf.159Keywords:
sentiment analysis, latent Dirichlet allocation, topic modeling, model BERT, natural language processing, Slovenian news mediaAbstract
We compare topics and sentiment in Slovenian news media. We analysed the sentiment of seven media concerning specific political events or topics in 2019 and 2020. We used two phases of LDA modelling to detect a number of specific topics. For the sentiment analysis task, we fine-tuned large pretrained Slovenian masked language model, SloBERTa, and used it to classify articles in one of three classes (positive, neutral or negative). In the set of selected topics, we observed considerable differences between media in frequency and sentiment of reporting.Downloads
Published
2022-05-04
How to Cite
[1]
Bajt, J. and Robnik-Šikonja, M. 2022. Topic and sentiment analysis of Slovene news media using natural language processing. Applied Informatics. 30, 1 (May 2022). DOI:https://doi.org/10.31449/upinf.159.
Issue
Section
Scientific articles