Topic and sentiment analysis of Slovene news media using natural language processing

Authors

  • Jan Bajt Fakulteta za računalništvo in informatiko
  • Marko Robnik-Šikonja Fakulteta za računalništvo in informatiko

DOI:

https://doi.org/10.31449/upinf.159

Keywords:

sentiment analysis, latent Dirichlet allocation, topic modeling, model BERT, natural language processing, Slovenian news media

Abstract

We compare topics and sentiment in Slovenian news media. We analysed the sentiment of seven media concerning specific political events or topics in 2019 and 2020. We used two phases of LDA modelling to detect a number of specific topics. For the sentiment analysis task, we fine-tuned large pretrained Slovenian masked language model, SloBERTa, and used it to classify articles in one of three classes (positive, neutral or negative). In the set of selected topics, we observed considerable differences between media in frequency and sentiment of reporting.

Downloads

Published

2022-05-04

How to Cite

[1]
Bajt, J. and Robnik-Šikonja, M. 2022. Topic and sentiment analysis of Slovene news media using natural language processing. Applied Informatics. 30, 1 (May 2022). DOI:https://doi.org/10.31449/upinf.159.

Issue

Section

Scientific articles