Classifying Hate Speech using Fine-tuned Language Models

Detta är en Master-uppsats från Uppsala universitet/Statistiska institutionen

Författare: Erik Brorson; [2018]

Nyckelord: machine learning; natural language processing; hate speech; transfer learning; semi-supervised learning; recurrent neural networks;

Sammanfattning: Given the explosion in the size of social media, the amount of hate speech is also growing. To efficiently combat this issue we need reliable and scalable machine learning models. Current solutions rely on crowdsourced datasets that are limited in size, or using training data from self-identified hateful communities, that lacks specificity. In this thesis we introduce a novel semi-supervised modelling strategy. It is first trained on the freely available data from the hateful communities and then fine-tuned to classify hateful tweets from crowdsourced annotated datasets. We show that our model reach state of the art performance with minimal hyper-parameter tuning.

HÄR KAN DU HÄMTA UPPSATSEN I FULLTEXT. (följ länken till nästa sida)

Classifying Hate Speech using Fine-tuned Language Models

Sökningar just nu

Populära sökningar

Uppsatser med många visningar igår (2024-04-23)