Publication:
SocialHaterBERT: A dichotomous approach for automatically detecting hate speech on Twitter through textual analysis and user profiles

dc.affiliation.instituto: UC3M. Instituto UC3M - Santander de Big Data [es]
dc.contributor.author: Valle Cano, Gloria del
dc.contributor.author: Quijano Sánchez, Lara
dc.contributor.author: Liberatore, Federico
dc.contributor.author: Gómez, Jesús
dc.contributor.funder: European Commission [es]
dc.contributor.funder: Ministerio de Ciencia e Innovación (España) [es]
dc.date.accessioned: 2023-11-16T10:10:24Z
dc.date.available: 2023-11-16T10:10:24Z
dc.date.issued: 2023-04-15
dc.description.abstract: Social media platforms have evolved into an online representation of our social interactions. We may use the resources they provide to analyze phenomena that occur within them, such as the development and viralization of offensive and hostile content. In today's polarized world, the escalating nature of this behavior is cause for concern in modern society. This research includes an in-depth examination of previous efforts and strategies for detecting and preventing hateful content on the social network Twitter, as well as a novel classification approach based on users' profiles, related social environment and generated tweets. This paper's contribution is threefold: (i) an improvement in the performance of the HaterNet algorithm, an expert system developed in collaboration with the Spanish National Office Against Hate Crimes of the Spanish State Secretariat for Security (Ministry of the Interior) that is capable of identifying and monitoring the evolution of hate speech on Twitter using an LSTM + MLP neural network architecture. To that end, a model based on BERT, HaterBERT, has been created and tested using HaterNet's public dataset, providing results that show a significant improvement; (ii) a methodology to create a user database in the form of a relational network to infer textual and centrality features. This contribution, SocialGraph, has been independently tested with various traditional Machine Learning and Deep Learning algorithms, demonstrating its usefulness in spotting haters; (iii) a final model, SocialHaterBERT, that integrates the previous two approaches by analyzing features other than those inherent in the text. Experiment results reveal that this last contribution greatly improves outcomes, establishing a new field of study that transcends textual boundaries, paving the way for future research in coupled models from a diachronic and dynamic perspective. [en]
dc.description.sponsorship: The research of Quijano-Sánchez was conducted with financial support from the Spanish Ministry of Science and Innovation, grant PID2019-108965GB-I00. The research of Liberatore is partially funded by the European Commission's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie, grant number MSCA-RISE 691161 (GEO-SAFE), and the Government of Spain, grant MTM2015-65803-R. [en]
dc.description.status: Publicado [es]
dc.format.extent: 17
dc.identifier.bibliographicCitation: Expert Systems with Applications, (2023), 216:119446, (17 p.). [en]
dc.identifier.doi: https://doi.org/10.1016/j.eswa.2022.119446
dc.identifier.issn: 0957-4174
dc.identifier.publicationfirstpage: 1
dc.identifier.publicationissue: 119446
dc.identifier.publicationlastpage: 17
dc.identifier.publicationtitle: EXPERT SYSTEMS WITH APPLICATIONS [en]
dc.identifier.publicationvolume: 216
dc.identifier.uri: https://hdl.handle.net/10016/38884
dc.identifier.uxxi: AR/0000033524
dc.language.iso: eng [en]
dc.publisher: Elsevier [en]
dc.relation.projectID: Gobierno de España. PID2019-108965GB-I00 [es]
dc.relation.projectID: Gobierno de España. MTM2015-65803-R [es]
dc.rights: © 2022 The Author(s). Published by Elsevier Ltd. [en]
dc.rights: This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). [en]
dc.rights: Atribución-NoComercial-SinDerivadas 3.0 España [*]
dc.rights.accessRights: open access [en]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/es/ [*]
dc.subject.eciencia: Informática [es]
dc.subject.other: Hate speech [en]
dc.subject.other: Twitter [en]
dc.subject.other: Deep learning [en]
dc.subject.other: Social network analysis [en]
dc.subject.other: Bidirectional encoder representations from transformers [en]
dc.subject.other: BERT [en]
dc.subject.other: Topic modeling [en]
dc.title: SocialHaterBERT: A dichotomous approach for automatically detecting hate speech on Twitter through textual analysis and user profiles [en]
dc.type: research article [*]
dc.type.hasVersion: VoR [*]
dspace.entity.type: Publication
Files
Original bundle
Name: SocialHaterBERT_ESWA_2023.pdf
Size: 876.62 KB
Format: Adobe Portable Document Format