Evaluating feature combination strategies for hate-speech detection in Spanish using linguistic features and transformers

García Díaz, José Antonio; Jiménez Zafra, Salud María; García Cumbreras, Miguel Ángel; Valencia García, Rafael

Files

s40747-022-00693-x.pdf (2.94 MB)

Date

2023

Authors

García Díaz, José Antonio; Jiménez Zafra, Salud María; García Cumbreras, Miguel Ángel; Valencia García, Rafael

Secondary author

Faculties of the UMU::Faculty of Computer Science (Facultad de Informática)

Publisher

Springer

Department

Computer Science and Systems (Informática y Sistemas)

DOI

https://doi.org/10.1007/s40747-022-00693-x

Type

info:eu-repo/semantics/article

Abstract

The rise of social networks has allowed misogynistic, xenophobic, and homophobic people to spread hate-speech to intimidate individuals or groups because of their gender, ethnicity, or sexual orientation. The consequences of hate-speech are devastating, causing severe depression and even leading people to commit suicide. Hate-speech identification is challenging because the large number of daily publications makes it impossible to review every comment by hand. Moreover, hate-speech is also spread through hoaxes, which require language and context understanding. With the aim of reducing the number of comments that must be reviewed by experts, or even of developing autonomous systems, the automatic identification of hate-speech has gained academic relevance. However, the reliability of automatic approaches is still limited, particularly in languages other than English, for which some state-of-the-art techniques have not been analyzed in detail. In this work, we examine which features are most effective in identifying hate-speech in Spanish and how these features can be combined to develop more accurate systems. In addition, we characterize the language present in each type of hate-speech by means of explainable linguistic features and compare our results with state-of-the-art approaches. Our research indicates that combining linguistic features and transformers by means of knowledge integration outperforms current solutions for hate-speech identification in Spanish.

Subject

Hate speech, Feature engineering, Knowledge integration, Text classification, Natural language processing

Citation

Complex & Intelligent Systems, 2023, Vol. 9, pp. 2893–2914

URI

http://hdl.handle.net/10201/186890

Collections

Articles

This item is subject to a Creative Commons license. http://creativecommons.org/licenses/by/4.0/
