Development of a smart tracker that identifies offensive tweets against women and migrants
A research team from the University of Jaén is applying a system to recognise misogynist and xenophobic comments on Twitter. The technology learns to distinguish the nuances of a wide range of phrases, words and insults in Spanish, and the experts are using it to detect hate messages on the social network.
The research group SINAI (Sistemas de Acceso Inteligente a la Información, i.e. Intelligent Access to Information Systems) of the University of Jaén has developed a system based on artificial intelligence to identify misogynist and xenophobic messages on Twitter. The method can be used in areas such as policing and law enforcement to detect hate messages, as well as to moderate the language of tweets posted on the social network. The program is based on machine learning algorithms, including neural networks, structures loosely inspired by the functioning of the human brain that ‘learn’ to capture the nuances of messages.
The experts point out that the system identifies offensive content in Spanish and deals with ambiguities of language in order to catch hate messages on Twitter. They add that it could be used as a warning mechanism to detect comments that incite hatred and violence towards women and migrants.
To identify offensive statements, the researchers use neural networks, which are loosely modelled on the human brain: they connect nodes that interpret information and order it. The system is trained on data made up of insults, terms and derogatory expressions, from which it ‘learns’ the patterns and structure of the language so that it can classify new tweets and detect those that are offensive. “Some phrases include essential pronouns or determiners that can completely change the meaning of a sentence. With our system and with the help of linguistic resources, we can identify expressions related to hate speech,” Flor Miriam Plaza, a researcher at the University of Jaén, told Fundación Descubre.
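The idea of learning from labelled examples can be illustrated with a very small sketch. It is not the researchers' model: it swaps their neural networks for a single artificial neuron (a perceptron) over word counts, and the training sentences, the placeholder token `insulto_a` and the function names are all hypothetical.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())

def train_perceptron(examples, epochs=10):
    """Train one artificial neuron on (text, label) pairs; label 1 = offensive."""
    weights = defaultdict(int)
    bias = 0
    for _ in range(epochs):
        for text, label in examples:
            tokens = tokenize(text)
            score = bias + sum(weights[t] for t in tokens)
            predicted = 1 if score > 0 else 0
            error = label - predicted  # 0 when the guess was right
            if error:
                # Nudge every word seen in a misclassified tweet.
                for t in tokens:
                    weights[t] += error
                bias += error
    return weights, bias

def predict(text, weights, bias):
    """Return 1 if the tweet is scored as offensive, otherwise 0."""
    score = bias + sum(weights.get(t, 0) for t in tokenize(text))
    return 1 if score > 0 else 0

# Hypothetical toy data; "insulto_a" stands in for a real lexicon entry.
TRAIN = [
    ("eres un insulto_a", 1),
    ("vaya insulto_a eres", 1),
    ("buenos dias a todos", 0),
    ("eres un gran amigo", 0),
]
WEIGHTS, BIAS = train_perceptron(TRAIN)
```

A model this simple weighs each word separately, so it cannot capture the pronoun and determiner effects the researcher describes; that is one reason context-aware models are preferred in practice.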
In the study, entitled ‘Detecting Misogyny and Xenophobia in Spanish Tweets Using Language Technologies’ and published in ACM Transactions on Internet Technology, the researchers explain that in order to ‘instruct’ the system they semi-automatically generated four lists of words in Spanish consisting of offensive and insulting expressions and words against women and migrants.
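One plausible way to use such word lists is to turn them into numeric features for a classifier. The sketch below counts how many tokens of a tweet appear in each list. The list names and their entries are hypothetical placeholders, since the article does not reproduce the actual lexicons, and the study may use them differently.

```python
import re
from collections import Counter

# Hypothetical stand-ins for the four word lists described in the study.
LEXICONS = {
    "misogyny_terms": {"termino_a", "termino_b"},
    "misogyny_insults": {"insulto_a"},
    "xenophobia_terms": {"termino_c"},
    "xenophobia_insults": {"insulto_b", "insulto_c"},
}

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())

def lexicon_features(text):
    """Count, for each list, how many tokens of the tweet belong to it."""
    counts = Counter(tokenize(text))
    return {
        name: sum(counts[word] for word in words)
        for name, words in LEXICONS.items()
    }
```

The resulting counts could be appended to whatever other features a model uses, so that a tweet matching a list entry is more likely to be examined as potential hate speech.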
On this basis, once a tweet has been detected, the system identifies hate speech aimed specifically at these two groups. “Currently, we continue to integrate new lexical resources into this technology, such as dictionaries and word lists, so that the system will increasingly distinguish nuances in tweets; in this way we increase its accuracy and effectiveness,” explains María Teresa Martín, researcher at the University of Jaén.
Message collection and analysis
During the research process, experts collected tweets using offensive terms such as ‘zorra’ (‘bitch’) or ‘negrata’ (‘nigger’). The system identifies the context in which these words are used and recognises whether they are being applied as insults. Thereby the program detects and collects pejorative messages towards women or migrants.
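The collection step can be pictured as a keyword filter that keeps only tweets containing at least one seed term, leaving the decision about context to a later classifier. This is a minimal sketch under that assumption; the seed terms are placeholders and `collect_candidates` is a hypothetical helper, not part of the researchers' software.

```python
import re

# Hypothetical placeholder seeds; the real search terms are not listed here.
SEED_TERMS = {"termino_a", "termino_b"}

def collect_candidates(tweets, seeds=SEED_TERMS):
    """Keep tweets that contain any seed term as a whole word, ignoring case."""
    alternatives = "|".join(re.escape(s) for s in sorted(seeds))
    pattern = re.compile(r"\b(?:" + alternatives + r")\b", re.IGNORECASE)
    return [tweet for tweet in tweets if pattern.search(tweet)]
```

Matching on whole words avoids collecting tweets where a seed term merely occurs inside a longer word, but it says nothing about whether the term is used as an insult; that judgement is the part the article attributes to the trained system.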

María Dolores Molina, Flor Miriam Plaza del Arco, María Teresa Valdivia and L. Alfonso Ureña, authors of the study.
The researchers explain that the technology can be applied in various areas, such as legal proceedings or marketing: collecting and analysing each message by hand demands time and dedication from a specialist, whereas the program does it automatically. “This system can be useful for the police or agencies that currently work with complaints of bullying or hate speech,” explains María Dolores Molina, researcher at the University of Jaén.
In previous studies, the research group SINAI focused its efforts on detecting cases of anorexia and bulimia on social media, as well as on recognising the emotions of Twitter users. “We want to improve technology based on artificial intelligence and machine learning by applying techniques focused on sentiment analysis. This will allow it to be applied in a wider variety of areas and to offer support to organisations that could need it,” says L. Alfonso Ureña, researcher at the University of Jaén.
This research was supported by the own funds of the University of Jaén’s SINAI (Sistemas de Acceso Inteligente a la Información, i.e. Intelligent Access to Information Systems) research group, the European Regional Development Fund (FEDER), the LIVING-LANG project, the REDES project and a grant for FPI pre-doctoral contracts (reference PRE2019-089310) awarded by the Spanish Ministry of Science, Innovation and Universities.
Spanish version: Desarrollan un rastreador inteligente que identifica tuits ofensivos contra mujeres y migrantes
References
Plaza-del-Arco, F.M.; Molina-González, M.D.; Ureña-López, L.A. & Martín-Valdivia, M.T. (2020). “Detecting Misogyny and Xenophobia in Spanish Tweets Using Language Technologies”. ACM Trans. Internet Technol. 20, 2, Article 12, 19 pages.