Team Curie at HSD-2Lang 2024: Hate Speech Detection in Turkish and Arabic Tweets using BERT-based models

Ehsan Barkhordar*, Işık S. Topçu*, Ali Hürriyetoğlu*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › Academic › peer-review

1 Citation (Scopus)

Abstract

This study focuses on hate speech detection in Turkish and Arabic tweets using advanced BERT-based models. Performance metrics demonstrate the models' effectiveness, with the Turkish variant achieving a 71.8% F1 score and the Arabic model a 76.9% F1 score, ranking them fourth and third, respectively, in a competitive leaderboard. Performance enhancements were realized through targeted preprocessing, including emoji translation and user mention exclusion, and thoughtful data balancing approaches. Future directions include refining model accuracy and broadening language support. Our reproducible approach and detailed findings are accessible on GitHub.
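The abstract names three data-side steps (emoji translation, user mention exclusion, and data balancing) without saying how they were implemented. The following is a minimal sketch of what such steps could look like, not the authors' code. The `EMOJI_TEXT` table, the English descriptions, the `@\w+` mention pattern, and the choice of random oversampling as the balancing strategy are all assumptions made for illustration.

```python
import random
import re

# Hypothetical emoji-to-text table; the abstract does not give the actual
# mapping or the language the emoji were translated into.
EMOJI_TEXT = {
    "\U0001F621": "angry face",
    "\U0001F602": "face with tears of joy",
    "\u2764\ufe0f": "red heart",
}

# Assumed mention format: "@" followed by word characters.
MENTION_RE = re.compile(r"@\w+")


def preprocess(tweet: str) -> str:
    """Drop @mentions and replace known emoji with a text description."""
    text = MENTION_RE.sub(" ", tweet)
    for symbol, description in EMOJI_TEXT.items():
        text = text.replace(symbol, f" {description} ")
    # Collapse the whitespace left behind by the substitutions.
    return " ".join(text.split())


def oversample(rows, seed=0):
    """Randomly duplicate minority-class rows until all classes are equal in size.

    `rows` is a list of (text, label) pairs. Random oversampling is only one
    possible balancing approach and is an assumption here.
    """
    rng = random.Random(seed)
    by_label = {}
    for row in rows:
        by_label.setdefault(row[1], []).append(row)
    target = max(len(group) for group in by_label.values())
    balanced = []
    for group in by_label.values():
        balanced.extend(group)
        # Draw extra samples (with replacement) to reach the majority size.
        balanced.extend(rng.choices(group, k=target - len(group)))
    return balanced
```

In a pipeline of this shape, `preprocess` would be applied to every tweet before tokenization, and `oversample` only to the training split so that evaluation data keeps its original class distribution.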

Original language: English
Title of host publication: CASE 2024 - 7th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events from Text, Proceedings of the Workshop
Editors: Ali Hurriyetoglu, Hristo Tanev, Surendrabikram Thapa, Gokce Uludogan
Place of Publication: St Julians
Publisher: Association for Computational Linguistics (ACL)
Pages: 215-220
Number of pages: 6
ISBN (Electronic): 9798891760707
Publication status: Published - 2024
Event: 7th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events from Text, CASE 2024 - St. Julian's, Malta
Duration: 22 Mar 2024 → …

Conference/symposium

Conference/symposium: 7th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events from Text, CASE 2024
Country/Territory: Malta
City: St. Julian's
Period: 22/03/24 → …

  • EFRA

    Mutlu, O. (PhD candidate), Fensel, A. (Promotor), Hürriyetoğlu, A. (Co-promotor) & van der Velden, B. (Co-promotor)

    1/12/23 → …

    Project: PhD

  • EFRA: Extreme Food Risk Analytics

    1/01/23 → 31/12/25

    Project: EU research project

  • EU23033 - EFRA (BO-64-101-014)

    van der Velden, B. (Project Leader)

    1/01/23 → 31/12/23

    Project: LVVN project
