Abstract
Automatic abusive language detection is a difficult but important task for online social media. Our research explores a two-step approach of performing classification on abusive language and then classifying into specific types and compares it with one-step approach of doing one multi-class classification for detecting sexist and racist languages. With a public English Twitter corpus of 20 thousand tweets in the type of sexism and racism, our approach shows a promising performance of 0.827 F-measure by using HybridCNN in one-step and 0.824 F-measure by using logistic regression in two-steps.
| Original language | English |
|---|---|
| Title of host publication | 1st Workshop on Abusive Language Online, ALW 2017 at the 55th Annual Meeting of the Association for Computational Linguistic, ACL 2017 - Proceedings of the Workshop |
| Editors | Zeerak Waseem, Wendy Hui Kyong Chung, Dirk Hovy, Joel Tetreault |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 41-45 |
| Number of pages | 5 |
| ISBN (Electronic) | 9781945626661 |
| Publication status | Published - 2017 |
| Event | 1st Workshop on Abusive Language Online, ALW 2017 at the 55th Annual Meeting of the Association for Computational Linguistic, ACL 2017 - Proceedings of the Workshop - Vancouver, Canada Duration: 4 Aug 2017 → … |
Publication series
| Name | Proceedings of the Annual Meeting of the Association for Computational Linguistics |
|---|---|
| ISSN (Print) | 0736-587X |
Conference
| Conference | 1st Workshop on Abusive Language Online, ALW 2017 at the 55th Annual Meeting of the Association for Computational Linguistic, ACL 2017 - Proceedings of the Workshop |
|---|---|
| Country/Territory | Canada |
| City | Vancouver |
| Period | 4/08/17 → … |
Bibliographical note
Publisher Copyright:© 2017 Association for Computational Linguistics
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 5 Gender Equality
Fingerprint
Dive into the research topics of 'One-step and two-step classification for abusive language detection on twitter'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver