The ever-increasing use of social media in the internet space have induced a number of problems like cyberbullying and cyberaggression over the internet. Researchers have made a commendable progress on the ongoing fight against cyberbullying but a lot of unresolved issues still persist that primarily motivates the purpose of the research. The paper aims to integrate recent advances in the field of word embedding like fastText, ELMo and stacked flair embeddings combined with a host of robust deep learning techniques to further the efficiency of detection over the state-of-art. Two distinct datasets Formspring and Wikipedia were requested and processed for the purpose of the research. A number of different combinations of word embedding with deep learning methods were tested and compared with CNN with ELMo embedding delivering the most promising results with an F1 score of 0.82 on both datasets. On the other hand, CNN with fastText obtained F1 score of 0.82 on Formspring and 0.64 on Wikipedia dataset but was computationally faster than the counterparts. Moreover, transfer learning was performed using the models to test and prove the robustness and efficacy of the models. The system performed considerably well with superior scores in precision, recall and F1 over the state-of-the-art across all the test cases performed.
We have developed the following database of research theses on bullying from all academic institutions in the UK and Ireland. The aim of this database is to assist those who are interested in the field of bullying and want to see what research has already been done. We have attempted to ensure that we have included all relevant theses here; but if there is an omission please let us know by emailing geraldine.kiernan@dcu.ie.
The database is here for information purposes. Those who want access to the texts of the theses need to contact the author, the relevant institution, or both.