Annotation Guidelines Refinement

To define a concrete approach to classification, we need to choose the architecture and techniques most likely to beat the state of the art. I therefore surveyed the recent literature on the newest approaches and read the following papers:

We decided to move forward with a subword-level approach using byte-pair encoding (BPE), combined with hand-crafted feature analysis.
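To make the subword idea concrete, here is a minimal sketch of how BPE merges can be learned from word frequencies. It is only an illustration of the general technique, not our actual pipeline: the function names (`get_pair_counts`, `merge_pair`, `learn_bpe`), the `</w>` end-of-word marker, and the toy word counts are all assumptions made for this example.

```python
from collections import Counter


def get_pair_counts(vocab):
    """Count adjacent symbol pairs, weighted by word frequency."""
    pairs = Counter()
    for symbols, freq in vocab.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs


def merge_pair(pair, vocab):
    """Replace every occurrence of `pair` with a single merged symbol."""
    merged = {}
    for symbols, freq in vocab.items():
        out = []
        i = 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        key = tuple(out)
        merged[key] = merged.get(key, 0) + freq
    return merged


def learn_bpe(word_freqs, num_merges):
    """Learn up to `num_merges` merges from a {word: count} mapping."""
    # Start from characters, with a hypothetical end-of-word marker.
    vocab = {tuple(word) + ("</w>",): freq for word, freq in word_freqs.items()}
    merges = []
    for _ in range(num_merges):
        pairs = get_pair_counts(vocab)
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # most frequent adjacent pair
        vocab = merge_pair(best, vocab)
        merges.append(best)
    return merges, vocab


# Toy counts, invented purely for illustration.
toy_counts = {"low": 5, "lower": 2, "newest": 6, "widest": 3}
merges, vocab = learn_bpe(toy_counts, num_merges=3)
print(merges)
print(vocab)
```

In practice we would more likely rely on an existing tokenizer library than on hand-written code; the sketch only shows why frequent character sequences end up as single subword units.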

Annotation

Based on a discussion with Pulkit Parikh sir, we decided to change the guidelines. We added a "both" class alongside the "group" and "individual" classes, and introduced proper definitions. However, some confusion remains about the classification of the hate classes. The guidelines are available here:

https://docs.google.com/document/d/1__HEQjTVmcONpc_LY1J0R-zZN1sf7l39oWtioUpWbVg/edit?usp=sharing

We referred to the following papers and articles to come up with the classes stated above:

  • https://arxiv.org/abs/1807.03688
  • https://arxiv.org/abs/1812.01693
  • https://www.facebook.com/communitystandards/hate_speech
  • https://help.twitter.com/en/rules-and-policies/hateful-conduct-policy
  • etc.

We will finalize the annotation guidelines soon and then annotate around 400 posts to check that everything is in order.