Skip to Main Content
 

Global Search Box

 
 
 
 

ETD Abstract Container

Abstract Header

Comparative Adjudication of Noisy and Subjective Data Annotation Disagreements for Deep Learning

Abstract Details

2023, Master of Science (MS), Wright State University, Computer Science.
Obtaining accurate inferences from deep neural networks is difficult when models are trained on instances with conflicting labels. Algorithmic recognition of online hate speech illustrates this. No human annotator is perfectly reliable, so multiple annotators evaluate and label online posts in a corpus. Labeling scheme limitations, differences in annotators' beliefs, and limits to annotators' honesty and carefulness cause some labels to disagree. Consequently, decisive and accurate inferences become less likely. Some practical applications such as social research can tolerate some indecisiveness. However, an online platform using an indecisive classifier for automated content moderation could create more problems than it solves. Disagreements can be addressed in training by using the label a majority of annotators assigned (majority vote), training only with unanimously annotated cases (clean filtering), and representing training labels as probabilities (soft labeling). This study shows clean filtering occasionally outperforming majority voting, and soft labeling outperforming both.
Krishnaprasad Thirunarayan, Ph.D. (Advisor)
Shu Schiller, Ph.D. (Committee Member)
Michael Raymer, Ph.D. (Committee Member)
58 p.

Recommended Citations

Citations

  • Williams, S. D. (2023). Comparative Adjudication of Noisy and Subjective Data Annotation Disagreements for Deep Learning [Master's thesis, Wright State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=wright1682700129672253

    APA Style (7th edition)

  • Williams, Scott. Comparative Adjudication of Noisy and Subjective Data Annotation Disagreements for Deep Learning. 2023. Wright State University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=wright1682700129672253.

    MLA Style (8th edition)

  • Williams, Scott. "Comparative Adjudication of Noisy and Subjective Data Annotation Disagreements for Deep Learning." Master's thesis, Wright State University, 2023. http://rave.ohiolink.edu/etdc/view?acc_num=wright1682700129672253

    Chicago Manual of Style (17th edition)