Files
Download Full Text (871 KB)
Date of Graduation
5-2025
Description
NLP (natural language processing) models often rely on human-labeled data for training and evaluation. Many approaches crowdsource this data from a large number of annotators with varying skills, backgrounds, and motivations, resulting in conflicting annotations. These conflicts have traditionally been resolved by aggregation methods that assume disagreements are errors. Recent work has argued that for many tasks annotators may have genuine disagreements and that variation should be treated as signal rather than noise. However, limited work has combined the two frameworks to separate signal from noise in human-labeled data. In this work, we introduce NUTMEG, a new Bayesian model that incorporates information about annotator backgrounds to remove noisy annotations from human-labeled training data while preserving systematic disagreements. We then use a synthetic data evaluation framework to show that NUTMEG is more effective at recovering ground-truth from annotations with systematic disagreement than traditional aggregation methods. We provide further analysis characterizing how differences in subpopulation sizes, rates of disagreement, and rates of spam affect the performance of our model. Finally, we demonstrate that downstream models trained on data aggregated by NUTMEG significantly outperform both models trained on traditionally aggregated data and models trained on the full set of disaggregated annotations. Our results highlight the importance of accounting for both annotator competence and systematic disagreements when training on human-labeled data.
Publication Date
2025
Document Type
Book
Degree Name
Bachelor of Science in Data Science
Degree Level
Undergraduate
Department
Mathematical Sciences
Advisor/Mentor
Gauch, Susan
Disciplines
Engineering
Keywords
Engineering; Data Science
Citation
Ivey, J. W., Gauch, S., & Jurgens, D. (2025). Separating Signal From Noise in Annotator Disagreement. 2025 Research Poster Competition. Retrieved from https://scholarworks.uark.edu/hnrcsturpc25/5