论文标题
通过建模常见的困惑向人群学习
Learning from Crowds by Modeling Common Confusions
论文作者
论文摘要
众包提供了一种实用方法,可以低成本获得大量标记数据。但是,注释者的注释质量差异很大,这在从众包注释中学习高质量模型时构成了新的挑战。在这项工作中,我们提供了一种新的观点,可以将注释噪声分解为常见的噪声和个体噪声,并根据实例难度和注释者的专业知识来区分混乱的来源。我们通过端到端学习解决方案具有两种类型的噪声适应层:一个在注释者之间共享一个新的众包模型,以捕获其常见的共同困惑,而另一个则与每个注释者有关,以实现个人困惑。为了识别每个注释中的噪声源,我们使用辅助网络选择相对于实例和注释者的两个噪声适应层。对合成和实际基准测试的广泛实验证明了我们提出的共同噪声适应解决方案的有效性。
Crowdsourcing provides a practical way to obtain large amounts of labeled data at a low cost. However, the annotation quality of annotators varies considerably, which imposes new challenges in learning a high-quality model from the crowdsourced annotations. In this work, we provide a new perspective to decompose annotation noise into common noise and individual noise and differentiate the source of confusion based on instance difficulty and annotator expertise on a per-instance-annotator basis. We realize this new crowdsourcing model by an end-to-end learning solution with two types of noise adaptation layers: one is shared across annotators to capture their commonly shared confusions, and the other one is pertaining to each annotator to realize individual confusion. To recognize the source of noise in each annotation, we use an auxiliary network to choose the two noise adaptation layers with respect to both instances and annotators. Extensive experiments on both synthesized and real-world benchmarks demonstrate the effectiveness of our proposed common noise adaptation solution.