论文标题

具有众包功能的多任务合奏可改善皮肤病变诊断

Multi-task Ensembles with Crowdsourced Features Improve Skin Lesion Diagnosis

论文作者

Raumanns, Ralf, Contar, Elif K, Schouten, Gerard, Cheplygina, Veronika

论文摘要

机器学习对大量注释数据有认可的需求。由于专家注释的高成本,众包被要求作为替代品标记或大纲图像标签或大纲图像。尽管报道了许多有希望的结果,但是诊断性众包标签的质量尚不清楚。我们建议通过向人群询问图像的视觉特征来解决这一问题,这些功能可以更直观地提供,并通过合奏策略在多任务学习框架中使用这些功能。我们将提出的方法与基线模型与ISIC 2017挑战数据集的2000个皮肤病变进行了比较。基线模型仅预测皮肤病变图像的二进制标签,而我们的多任务模型还预测了以下特征之一:病变,边界不规则性和颜色的不对称性。我们表明,具有单个众包特征的多任务模型对模型的影响有限,但是当合并合并时,可以改善概括。基线模型的接收器操作特性曲线下的面积为0.794,多任务合奏分别为0.811和0.808。最后,我们讨论发现,确定一些局限性,并建议进一步研究的方向。该模型的代码可从https://github.com/raumannsr/hints_crowd获得。

Machine learning has a recognised need for large amounts of annotated data. Due to the high cost of expert annotations, crowdsourcing, where non-experts are asked to label or outline images, has been proposed as an alternative. Although many promising results are reported, the quality of diagnostic crowdsourced labels is still unclear. We propose to address this by instead asking the crowd about visual features of the images, which can be provided more intuitively, and by using these features in a multi-task learning framework through ensemble strategies. We compare our proposed approach to a baseline model with a set of 2000 skin lesions from the ISIC 2017 challenge dataset. The baseline model only predicts a binary label from the skin lesion image, while our multi-task model also predicts one of the following features: asymmetry of the lesion, border irregularity and color. We show that multi-task models with individual crowdsourced features have limited effect on the model, but when combined in an ensembles, leads to improved generalisation. The area under the receiver operating characteristic curve is 0.794 for the baseline model and 0.811 and 0.808 for multi-task ensembles respectively. Finally, we discuss the findings, identify some limitations and recommend directions for further research. The code of the models is available at https://github.com/raumannsr/hints_crowd.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源