通过颜色失真预测进行的半监督时尚兼容性预测

论文标题

通过颜色失真预测进行的半监督时尚兼容性预测

Semi-supervised Fashion Compatibility Prediction by Color Distortion Prediction

论文作者

Xiao, Ling, Yamasaki, Toshihiko

论文摘要

监督的学习方法已经遭受了这样一个事实，即必须获得一个大规模标记的数据集，这很难获得。对于时尚兼容性预测，这是一个更重要的问题，因为兼容性旨在捕捉人们对稀疏和变化的美学的看法。因此，由于快速时尚，标记的数据集可能会迅速过时。此外，标记数据集始终需要一些专业知识。至少他们应该有很好的美学感。但是，该领域的自我/半监督学习技术有限。在本文中，我们提出了一项一般的色彩失真预测任务，迫使基线识别低级图像信息，以了解时尚兼容性预测的更多判别性表示。具体而言，我们首先建议通过调整图像平衡，对比度，清晰度和亮度来扭曲图像。然后，我们建议将高斯噪声添加到扭曲的图像中，然后再将其传递到卷积神经网络（CNN）骨架上，以了解所有可能的扭曲的概率分布。提出的借口任务是在时尚兼容性的最新方法中采用的，并显示了其在提高这些方法提取更好特征表示能力方面的有效性。将提出的借口任务应用于基线可以始终优于原始基线。

Supervised learning methods have been suffering from the fact that a large-scale labeled dataset is mandatory, which is difficult to obtain. This has been a more significant issue for fashion compatibility prediction because compatibility aims to capture people's perception of aesthetics, which are sparse and changing. Thus, the labeled dataset may become outdated quickly due to fast fashion. Moreover, labeling the dataset always needs some expert knowledge; at least they should have a good sense of aesthetics. However, there are limited self/semi-supervised learning techniques in this field. In this paper, we propose a general color distortion prediction task forcing the baseline to recognize low-level image information to learn more discriminative representation for fashion compatibility prediction. Specifically, we first propose to distort the image by adjusting the image color balance, contrast, sharpness, and brightness. Then, we propose adding Gaussian noise to the distorted image before passing them to the convolutional neural network (CNN) backbone to learn a probability distribution over all possible distortions. The proposed pretext task is adopted in the state-of-the-art methods in fashion compatibility and shows its effectiveness in improving these methods' ability in extracting better feature representations. Applying the proposed pretext task to the baseline can consistently outperform the original baseline.

下载PDF全文

下载文献需遵守相关版权规定

论文标题