论文标题
通过标签平滑为手术场景理解的标签平滑学习的节奏训练课程学习
Confidence-Aware Paced-Curriculum Learning by Label Smoothing for Surgical Scene Understanding
论文作者
论文摘要
课程学习和自定进度学习是培训策略,逐渐将样品从易于到更复杂的情况下。由于他们在机器人视觉中的出色表现,他们引起了人们的关注。最近的作品着重于根据输入样品的难度水平设计课程或平滑特征图。但是,仍未探索平滑标签以以课程方式控制学习实用程序。在这项工作中,我们使用标签平滑(P-CBL)设计了一个节奏的课程(P-CBL),使用带有均匀标签平滑(ULS)进行分类任务的节奏学习,并以课程方式以课程方式进行融合均匀和空间变化的标签平滑(SVL)。在ULS和SVL中,更大的平滑因子值在真实的标签中执行了沉重的平滑惩罚,并限制了学习信息更少的信息。因此,我们通过标签平滑(CBL)设计课程。我们在训练开始时设定了更大的平滑值,并将其逐渐减少到零,以控制模型学习实用程序从下部到更高。我们还设计了一种自信的起搏功能,并将其与CBL相结合,以研究各种课程的好处。该技术在多个多类,多标签分类,字幕和分割任务的四个机器人手术数据集上进行了验证。我们还通过将验证数据损坏为不同的严重程度来研究我们方法的鲁棒性。我们的广泛分析表明,提出的方法提高了预测准确性和鲁棒性。
Curriculum learning and self-paced learning are the training strategies that gradually feed the samples from easy to more complex. They have captivated increasing attention due to their excellent performance in robotic vision. Most recent works focus on designing curricula based on difficulty levels in input samples or smoothing the feature maps. However, smoothing labels to control the learning utility in a curriculum manner is still unexplored. In this work, we design a paced curriculum by label smoothing (P-CBLS) using paced learning with uniform label smoothing (ULS) for classification tasks and fuse uniform and spatially varying label smoothing (SVLS) for semantic segmentation tasks in a curriculum manner. In ULS and SVLS, a bigger smoothing factor value enforces a heavy smoothing penalty in the true label and limits learning less information. Therefore, we design the curriculum by label smoothing (CBLS). We set a bigger smoothing value at the beginning of training and gradually decreased it to zero to control the model learning utility from lower to higher. We also designed a confidence-aware pacing function and combined it with our CBLS to investigate the benefits of various curricula. The proposed techniques are validated on four robotic surgery datasets of multi-class, multi-label classification, captioning, and segmentation tasks. We also investigate the robustness of our method by corrupting validation data into different severity levels. Our extensive analysis shows that the proposed method improves prediction accuracy and robustness.