论文标题
图流程:双重有效医疗图像分割的跨层图流量蒸馏
Graph Flow: Cross-layer Graph Flow Distillation for Dual Efficient Medical Image Segmentation
论文作者
论文摘要
随着深度卷积神经网络的发展,近年来,医学图像分割取得了一系列突破。但是,高性能卷积神经网络始终意味着许多参数和高计算成本,这将阻碍在临床情况下的应用。同时,大规模注释的医学图像数据集的稀缺性进一步阻碍了高性能网络的应用。为了解决这些问题,我们提出了图形流,即一个全面的知识蒸馏框架,以解决网络效率和注释效率的医学图像分割。具体而言,我们的核心图流动蒸馏将跨层变化的本质从训练有素的繁琐教师网络转移到未经训练的紧凑型学生网络。此外,无监督的解释器模块被整合在一起以净化教师网络的知识,这也对稳定培训程序也有益。此外,我们通过集成对抗性蒸馏和香草逻辑蒸馏来建立一个统一的蒸馏框架,这可以进一步完善紧凑网络的最终预测。借助不同的教师网络(常规的卷积架构或普遍的变压器架构)和学生网络,我们对四个具有不同模态的医学图像数据集进行了广泛的实验(胃癌,Synapse,Busi和CVC-ClinicDB)。我们证明了在这些数据集中实现竞争性能的重要能力。此外,我们证明了图形通过新型半监督范式进行双重有效医学图像分割的有效性。我们的代码将在图流量下可用。
With the development of deep convolutional neural networks, medical image segmentation has achieved a series of breakthroughs in recent years. However, the high-performance convolutional neural networks always mean numerous parameters and high computation costs, which will hinder the applications in clinical scenarios. Meanwhile, the scarceness of large-scale annotated medical image datasets further impedes the application of high-performance networks. To tackle these problems, we propose Graph Flow, a comprehensive knowledge distillation framework, for both network-efficiency and annotation-efficiency medical image segmentation. Specifically, our core Graph Flow Distillation transfer the essence of cross-layer variations from a well-trained cumbersome teacher network to a non-trained compact student network. In addition, an unsupervised Paraphraser Module is integrated to purify the knowledge of the teacher network, which is also beneficial for the stabilization of training procedure. Furthermore, we build a unified distillation framework by integrating the adversarial distillation and the vanilla logits distillation, which can further refine the final predictions of the compact network. With different teacher networks (conventional convolutional architecture or prevalent transformer architecture) and student networks, we conduct extensive experiments on four medical image datasets with different modalities (Gastric Cancer, Synapse, BUSI, and CVC-ClinicDB).We demonstrate the prominent ability of our method which achieves competitive performance on these datasets. Moreover, we demonstrate the effectiveness of our Graph Flow through a novel semi-supervised paradigm for dual efficient medical image segmentation. Our code will be available at Graph Flow.