论文标题
EDN:通过极度降采样网络检测显着的对象检测
EDN: Salient Object Detection via Extremely-Downsampled Network
论文作者
论文摘要
显着对象检测(SOD)的最新进展主要受益于多尺度学习,在该学习中,高级和低级功能在定位显着对象和发现细节方面进行了协作。但是,大多数努力通过融合多尺度功能或增强边界表示来致力于低级功能学习。高级功能,尽管长期以来证明对许多其他任务有效,但几乎没有研究SOD。在本文中,我们利用了这一差距,并表明增强高级功能对于草皮也是必不可少的。为此,我们引入了一个极为下降的网络(EDN),该网络采用了极端的倒置技术来有效地了解整个图像的全局视图,从而导致准确的显着对象定位。为了完成更好的多层次功能融合,我们构建了比例尺相关的金字塔卷积(SCPC),以构建一个优雅的解码器,用于从上述极端下采样中恢复对象细节。广泛的实验表明,EDN以实时速度实现最先进的性能。我们有效的EDN-Lite还以316fps的速度实现了竞争性能。因此,这项工作有望在SOD中引发一些新的想法。代码可在https://github.com/yuhuan-wu/edn上找到。
Recent progress on salient object detection (SOD) mainly benefits from multi-scale learning, where the high-level and low-level features collaborate in locating salient objects and discovering fine details, respectively. However, most efforts are devoted to low-level feature learning by fusing multi-scale features or enhancing boundary representations. High-level features, which although have long proven effective for many other tasks, yet have been barely studied for SOD. In this paper, we tap into this gap and show that enhancing high- level features is essential for SOD as well. To this end, we introduce an Extremely-Downsampled Network (EDN), which employs an extreme downsampling technique to effectively learn a global view of the whole image, leading to accurate salient object localization. To accomplish better multi-level feature fusion, we construct the Scale-Correlated Pyramid Convolution (SCPC) to build an elegant decoder for recovering object details from the above extreme downsampling. Extensive experiments demonstrate that EDN achieves state-of-the-art performance with real-time speed. Our efficient EDN-Lite also achieves competitive performance with a speed of 316fps. Hence, this work is expected to spark some new thinking in SOD. Code is available at https://github.com/yuhuan-wu/EDN.