学习丰富的特征，以进行真实的图像恢复和增强

论文标题

学习丰富的特征，以进行真实的图像恢复和增强

Learning Enriched Features for Real Image Restoration and Enhancement

论文作者

Zamir, Syed Waqas, Arora, Aditya, Khan, Salman, Hayat, Munawar, Khan, Fahad Shahbaz, Yang, Ming-Hsuan, Shao, Ling

论文摘要

为了从降级版本中恢复高质量的图像内容，图像修复均享有许多应用，例如监视，计算摄影，医学成像和遥感。最近，卷积神经网络（CNN）对图像恢复任务的常规方法实现了巨大的改进。现有的基于CNN的方法通常在完整分辨率或逐步的低分辨率表示上运行。在前一种情况下，实现了空间精确但上下文较少的稳健结果，而在后一种情况下，会生成语义上可靠但在空间上产生精确的输出。在本文中，我们介绍了一种新颖的体系结构，其集体目标是通过整个网络维持空间高分辨率的高分辨率表示，并从低分辨率表示中接收强大的上下文信息。我们方法的核心是一个多尺度的残差块，其中包含几个关键要素：（a）平行多分辨率卷积流，用于提取多尺度特征，（b）跨多分辨率流的信息交换，（c）用于捕获上下文信息的空间和通道注意机制，以及（d）基于多数尺度的集合，（d）基于多项式信息。简而言之，我们的方法学习了一组丰富的功能，这些功能结合了多个尺度上的上下文信息，同时保留了高分辨率的空间细节。在五个真实图像基准数据集上进行的大量实验表明，我们的方法（称为mirnet）为各种图像处理任务（包括图像deoinging，超级分辨率和图像增强）实现了最新结果。源代码和预培训模型可在https://github.com/swz30/mirnet上找到。

With the goal of recovering high-quality image content from its degraded version, image restoration enjoys numerous applications, such as in surveillance, computational photography, medical imaging, and remote sensing. Recently, convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task. Existing CNN-based methods typically operate either on full-resolution or on progressively low-resolution representations. In the former case, spatially precise but contextually less robust results are achieved, while in the latter case, semantically reliable but spatially less accurate outputs are generated. In this paper, we present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network and receiving strong contextual information from the low-resolution representations. The core of our approach is a multi-scale residual block containing several key elements: (a) parallel multi-resolution convolution streams for extracting multi-scale features, (b) information exchange across the multi-resolution streams, (c) spatial and channel attention mechanisms for capturing contextual information, and (d) attention based multi-scale feature aggregation. In a nutshell, our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details. Extensive experiments on five real image benchmark datasets demonstrate that our method, named as MIRNet, achieves state-of-the-art results for a variety of image processing tasks, including image denoising, super-resolution, and image enhancement. The source code and pre-trained models are available at https://github.com/swz30/MIRNet.

下载PDF全文

下载文献需遵守相关版权规定

论文标题