论文标题
CompoSitEtbasking:通过任务的空间组成理解图像
CompositeTasking: Understanding Images by Spatial Composition of Tasks
论文作者
论文摘要
我们将CompoSiteTasking的概念定义为图像理解的各个方面的多个,空间分布的任务的融合。学习执行空间分布式任务的是由于跨任务的稀疏标签频繁可用,以及对紧凑的多任务网络的渴望。为了促进CompoSiteTasking,我们介绍了一个新型的任务调节模型,这是一个单个编码器网络,一次执行多个空间变化的任务。所提出的网络将图像和一组像素密度任务请求作为输入,并为每个像素执行所请求的预测任务。此外,我们还学习了需要根据某些CompoSiteTBESKing规则执行任务的组成,其中包括决定在何处应用哪个任务的决定。它不仅为我们提供了一个紧凑的网络用于多任务处理,而且还允许进行任务编辑。通过只需要提供每个任务的稀疏监督来证明所提出方法的另一个优势。获得的结果与我们使用密集监督和多头任务设计的基线相当。源代码将在www.github.com/nikola3794/composite-tasking上公开提供。
We define the concept of CompositeTasking as the fusion of multiple, spatially distributed tasks, for various aspects of image understanding. Learning to perform spatially distributed tasks is motivated by the frequent availability of only sparse labels across tasks, and the desire for a compact multi-tasking network. To facilitate CompositeTasking, we introduce a novel task conditioning model -- a single encoder-decoder network that performs multiple, spatially varying tasks at once. The proposed network takes an image and a set of pixel-wise dense task requests as inputs, and performs the requested prediction task for each pixel. Moreover, we also learn the composition of tasks that needs to be performed according to some CompositeTasking rules, which includes the decision of where to apply which task. It not only offers us a compact network for multi-tasking, but also allows for task-editing. Another strength of the proposed method is demonstrated by only having to supply sparse supervision per task. The obtained results are on par with our baselines that use dense supervision and a multi-headed multi-tasking design. The source code will be made publicly available at www.github.com/nikola3794/composite-tasking.