论文标题

潜在指南针:导航创建

Latent Compass: Creation by Navigation

论文作者

Schwettmann, Sarah, Strobelt, Hendrik, Martino, Mauro

论文摘要

在Marius von senden的空间和视线中,一个新近视觉的盲人将角落的体验描述为像柠檬一样,因为像柠檬一样角落“刺”景象,刺痛了舌头。在这里,定态是感官体验的特征空间中的一个维度,这是感知者对两者相互作用的感知者的影响。在新近景象的说法中,一种相互作用所熟悉的效果转化为一种新颖的环境。感知是泛化的工具,因为在不同的经验中分享的效果会产生以这些经验为基础的具体抽象。塞尚(Cezanne)和后印象派主义者(后印象派)精通经验翻译的语言,意识到,最能反映现实的混凝土形式的绘画方法不是绘画他们所看到的,而是要看到的感觉。我们设想了使用AI的创造未来的未来,在该未来的情况下,可以看到的感觉是可复制,可转让,可操作的 - 艺术家调色板的一部分,既基于特定的环境,又可以推广。 积极的研究线将人解剖特征映射到甘恩潜在空间中的方向上。受到监督和自我监督的方法,以寻找预期的方向或使用现成的分类器在嵌入空间中驱动图像操纵的限制是他们可以发现的各种功能的限制。发现有用的新方向的无监督方法表明,感知有意义的方向的空间几乎没有完全映射。由于这个空间很广泛并且充满了创造力,因此我们希望使用指导发现的工具来捕捉人类感知的丰富性和普遍性。我们的方法使创建者在实时工具使用期间将创建者进入发现循环,以识别对他们感知意义上有意义的方向,并沿着这些方向生成可解释的图像翻译。

In Marius von Senden's Space and Sight, a newly sighted blind patient describes the experience of a corner as lemon-like, because corners "prick" sight like lemons prick the tongue. Prickliness, here, is a dimension in the feature space of sensory experience, an effect of the perceived on the perceiver that arises where the two interact. In the account of the newly sighted, an effect familiar from one interaction translates to a novel context. Perception serves as the vehicle for generalization, in that an effect shared across different experiences produces a concrete abstraction grounded in those experiences. Cezanne and the post-impressionists, fluent in the language of experience translation, realized that the way to paint a concrete form that best reflected reality was to paint not what they saw, but what it was like to see. We envision a future of creation using AI where what it is like to see is replicable, transferrable, manipulable - part of the artist's palette that is both grounded in a particular context, and generalizable beyond it. An active line of research maps human-interpretable features onto directions in GAN latent space. Supervised and self-supervised approaches that search for anticipated directions or use off-the-shelf classifiers to drive image manipulation in embedding space are limited in the variety of features they can uncover. Unsupervised approaches that discover useful new directions show that the space of perceptually meaningful directions is nowhere close to being fully mapped. As this space is broad and full of creative potential, we want tools for direction discovery that capture the richness and generalizability of human perception. Our approach puts creators in the discovery loop during real-time tool use, in order to identify directions that are perceptually meaningful to them, and generate interpretable image translations along those directions.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源