论文标题

TOPOMAP:0维的同源性保留高维数据的投影

TopoMap: A 0-dimensional Homology Preserving Projection of High-Dimensional Data

论文作者

Doraiswamy, Harish, Tierny, Julien, Silva, Paulo J. S., Nonato, Luis Gustavo, Silva, Claudio

论文摘要

多维投影是高维数据分析和可视化的基本工具。除少数例外外,投影技术旨在将数据从高维空间映射到视觉空间,以保留一些差异(相似性)度量,例如欧几里得距离。实际上,尽管采用了旨在偏爱数据不同方面的不同数学公式,但大多数多维投影方法努力保留构成几何特性(例如距离或数据对象之间的邻近关系)的差异措施。但是,几何关系并不是在投影中保存的唯一有趣的属性。例如,如果映射过程可以保证拓扑不变性,例如连接的组件和循环,则可以更可靠地执行对特定结构(例如簇和离群值)的分析。本文介绍了Topomap,这是一种新型投影技术,可在映射过程中提供拓扑保证。特别是,所提出的方法执行了从高维空间到视觉空间的映射,同时保留了高维数据的RIPS过滤的0维持续图,以确保过滤在应用于原始数据以及投影数据时生成相同的连接组件。提出的案例研究表明,Topomap提供的拓扑保证不仅为视觉分析过程带来了信心,而且可以用于协助评估其他投影方法。

Multidimensional Projection is a fundamental tool for high-dimensional data analytics and visualization. With very few exceptions, projection techniques are designed to map data from a high-dimensional space to a visual space so as to preserve some dissimilarity (similarity) measure, such as the Euclidean distance for example. In fact, although adopting distinct mathematical formulations designed to favor different aspects of the data, most multidimensional projection methods strive to preserve dissimilarity measures that encapsulate geometric properties such as distances or the proximity relation between data objects. However, geometric relations are not the only interesting property to be preserved in a projection. For instance, the analysis of particular structures such as clusters and outliers could be more reliably performed if the mapping process gives some guarantee as to topological invariants such as connected components and loops. This paper introduces TopoMap, a novel projection technique which provides topological guarantees during the mapping process. In particular, the proposed method performs the mapping from a high-dimensional space to a visual space, while preserving the 0-dimensional persistence diagram of the Rips filtration of the high-dimensional data, ensuring that the filtrations generate the same connected components when applied to the original as well as projected data. The presented case studies show that the topological guarantee provided by TopoMap not only brings confidence to the visual analytic process but also can be used to assist in the assessment of other projection methods.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源