改善概率盒嵌入中的本地可识别性

论文标题

改善概率盒嵌入中的本地可识别性

Improving Local Identifiability in Probabilistic Box Embeddings

论文作者

Dasgupta, Shib Sankar, Boratko, Michael, Zhang, Dongxu, Vilnis, Luke, Li, Xiang Lorraine, McCallum, Andrew

论文摘要

几何嵌入方式最近引起了人们对通过遏制代表横向不对称关系的自然能力的关注。盒子嵌入物体由n维超矩形表示，是一个特别有希望的嵌入示例，因为它们在交点下关闭并可以轻松计算其体积，从而可以自然地表示校准的概率分布。几何嵌入的好处也引入了局部识别性问题，但是，整个参数邻里会导致等效损失，从而阻碍学习。先前的工作通过使用框参数的高斯卷积来解决其中一些问题，但是，此相交操作也增加了梯度的稀疏性。在这项工作中，我们对框参数进行建模，其中包括最小和最大牙龈分布，这些参数被选中，以使得在交叉路口的操作下仍关闭空间。预期相交量的计算涉及所有参数，我们通过实验证明，这大大提高了此类模型学习的能力。

Geometric embeddings have recently received attention for their natural ability to represent transitive asymmetric relations via containment. Box embeddings, where objects are represented by n-dimensional hyperrectangles, are a particularly promising example of such an embedding as they are closed under intersection and their volume can be calculated easily, allowing them to naturally represent calibrated probability distributions. The benefits of geometric embeddings also introduce a problem of local identifiability, however, where whole neighborhoods of parameters result in equivalent loss which impedes learning. Prior work addressed some of these issues by using an approximation to Gaussian convolution over the box parameters, however, this intersection operation also increases the sparsity of the gradient. In this work, we model the box parameters with min and max Gumbel distributions, which were chosen such that space is still closed under the operation of the intersection. The calculation of the expected intersection volume involves all parameters, and we demonstrate experimentally that this drastically improves the ability of such models to learn.

下载PDF全文

下载文献需遵守相关版权规定

论文标题