学习增强二进制搜索树

论文标题

学习增强二进制搜索树

Learning Augmented Binary Search Trees

论文作者

Lin, Honghao, Luo, Tian, Woodruff, David P.

论文摘要

TREAP是一种经典的随机二进制搜索树数据结构，易于实现，并支持O（\ log n）预期的时间访问。但是，经典的Treap不会利用输入中的输入分布或模式。鉴于算法的最新进展，我们建议将Treaps与机器建议配对，以形成一个学习的Treap。我们是第一个提出学习的数据结构的人，该数据结构支持二进制搜索树操作，例如范围广告和后继功能。假设我们可以从频率估计的甲骨文中获得建议，我们将学习的优先级分配给节点，以更好地改善TREAP的结构。我们从理论上分析了在各种输入分布下的学习吸引的Treap的性能，并表明在这种情况下，我们的学习增强的Treap比经典的Treaps和其他经典的基于树的数据结构具有更强的保证。此外，我们在实验中评估了我们对合成数据集的TREAP，并证明了比其他搜索树数据结构具有性能优势。我们还介绍了具有已知频率估计序列的现实世界数据集的实验，并显示了改进。

A treap is a classic randomized binary search tree data structure that is easy to implement and supports O(\log n) expected time access. However, classic treaps do not take advantage of the input distribution or patterns in the input. Given recent advances in algorithms with predictions, we propose pairing treaps with machine advice to form a learning-augmented treap. We are the first to propose a learning-augmented data structure that supports binary search tree operations such as range-query and successor functionalities. With the assumption that we have access to advice from a frequency estimation oracle, we assign learned priorities to the nodes to better improve the treap's structure. We theoretically analyze the learning-augmented treap's performance under various input distributions and show that under those circumstances, our learning-augmented treap has stronger guarantees than classic treaps and other classic tree-based data structures. Further, we experimentally evaluate our learned treap on synthetic datasets and demonstrate a performance advantage over other search tree data structures. We also present experiments on real world datasets with known frequency estimation oracles and show improvements as well.

下载PDF全文

下载文献需遵守相关版权规定

论文标题