论文标题
Duoquest:一种表达SQL查询的双规格系统
Duoquest: A Dual-Specification System for Expressive SQL Queries
论文作者
论文摘要
查询关系数据库很困难,因为它要求用户同时了解SQL语言并熟悉架构。另一方面,许多用户具有足够的领域熟悉度或专业知识,可以通过替代方式描述其所需的查询。对于此类用户,编写SQL的两个主要替代方法是自然语言接口(NLIS)和示例编程(PBE)。这两种替代方案都面临着某些陷阱:自然语言查询(NLQ)通常是模棱两可的,即使对于人类口译员来说,当前的PBE方法都需要低复杂性查询,用户架构知识,用户的确切示例示例,或者是封闭世界的假设。因此,我们提出了双规格查询综合,该查询综合既消耗nlq and optional PBE状的表格素描查询,从而使用户能够表达不同级别的领域特定知识。我们介绍了新型的双规格二人组系统,该系统利用引导部分查询枚举来有效地探索可能的查询空间。我们提出了用户研究的结果,其中二人组表明,在最先进的NLI中,查询构造准确性的绝对增长率为62.5%,并且在PBE系统支持的更有限的工作负载下,与PBE系统的准确性相当。在对著名蜘蛛基准测试的仿真研究中,Duoquest在NLI和PBE的TOP-1精度上表现出> 2倍。
Querying a relational database is difficult because it requires users to know both the SQL language and be familiar with the schema. On the other hand, many users possess enough domain familiarity or expertise to describe their desired queries by alternative means. For such users, two major alternatives to writing SQL are natural language interfaces (NLIs) and programming-by-example (PBE). Both of these alternatives face certain pitfalls: natural language queries (NLQs) are often ambiguous, even for human interpreters, while current PBE approaches require either low-complexity queries, user schema knowledge, exact example tuples from the user, or a closed-world assumption to be tractable. Consequently, we propose dual-specification query synthesis, which consumes both a NLQ and an optional PBE-like table sketch query that enables users to express varied levels of domain-specific knowledge. We introduce the novel dual-specification Duoquest system, which leverages guided partial query enumeration to efficiently explore the space of possible queries. We present results from user studies in which Duoquest demonstrates a 62.5% absolute increase in query construction accuracy over a state-of-the-art NLI and comparable accuracy to a PBE system on a more limited workload supported by the PBE system. In a simulation study on the prominent Spider benchmark, Duoquest demonstrates a >2x increase in top-1 accuracy over both NLI and PBE.