校准和完善！ ASR-Error强大意图检测的新颖而敏捷的框架

论文标题

校准和完善！ ASR-Error强大意图检测的新颖而敏捷的框架

Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection

论文作者

Zhou, Peilin, Chong, Dading, Wang, Helin, Zeng, Qingcheng

论文摘要

在过去的十年中，基于文本的意图检测的迅速发展，其基准表演已经被深度学习技术带到了一个显着的水平。但是，由于环境噪声，独特的语音模式等，在现实世界应用中不可避免地会出现自动语音识别（ASR）错误，从而导致基于最新的基于文本的意图检测模型的急剧下降。从本质上讲，这种现象是由ASR错误带来的语义漂移引起的，大多数现有作品倾向于专注于设计新的模型结构以减少其影响，这是以多功能性和灵活性为代价的。与以前的单件模型不同的是，在本文中，我们提出了一个新颖而敏捷的框架，称为CR-ID，使用两个插件播放模块，即语义漂移校准模块（SDCM）和音素改进模块（PRM），而不是易于整体的构造模型。 SNIPS数据集的实验结果表明，我们提出的CR-ID框架实现了竞争性能，并且优于ASR输出上的所有基线方法，这证明CR-ID可以有效地减轻ASR错误引起的语义漂移。

The past ten years have witnessed the rapid development of text-based intent detection, whose benchmark performances have already been taken to a remarkable level by deep learning techniques. However, automatic speech recognition (ASR) errors are inevitable in real-world applications due to the environment noise, unique speech patterns and etc, leading to sharp performance drop in state-of-the-art text-based intent detection models. Essentially, this phenomenon is caused by the semantic drift brought by ASR errors and most existing works tend to focus on designing new model structures to reduce its impact, which is at the expense of versatility and flexibility. Different from previous one-piece model, in this paper, we propose a novel and agile framework called CR-ID for ASR error robust intent detection with two plug-and-play modules, namely semantic drift calibration module (SDCM) and phonemic refinement module (PRM), which are both model-agnostic and thus could be easily integrated to any existing intent detection models without modifying their structures. Experimental results on SNIPS dataset show that, our proposed CR-ID framework achieves competitive performance and outperform all the baseline methods on ASR outputs, which verifies that CR-ID can effectively alleviate the semantic drift caused by ASR errors.

下载PDF全文

下载文献需遵守相关版权规定

论文标题