论文标题

一项关于普通话中文提示的演讲的试点研究

A Pilot Study on Mandarin Chinese Cued Speech

论文作者

Li, Liu, Gang, Feng

论文摘要

提示语音(CS)是为聋人开发的通信系统,它利用手提示在语音层面上补充语音阅读。目前,据估计,CS已适用于60多种语言。但是,没有用于普通话的官方CS系统。本文提出了一种新颖而有效的普通话中文CS系统,满足了手动编码构成嘴唇运动的主要标准。我们建议将元音[i,u,y]作为半符号元音,然后将其遵循其他普通话决赛,从而减少36个普通话总决赛的数量。我们建立了剩下的16个元音之间的普通话中国和法国元音之间的连贯相似性,这使我们可以利用法国CS系统。此外,通过基于新的语料库研究嘴唇比较分布,可以将16个普通话元音的最佳分配到不同的手部位置。使用高斯分类器来评估不同分配元音组的平均分离性,该元音组分别为92.08 \%,92.33 \%和92.73 \%,分别为三位扬声器。这些辅音主要根据与法国CS系统的相似之处以及对特殊的普通话辅音的一些考虑。在我们的系统中,普通话的音调用头部运动进行编码。

Cued Speech (CS) is a communication system developed for deaf people, which exploits hand cues to complement speechreading at the phonetic level. Currently, it is estimated that CS has been adapted to over 60 languages; however, no official CS system is available for Mandarin Chinese. This article proposes a novel and efficient Mandarin Chinese CS system, satisfying the main criterion that the hand coding constitutes a complement to the lips movements. We propose to code vowels [i, u, y] as semiconsonants when they are followed by other Mandarin finals, which reduces the number of Mandarin finals to be coded from 36 to 16. We establish a coherent similarity between Mandarin Chinese and French vowels for the remaining 16 vowels, which allows us to take advantage of the French CS system. Furthermore, by investigating the lips viseme distribution based on a new corpus, an optimal allocation of the 16 Mandarin vowels to different hand positions is obtained. A Gaussian classifier was used to evaluate the average separability of different allocated vowel groups, which gives 92.08\%, 92.33\%, and 92.73\% for the three speakers, respectively. The consonants are mainly designed according to their similarities with the French CS system, as well as some considerations on the special Mandarin consonants. In our system, the tones of Mandarin are coded with head movements.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源