基于双重RTF-vector的基于相干的频率子集选择基于多扬声器的到达估算方向

论文标题

基于双重RTF-vector的基于相干的频率子集选择基于多扬声器的到达估算方向

Coherence-Based Frequency Subset Selection For Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers

论文作者

Fejgin, Daniel, Doclo, Simon

论文摘要

最近，已经提出了一种方法来估计单个扬声器的到达方向（DOA），方法是通过最大程度地减少估计的相对传递函数（RTF）向量（RTF）矢量和原型室内rtf矢量数据库之间的频率平均均值。在本文中，我们通过引入频率平均的Hermitian角度光谱并选择该空间频谱的峰来扩展到多演讲者的定位。为了构建hermitian角度频谱，我们仅考虑一部分频率，其中一个说话者可能是主导的。我们将广义幅度平方相干性和两个相干与扩散比（CDR）估计器作为频率选择标准的有效性。使用双耳听力设备在混响环境中估算带有分散的Babble噪声的两个扬声器DOA的仿真结果表明，使用基于双耳有效固定的CDR估计作为频率选择标准，可产生最佳性能。

Recently, a method has been proposed to estimate the direction of arrival (DOA) of a single speaker by minimizing the frequency-averaged Hermitian angle between an estimated relative transfer function (RTF) vector and a database of prototype anechoic RTF vectors. In this paper, we extend this method to multi-speaker localization by introducing the frequency-averaged Hermitian angle spectrum and selecting peaks of this spatial spectrum. To construct the Hermitian angle spectrum, we consider only a subset of frequencies, where it is likely that one speaker is dominant. We compare the effectiveness of the generalized magnitude squared coherence and two coherent-to-diffuse ratio (CDR) estimators as frequency selection criteria. Simulation results for estimating the DOAs of two speakers in a reverberant environment with diffuse-like babble noise using binaural hearing devices show that using the binaural effective-coherence-based CDR estimate as a frequency selection criterion yields the best performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题