Paper Title

Consistent Independent Low-Rank Matrix Analysis for Determined Blind Source Separation

Authors

Daichi Kitamura, Kohei Yatabe

Abstract

Independent low-rank matrix analysis (ILRMA) is the state-of-the-art algorithm for blind source separation (BSS) in the determined situation (the number of microphones is greater than or equal to the number of source signals). ILRMA achieves high separation performance by modeling the power spectrograms of the source signals via nonnegative matrix factorization (NMF). Such a highly developed source model can solve the permutation problem of frequency-domain BSS to a large extent, which is the reason for the excellence of ILRMA. In this paper, we further improve the separation performance of ILRMA by additionally considering the general structure of spectrograms, which is called consistency, and hence we call the proposed method Consistent ILRMA. Since a spectrogram is calculated with overlapping windows (and a window function induces spectral smearing in the form of main lobes and side lobes), the time-frequency bins depend on each other. In other words, the time-frequency components are related to each other via the uncertainty principle. Such co-occurrence among the spectral components can assist in solving the permutation problem, as demonstrated by a recent study. On the basis of these facts, we propose an algorithm for realizing Consistent ILRMA by slightly modifying the original algorithm. Its performance was extensively evaluated through experiments performed with various window lengths and shift lengths. The results indicated several tendencies of the original and proposed ILRMA, including some topics not fully discussed in the literature. For example, the proposed Consistent ILRMA tends to outperform the original ILRMA when the window length is sufficiently long compared to the reverberation time of the mixing system.
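The notion of consistency mentioned in the abstract can be illustrated with a small NumPy sketch. It assumes one common formalization: a complex array is a consistent spectrogram if it is unchanged by an inverse STFT followed by an STFT. The abstract does not spell out how the original algorithm is modified, so this is only an illustration of the concept, not the authors' method. The helper names (`stft`, `istft`, `project_consistent`), the Hamming window, and the frame sizes are all choices made for this example.

```python
import numpy as np

def stft(x, win, hop):
    """Windowed frames followed by a one-sided FFT; shape (n_frames, n_bins)."""
    n = len(win)
    n_frames = 1 + (len(x) - n) // hop
    frames = np.stack([x[i * hop:i * hop + n] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def istft(X, win, hop):
    """Least-squares inverse: windowed overlap-add divided by the summed squared window."""
    n = len(win)
    n_frames = X.shape[0]
    length = n + hop * (n_frames - 1)
    frames = np.fft.irfft(X, n=n, axis=1) * win
    x = np.zeros(length)
    norm = np.zeros(length)
    for i in range(n_frames):
        x[i * hop:i * hop + n] += frames[i]
        norm[i * hop:i * hop + n] += win ** 2
    return x / norm  # the Hamming window is nonzero everywhere, so norm > 0

def project_consistent(X, win, hop):
    """Map any complex array to a consistent spectrogram (iSTFT, then STFT)."""
    return stft(istft(X, win, hop), win, hop)

def residual(X, win, hop):
    """Distance between X and its consistent counterpart."""
    return np.linalg.norm(X - project_consistent(X, win, hop))

rng = np.random.default_rng(0)
n, hop = 64, 16                        # window length and shift length (assumed values)
win = np.hamming(n)
x = rng.standard_normal(n + hop * 20)  # length chosen so the frames tile the signal exactly

X_true = stft(x, win, hop)             # spectrogram of a real signal: consistent
X_rand = (rng.standard_normal(X_true.shape)
          + 1j * rng.standard_normal(X_true.shape))  # arbitrary array: inconsistent

res_true = residual(X_true, win, hop)  # numerically zero
res_rand = residual(X_rand, win, hop)  # clearly nonzero
print(res_true, res_rand)
```

Because neighboring frames overlap, an arbitrary complex array generally does not correspond to any time-domain signal, which is why the second residual is far from zero. The window length and shift length control how strongly the bins are tied together, and these are the same two parameters that the experiments in the abstract vary.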
