Heart Sound Segmentation Using Bidirectional LSTMs

Paper Title

Heart Sound Segmentation using Bidirectional LSTMs with Attention

Authors

Tharindu Fernando, Houman Ghaemmaghami, Simon Denman, Sridha Sridharan, Nayyar Hussain, Clinton Fookes

Abstract

This paper proposes a novel framework for the segmentation of phonocardiogram (PCG) signals into heart states, exploiting the temporal evolution of the PCG as well as considering the salient information that it provides for the detection of the heart state. We propose the use of recurrent neural networks and exploit recent advancements in attention-based learning to segment the PCG signal. This allows the network to identify the most salient aspects of the signal and disregard uninformative content. The proposed method attains state-of-the-art performance on multiple benchmarks including both human and animal heart recordings. Furthermore, we empirically analyse different feature combinations including envelope, wavelet, and Mel Frequency Cepstral Coefficient (MFCC) features, and provide quantitative measurements that explore the importance of different features in the proposed approach. We demonstrate that a recurrent neural network coupled with attention mechanisms can effectively learn from irregular and noisy PCG recordings. Our analysis of different feature combinations shows that MFCC features and their derivatives offer the best performance compared to classical wavelet and envelope features. Heart sound segmentation is a crucial pre-processing step for many diagnostic applications. The proposed method provides a cost-effective alternative to labour-intensive manual segmentation, and provides a more accurate segmentation than existing methods. As such, it can improve the performance of further analysis including the detection of murmurs and ejection clicks. The proposed method is also applicable to the detection and segmentation of other one-dimensional biomedical signals.
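The abstract does not specify the form of attention used, so the following is only a minimal sketch of the general idea of weighting salient frames more heavily than uninformative ones. It assumes plain dot-product attention, which is a generic stand-in and not necessarily the paper's mechanism; the names `attention_pool` and `query` and the toy frame values are hypothetical and chosen purely for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(frames, query):
    """Weight each frame by its dot-product similarity to `query`.

    frames: list of per-frame feature vectors (for example, recurrent
            hidden states computed from MFCC features).
    query:  hypothetical learned vector with the same dimension.
    Returns (weights, context); context is the weighted sum of frames.
    """
    scores = [sum(f * q for f, q in zip(frame, query)) for frame in frames]
    weights = softmax(scores)
    dim = len(query)
    context = [
        sum(w * frame[d] for w, frame in zip(weights, frames))
        for d in range(dim)
    ]
    return weights, context


# Toy input (assumed values): three 2-dimensional frames, of which the
# second aligns best with the query and so receives the largest weight.
frames = [[0.1, 0.0], [2.0, 1.0], [0.0, 0.2]]
query = [1.0, 1.0]
weights, context = attention_pool(frames, query)
```

In this toy case the weights sum to one and the second frame dominates the pooled context, which is the behaviour the abstract describes as identifying the most salient aspects of the signal. In a trained model the query (or an equivalent scoring network) would be learned jointly with the recurrent layers rather than fixed by hand.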
