重力波数据分析的深度学习：重新采样白盒方法

论文标题

重力波数据分析的深度学习：重新采样白盒方法

Deep learning for gravitational-wave data analysis: A resampling white-box approach

论文作者

Morales, Manuel D., Antelis, Javier M., Moreno, Claudia, Nesterov, Alexander I.

论文摘要

在这项工作中，我们使用来自LIGO探测器的单个相互处理计数据，应用卷积神经网络（CNN）来检测紧凑型二元煤层的引力波（GW）信号。作为新颖的贡献，我们采用了一种重采样白框方法，以促进GW数据分析中对CNN的不确定性的统计理解。重复采样是通过重复的$ k $折叠交叉验证实验执行的，对于白色框方法，CNN的行为详细描述了CNNS的行为。通过Morlet小波变换，应变时间序列被转换为时频图像，这些图像又在生成输入数据集之前会减少。此外，为了重现更现实的实验条件，我们仅处理非高斯噪声和硬件注射的数据，从而消除了GW模板中设置信噪比（SNR）值（SNR）值的自由。经过高参数调整后，我们发现，通过将平均准确性扰动降低为3.6美元，重新采样平滑的迷你批次随机梯度下降。 CNN非常精确地检测噪声，但敏感不足以回忆GW信号，这意味着CNN比GW触发器的产生更好。但是，应用后分析时，我们发现，对于带有H1数据的SNR $ \ GEQ 21.80 $的GW信号，带有L1数据的SNR $ \ GEQ 26.80 $，CNN可以作为检测GW信号的暂定替代方案。此外，在接收操作特征曲线的情况下，我们发现CNN表现出的性能要比幼稚的贝叶斯和支持矢量机模型的性能要好得多，并且具有$ 5 \％$的显着性水平，我们估计CNN的预测与随机分类器的预测显着不同。最后，我们阐明了CNN的性能是高度依赖性的，这是因为SoftMax层输出的概率分数的分布。

In this work, we apply Convolutional Neural Networks (CNNs) to detect gravitational wave (GW) signals of compact binary coalescences, using single-interferometer data from LIGO detectors. As novel contribution, we adopted a resampling white-box approach to advance towards a statistical understanding of uncertainties intrinsic to CNNs in GW data analysis. Resampling is performed by repeated $k$-fold cross-validation experiments, and for a white-box approach, behavior of CNNs is mathematically described in detail. Through a Morlet wavelet transform, strain time series are converted to time-frequency images, which in turn are reduced before generating input datasets. Moreover, to reproduce more realistic experimental conditions, we worked only with data of non-Gaussian noise and hardware injections, removing freedom to set signal-to-noise ratio (SNR) values in GW templates by hand. After hyperparameter adjustments, we found that resampling smooths stochasticity of mini-batch stochastic gradient descend by reducing mean accuracy perturbations in a factor of $3.6$. CNNs were quite precise to detect noise but not sensitive enough to recall GW signals, meaning that CNNs are better for noise reduction than generation of GW triggers. However, applying a post-analysis, we found that for GW signals of SNR $\geq 21.80$ with H1 data and SNR $\geq 26.80$ with L1 data, CNNs could remain as tentative alternatives for detecting GW signals. Besides, with receiving operating characteristic curves we found that CNNs show much better performances than those of Naive Bayes and Support Vector Machines models and, with a significance level of $5\%$, we estimated that predictions of CNNs are significant different from those of a random classifier. Finally, we elucidated that performance of CNNs is highly class dependent because of the distribution of probabilistic scores outputted by the softmax layer.

下载PDF全文

下载文献需遵守相关版权规定

论文标题