论文标题
Dalek- TARDIS的深度学习仿真器
Dalek -- a deep-learning emulator for TARDIS
论文作者
论文摘要
超新星光谱时间序列包含有关这些充满活力事件的祖细胞和爆炸过程的大量信息。这些数据的建模需要使用昂贵的辐射转移代码来探索非常高维后验概率。即使是超新星的适度参数化也包含十多个参数,并且详细的探索要求至少数百万个功能评估。物理现实的模型每次评估至少需要数十分钟的CPU分钟,这将爆炸的详细重建置于传统方法的影响之际。可广泛可用的库来培训神经网络的库结合其能够高精度近似几乎任意功能的能力,可以解决此问题的新方法。与其评估辐射转移模型本身,还可以建立在模拟上训练但更快地评估数量级的神经网络代理。这样的框架称为模拟器或替代模型。在这项工作中,我们提出了一种适用于IA Supernova型光谱的Tardis Supernova辐射转移代码的仿真器。我们证明,鉴于数十万个光谱训练集(在现代超级计算机上易于计算),我们可以训练该问题的模拟器。结果表明,具有多个数量级的加速度的百分比(由TARDIS而不是模拟器的蒙特卡洛本质主导)的精度。该方法具有更广泛的应用程序集,并且不限于提出的问题。
Supernova spectral time series contain a wealth of information about the progenitor and explosion process of these energetic events. The modeling of these data requires the exploration of very high dimensional posterior probabilities with expensive radiative transfer codes. Even modest parametrizations of supernovae contain more than ten parameters and a detailed exploration demands at least several million function evaluations. Physically realistic models require at least tens of CPU minutes per evaluation putting a detailed reconstruction of the explosion out of reach of traditional methodology. The advent of widely available libraries for the training of neural networks combined with their ability to approximate almost arbitrary functions with high precision allows for a new approach to this problem. Instead of evaluating the radiative transfer model itself, one can build a neural network proxy trained on the simulations but evaluating orders of magnitude faster. Such a framework is called an emulator or surrogate model. In this work, we present an emulator for the TARDIS supernova radiative transfer code applied to Type Ia supernova spectra. We show that we can train an emulator for this problem given a modest training set of a hundred thousand spectra (easily calculable on modern supercomputers). The results show an accuracy on the percent level (that are dominated by the Monte Carlo nature of TARDIS and not the emulator) with a speedup of several orders of magnitude. This method has a much broader set of applications and is not limited to the presented problem.