论文标题
高压船:迈向快速连续的型号
Hypersolvers: Toward Fast Continuous-Depth Models
论文作者
论文摘要
神经odes启用的无限深度范式在寻找新型动力学系统启发的深度学习原始基原始原始范围中引发了文艺复兴。但是,由于计算可伸缩性差,它们在非平凡大小问题中的利用通常是不可能的。这项工作为可扩展的神经ODE铺平了道路,预测时间可与传统离散网络相当。我们介绍了旨在解决低空费用的ODE和精确性的理论保证的odes的高空作物,神经网络。高压溶剂和神经ODE的协同组合允许廉价推理,并解锁了连续深度模型的实际应用新的边界。对标准基准测试的实验评估,例如对连续归一化流的采样,揭示了对经典数值方法一致的帕累托效率。
The infinite-depth paradigm pioneered by Neural ODEs has launched a renaissance in the search for novel dynamical system-inspired deep learning primitives; however, their utilization in problems of non-trivial size has often proved impossible due to poor computational scalability. This work paves the way for scalable Neural ODEs with time-to-prediction comparable to traditional discrete networks. We introduce hypersolvers, neural networks designed to solve ODEs with low overhead and theoretical guarantees on accuracy. The synergistic combination of hypersolvers and Neural ODEs allows for cheap inference and unlocks a new frontier for practical application of continuous-depth models. Experimental evaluations on standard benchmarks, such as sampling for continuous normalizing flows, reveal consistent pareto efficiency over classical numerical methods.