论文标题
基于动态系统的最佳控制的物理深度学习
Physical deep learning based on optimal control of dynamical systems
论文作者
论文摘要
深度学习是人工智能技术的骨干,它可以被视为一种多层喂养神经网络。深度学习的本质是通过层次传播信息。这表明深度神经网络与动态系统之间存在联系,因为信息传播是通过动态系统的时间进化明确建模的。在这项研究中,我们基于对连续时间动力学系统的最佳控制执行模式识别,该系统适用于物理硬件实现。该学习基于最佳控制动态系统的伴随方法,并且基于系统的时间演变的深(虚拟)网络结构用于处理输入信息。作为关键示例,我们将基于动态的识别方法应用于光电延迟系统,并证明延迟系统的使用允许仅使用少数几个控制信号进行图像识别和非线性分类。这与常规的多层神经网络相反,后者需要大量的权重参数训练。所提出的方法提供了对最佳控制问题框架中深网处理机制的见解,并提出了实现物理计算硬件的途径。
Deep learning is the backbone of artificial intelligence technologies, and it can be regarded as a kind of multilayer feedforward neural network. An essence of deep learning is information propagation through layers. This suggests that there is a connection between deep neural networks and dynamical systems in the sense that information propagation is explicitly modeled by the time-evolution of dynamical systems. In this study, we perform pattern recognition based on the optimal control of continuous-time dynamical systems, which is suitable for physical hardware implementation. The learning is based on the adjoint method to optimally control dynamical systems, and the deep (virtual) network structures based on the time evolution of the systems are used for processing input information. As a key example, we apply the dynamics-based recognition approach to an optoelectronic delay system and demonstrate that the use of the delay system allows for image recognition and nonlinear classifications using only a few control signals. This is in contrast to conventional multilayer neural networks, which require a large number of weight parameters to be trained. The proposed approach provides insight into the mechanisms of deep network processing in the framework of an optimal control problem and presents a pathway for realizing physical computing hardware.