基于物理的深度学习光纤通信系统

论文标题

基于物理的深度学习光纤通信系统

Physics-Based Deep Learning for Fiber-Optic Communication Systems

论文作者

Häger, Christian, Pfister, Henry D.

论文摘要

我们为光纤通信系统提出了一种新的机器学习方法，其信号传播受非线性Schrödinger方程（NLSE）的控制。我们的主要观察结果是，用于数值求解NLSE的流行拆分方法（SSM）与深层多层神经网络的功能形式基本相同。在这两种情况下，都可以交替进行线性步骤和指数的非线性。我们通过参数化SSM并将线性步骤视为一般线性函数来利用此连接，类似于神经网络中的权重矩阵。与“ Black-Box”功能近似器相比，基于物理学的机器学习模型具有多个优点。例如，它使我们能够检查和解释学习的解决方案，以了解它们的表现良好。作为应用程序，考虑了低复杂性非线性均衡化，该任务是有效地反转NLSE。这通常称为数字返回传播（DBP）。所提出的算法并没有采用神经网络，而是被称为学习的DBP（LDBP），而是使用基于物理的模型在每个步骤中都具有可训练的过滤器，并且在梯度下降过程中逐渐修剪过滤器的水龙头逐渐降低了其复杂性。我们的主要发现是，可以将过滤器修剪成明显的短长度 - 几乎没有3个水龙头/无效的牺牲性能。结果，与先前的工作相比，可以通过数量级来降低复杂性。通过检查过滤器响应，提供了有关学习参数配置的其他理论依据。我们的工作说明，将数据驱动的优化与现有领域知识相结合可以产生对旧通信问题的新见解。

We propose a new machine-learning approach for fiber-optic communication systems whose signal propagation is governed by the nonlinear Schrödinger equation (NLSE). Our main observation is that the popular split-step method (SSM) for numerically solving the NLSE has essentially the same functional form as a deep multi-layer neural network; in both cases, one alternates linear steps and pointwise nonlinearities. We exploit this connection by parameterizing the SSM and viewing the linear steps as general linear functions, similar to the weight matrices in a neural network. The resulting physics-based machine-learning model has several advantages over "black-box" function approximators. For example, it allows us to examine and interpret the learned solutions in order to understand why they perform well. As an application, low-complexity nonlinear equalization is considered, where the task is to efficiently invert the NLSE. This is commonly referred to as digital backpropagation (DBP). Rather than employing neural networks, the proposed algorithm, dubbed learned DBP (LDBP), uses the physics-based model with trainable filters in each step and its complexity is reduced by progressively pruning filter taps during gradient descent. Our main finding is that the filters can be pruned to remarkably short lengths-as few as 3 taps/step-without sacrificing performance. As a result, the complexity can be reduced by orders of magnitude in comparison to prior work. By inspecting the filter responses, an additional theoretical justification for the learned parameter configurations is provided. Our work illustrates that combining data-driven optimization with existing domain knowledge can generate new insights into old communications problems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题