Neural network architectures for optical channel nonlinear compensation in digital subcarrier multiplexing systems

Abstract

In this work, we propose to use various artificial neural network (ANN) structures for modeling and compensation of intra- and inter-subcarrier fiber nonlinear interference in digital subcarrier multiplexing (DSCM) optical transmission systems. We perform nonlinear channel equalization by employing different ANN cores that include convolutional neural network (CNN) and long short-term memory (LSTM) layers. First, we develop a fiber nonlinearity compensator for DSCM systems based on a fully-connected network across all subcarriers. In subsequent steps, and borrowing from the perturbation analysis of fiber nonlinearity, we gradually upgrade the proposed designs towards modular structures with better performance-complexity tradeoffs. Our study shows that incorporating proper macro structures in the design of ANN nonlinear equalizers for DSCM systems can be crucial in the development of practical solutions for future generations of coherent optical transceivers.

© 2023 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

1. Introduction

For high-speed long-haul fiber-optic transmission, the nonlinear interference arising from the Kerr effect is a major bottleneck that limits the achievable transmission rates. This interference can be equalized by approximating and inverting the nonlinear Schrödinger equation through digital back-propagation (DBP) [1–3] or perturbation-based nonlinear compensation (PNLC) [4,5]. These solutions require accurate information about the optical channel, and their prohibitive complexity has limited their application in real-time processing with agile and flexible requirements. DBP in particular has been widely used as a benchmarking algorithm for evaluating the performance of other nonlinear compensation (NLC) solutions due to its algorithmic simplicity and limited number of hyper-parameters. However, it faces serious challenges for fixed-point implementation in coherent modems due to the higher required over-sampling rate, the use of multiple (inverse) fast Fourier transform (FFT) modules that affect the linear equalization path, and the need for higher fixed-point precision to maintain accuracy.

Alternatively, a variety of ANN solutions have recently been proposed for fiber nonlinearity compensation. Early works tried to squeeze out additional performance by feeding triplets, inspired by the perturbation analysis of fiber nonlinearity, to a feed-forward neural network [6]. Later works drew inspiration from DBP and aimed to incorporate deep convolutional neural networks (CNNs) for this task [7,8]. The use of advanced recurrent neural networks (RNNs), such as long short-term memory (LSTM) modules, which are better suited to the equalization of time-series processes, has also attracted great interest [9,10], with more recent works employing transformer structures for this task [11]. In fact, the pattern- and medium-dependent characteristics of nonlinear propagation make it a suitable problem to be tackled by a variety of toolsets and solutions from the ANN domain. An ANN-based nonlinear equalizer is generally more flexible than conventional methods in the sense that it can be more easily updated for different transmission scenarios without the need for accurate channel parameter feedback. Also, ANN nonlinear equalizers can be extended to include the functionalities of traditional DSP modules and form a more general equalizer. Furthermore, an ANN design in which the compensation process is learned through data can potentially lead to a large reduction in computational complexity [12]. The flexibility and universality of machine learning solutions can be improved by using reinforcement learning (RL), especially for adaptive applications in optical environments where acquiring enough data for training and retraining is challenging [13].

In this work, we consider the application of ANNs to the compensation of fiber nonlinearity distortions in coherent optical communication systems. We focus on advanced ANN structures with the ability to generate appropriate features without relying on an external pre-processing module. We particularly study digital subcarrier multiplexing (DSCM) systems since their design flexibility makes them a promising solution for coherent optical modems [14,15]. Simplified DSP development with lower-speed processing per subcarrier, flexible channel-matched transmission, robust clock recovery, and an easy transition to a point-to-multi-point (P2MP) architecture are some of the advantages of DSCM systems.

Here, we develop macro ANN structures inspired by the fiber nonlinearity distortion mechanism that governs the nonlinear interaction across different subcarriers; these structures are shown to be more efficient in terms of inference complexity, model representation, and training efficiency. We propose various ANN structures for modeling and compensation of intra- and inter-subcarrier fiber nonlinearities in DSCM systems, and explore scalability and performance-versus-complexity tradeoffs of the presented solutions. The models differ in how received symbols across digital subcarriers are employed to train ANN cores for intra-subcarrier self-phase modulation (iSPM) and inter-subcarrier cross-phase modulation (iXPM) nonlinear impairments. Starting with a fully-connected network across all subcarriers, we move toward upgrading the design with modular ANN cores and sequential training stages. In other words, we start with black-box ANN models and then propose more efficient and flexible modular designs inspired by nonlinear perturbation analysis. All models here are universal from the ANN-core choice perspective. Specifically, we choose the building block for all the proposed structures in this work to be an ANN core with a combination of CNN and LSTM layers. One important aspect of this work is to generalize the neural network designs such that a block of data is generated in each equalization step, since parallelization is an essential feature of coherent modems. We explore parallelization of these designs and the impact of block-processing on the performance-complexity tradeoffs of these models. The results suggest that one can obtain orders-of-magnitude reduction in computational complexity by moving towards block equalization in this fashion when RNN-based solutions are deployed.

The remainder of this paper is organized as follows: In Section 2, the basics of nonlinear compensation for the fiber channel are briefly discussed. In Section 3, the multi-purpose ANN-core structure that serves as the main building block of the proposed models is explained. The details of various ANN structures for NLC in DSCM are presented in Section 4, while Section 5 is devoted to the numerical setup and results. In Section 6, we discuss the impact of the dispersion map on the design of nonlinear equalizers for DSCM systems. Finally, we conclude the paper in Section 7.

2. Nonlinear compensation for optical fiber channel

The dual-polarization evolution of the optical field over a fiber link can be described by the Manakov equation [16], in which the linear and nonlinear propagation effects appear as follows:

$$\frac{\partial{u_{x/y}}}{\partial{z}} + \frac{\alpha}{2}u_{x/y} + j\frac{\beta}{2}\frac{\partial^2u_{x/y}}{\partial{t^2}} = j\frac{8}{9}\gamma\Bigl[|u_x|^2+|u_y|^2\Bigr]u_{x/y},$$
where $u_{x/y} = u_{x/y}(t,z)$ represents the optical field of polarization $x$ and $y$, respectively, $\alpha$ is the attenuation coefficient, $\beta$ is the group velocity dispersion (GVD), and $\gamma$ is the nonlinear coefficient. Nonlinear interference can be equalized by approximating and inverting the above equation through DBP [1–3], where the fiber is modeled as a series of linear and nonlinear sections through a first-order approximation of the Manakov equation. On the other hand, by employing the perturbation analysis [4], one can represent the optical field as the solution of linear propagation plus a symbol-domain perturbation term that encapsulates the accumulated nonlinear distortion on every symbol. It is shown that the first-order perturbation term can be modeled by a weighted sum of triplets of transmitted symbols plus a constant phase rotation [5,17].
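
For reference, a commonly used discrete-time form of this triplet model is sketched below for the $x$ polarization; the exact normalization and the constant-phase term vary between formulations [5,17], so this should be read as a schematic rather than the exact expression used in this work:
$$\Delta u_x[k] \approx j\sum_{m,n} C_{m,n}\Bigl(u_x[k+m]\,u_x^*[k+m+n] + u_y[k+m]\,u_y^*[k+m+n]\Bigr)u_x[k+n],$$
where $C_{m,n}$ are the perturbation coefficients determined by the link parameters and the pulse shape, and the corresponding expression for $\Delta u_y$ follows by swapping the polarizations.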

Considering the lumped nonlinear compensation methods, a block diagram of the equalization module is presented in Fig. 1, where the pre-processing buffer generates appropriate inputs for a given method. Specifically, it includes a module that calculates appropriate PNLC triplets for the regular perturbation-based method or for an artificial neural network nonlinear compensation (ANN-NLC) approach that operates on externally generated triplet features [6]. In ANN-NLC solutions that directly operate on Rx-DSP outputs [8,10,18,19], this pre-processing buffer is tasked with providing an extended block of soft symbols needed to efficiently equalize the nonlinear interference.

Fig. 1. Block diagram for lumped perturbation-based nonlinear compensation.

Considering the first-order perturbation as the dominant nonlinear term, an appropriate scaling can be employed to adapt the nonlinear error estimates in case the training and inference stages are performed at different optical launch powers:

$$\alpha = 10^{\bigl(P_\text{inference}(\text{dB}) - P_\text{train}(\text{dB})\bigr)/10},$$
where $P_\text{train}$ is the optical launch power for the training dataset and $P_\text{inference}$ is the respective optical launch power of the data in the inference (equalization) stage.
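
A one-line helper reproducing Eq. (2) is shown below; the function name and the assumption that the scaling is applied multiplicatively to the ANN nonlinear-error estimates are illustrative.

```python
def rescale_nl_estimates(nl_estimates, p_train_dbm, p_inference_dbm):
    """Scale nonlinear-error estimates per Eq. (2) when inference power differs from training."""
    alpha = 10 ** ((p_inference_dbm - p_train_dbm) / 10)  # linear power ratio
    return alpha * nl_estimates
```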

In this work, we consider various lumped ANN structures for fiber nonlinearity compensation in DSCM systems. We describe an evolutionary approach to designing advanced ANN models that do not rely on externally generated features (such as triplets). Hence, by using input symbols in a delay-line format, the model learns relevant features according to the imposed structure through the available layers. The proposed ANN-NLC equalizers estimate the nonlinear distortions of each subcarrier in one polarization of a DSCM signal given the relevant information from all digital subcarriers across both polarizations. Due to the nature of signal propagation in fiber and symmetries in the medium, it has been shown that the same model can be used to generate nonlinear error estimates for the other polarization by simply swapping the input signals with their respective counterparts from the other polarization. This alleviates the need to train separate models for each polarization and enables efficient learning of a more generalized model.

3. Multi-purpose ANN-core structure

The presented ANNs mainly explore different higher-level structures that govern the interaction between each target digital subcarrier and its neighbors, in search of more powerful and efficient models to learn intra- and inter-subcarrier nonlinear distortion. Hence, the models are universal from the ANN-core choice perspective, and the core can be replaced with other designs. Specifically, we choose the building block for all the proposed models in this work to be an ANN core comprising a combination of CNN and LSTM layers. The first layer is a 1-dimensional CNN followed by a Leaky ReLU activation function, which is tasked with feature generation. The CNN features are fed into an LSTM module with a bi-directional structure to extract the time dependency of the input features.

Real-time processing in coherent optical receivers requires a great deal of parallelization. Hence, a block of input symbols is processed at each clock cycle and the results are delivered to the next module in the pipeline. In order to address this requirement and also save computational resources for equalization, one can share the overhead corresponding to the initialization of each LSTM chain by forming longer chains. By employing block-processing, we expand the input and output sequences to provide nonlinear estimates for a block of $N$ consecutive time instances in one round of forward and backward LSTM state transitions.

With this modification, we exploit the excellent efficiency of LSTMs in handling memory. Generally, LSTMs are highly suitable for reducing the processing overhead of a sequential input stream since they aim to capture the most relevant representations of the past observed inputs in the form of hidden states. These hidden state variables are updated as new inputs are processed sequentially. However, the output remains an explicit function of the inputs and hidden state variables at every time instance. Consequently, equalization of any extra input only increases the total computation by one extra LSTM processing step. To leverage this capability, simplify training, and avoid the challenges of long back-propagation through time in LSTMs, these neural networks are trained with regular symbol-based processing while block-processing is employed during deployment and evaluation. Note that by using block-processing in the equalization path, we introduce an approximation into a network that was trained with different initial hidden states. However, with a long-enough training block size and a sufficient filter-tap size, one can show that the changes in the states are minimal [19]. This is reflected in the complexity figures, as we deploy trained models with different block sizes $N$ in the numerical results.

A block diagram of the proposed ANN equalization core is depicted in Fig. 2. The LSTM network is trained using a fixed sequence of features corresponding to $2k+1$ time instances, where $k$ is the filter-tap size on each side of the target symbol. In the equalization path, we deploy the same network over input feature sequences corresponding to $2k+N$ time instances to obtain output features associated with the symbols in the middle $N$ time slots. In this case, input features corresponding to the first $k+N$ time instances, $i \in \{-k+1,\ldots,N\}$, are sequentially fed into a forward LSTM unit initialized with zero memory, producing output features and evolving the internal memory states. Similarly, a backward LSTM unit starts with zero memory and evolves using the CNN features corresponding to the last $k+N$ time instances of the $2k+N$ window, $i \in \{1,\ldots, N+k\}$, in the opposite direction. The outputs of the forward and backward LSTM modules for the middle $N$ time instances are concatenated to form the LSTM block outputs. Finally, the LSTM block outputs may pass through a linear or a multi-layer perceptron (MLP) stage with Leaky ReLU activation functions (for all but the last layer) that ultimately provides estimates of the real and imaginary parts of the nonlinear interference per output. Note that, as we discuss further in Section 4, the final MLP layer can be separated from the ANN core and trained individually in some architectures.
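
A minimal PyTorch sketch of this CNN+bi-LSTM core is given below. The 4-channel input layout (in-phase and quadrature of both polarizations), the Leaky ReLU activation, and the middle-$N$ slicing follow the description above; the class and argument names, the default hyper-parameter values, and the use of a single bidirectional LSTM over the full $2k+N$ window (which produces the same middle outputs as the separately clocked forward/backward units, at the cost of a few redundant steps) are illustrative choices rather than the authors' implementation.

```python
import torch
import torch.nn as nn

class ANNCore(nn.Module):
    def __init__(self, in_channels=4, n_filters=24, n_kernel=11, n_hidden=32, n_mlp=0):
        super().__init__()
        # 1-D CNN feature generator followed by Leaky ReLU (no padding, unit stride)
        self.cnn = nn.Conv1d(in_channels, n_filters, kernel_size=n_kernel)
        self.act = nn.LeakyReLU()
        # bidirectional LSTM extracts the time dependency of the CNN features
        self.lstm = nn.LSTM(n_filters, n_hidden, batch_first=True, bidirectional=True)
        # output stage: 2 estimates per time slot (real/imag of the nonlinear error)
        if n_mlp > 0:
            self.head = nn.Sequential(nn.Linear(2 * n_hidden, n_mlp),
                                      nn.LeakyReLU(),
                                      nn.Linear(n_mlp, 2))
        else:
            self.head = nn.Linear(2 * n_hidden, 2)

    def forward(self, x, n_out=1):
        # x: (batch, in_channels, 2t + n_out) window of soft symbols,
        # i.e. t taps on each side of the n_out target time instances
        feats = self.act(self.cnn(x)).transpose(1, 2)   # (batch, 2k + n_out, n_filters)
        out, _ = self.lstm(feats)                       # (batch, 2k + n_out, 2*n_hidden)
        k = (out.shape[1] - n_out) // 2
        middle = out[:, k:k + n_out, :]                 # keep the middle n_out time slots
        return self.head(middle)                        # (batch, n_out, 2)
```

The same instance can be trained with n_out = 1 (symbol-based processing) and later deployed with a larger n_out in the block-processing equalization path.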

Fig. 2. Multi-purpose ANN-core structure for the equalization path.

In order to get a measure of complexity for the multi-purpose ANN core, we consider a CNN-LSTM network with an MLP output layer. Let us consider the equalization of $N$ symbols with a processing window of $N_{w} = 2t+N$. The number of real multiplications per symbol (RM) for the CNN is:

$$CNN_{RM} = \frac{4N_fN_{ke}(N_{w}-N_{ke}+1)}{N},$$
where $N_f$ is the number of filters and $N_{ke}$ is the kernel size, for four input channels corresponding to the in-phase and quadrature symbols of the X and Y polarizations. In case information from multiple subcarriers is fed as input to the ANN core, $N_f$ should be scaled accordingly. The convolutional layer is assumed to have zero padding with unit stride and dilation. For the LSTM network, consider the input sequence length for each direction to be $N_s = k + N$, where $k = t - (N_{ke}-1)/2$ is the extra symbol length at each side of the LSTM input. In this case, the combined RM for the forward and backward LSTMs is given by:
$$LSTM_{RM} = \frac{2N_sN_h(4(N_f+N_h)+3)}{N},$$
where $N_h$ is the hidden size. Finally, for an MLP with a single hidden layer at the output of the LSTM network, the RM is given by:
$$MLP_{RM} = 2n_mN_h + 2n_m,$$
where $n_m$ is the hidden layer size. If the MLP contains more than one hidden layer, the extra multiplications should be added accordingly. Furthermore, in the absence of any hidden layer, $MLP_{RM} = 4N_h$, where the factor of 4 accounts for the two directions of the LSTM and the two real estimates for the in-phase and quadrature components per output.
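
The three expressions above can be collected into a small helper that returns the per-symbol real-multiplication count of a single ANN core; the variable names follow Eqs. (3)-(5), the default of four input channels corresponds to one subcarrier, and this is a bookkeeping sketch rather than the accounting code behind the reported figures.

```python
def ann_core_rm_per_symbol(n_f, n_ke, n_h, t, n_block, n_mlp=0, in_channels=4):
    """Real multiplications per output symbol for one CNN + bi-LSTM (+ MLP) core."""
    n_w = 2 * t + n_block                     # processing window length N_w = 2t + N
    k = t - (n_ke - 1) // 2                   # extra symbols per side after the CNN
    n_s = k + n_block                         # LSTM sequence length per direction
    cnn_rm = in_channels * n_f * n_ke * (n_w - n_ke + 1) / n_block        # Eq. (3)
    lstm_rm = 2 * n_s * n_h * (4 * (n_f + n_h) + 3) / n_block             # Eq. (4)
    if n_mlp > 0:
        mlp_rm = 2 * n_mlp * n_h + 2 * n_mlp                              # Eq. (5)
    else:
        mlp_rm = 4 * n_h                      # linear head: 2 outputs x 2N_h inputs
    return cnn_rm + lstm_rm + mlp_rm

# example: per-core cost without and with block processing (N = 1 vs N = 1024)
# ann_core_rm_per_symbol(24, 11, 32, t=20, n_block=1)
# ann_core_rm_per_symbol(24, 11, 32, t=20, n_block=1024)
```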

Note that to obtain the complexity of each structure in Section 4, we need to calculate and accumulate the RMs associated with the ANN cores in the equalization path for all subcarriers. Thus, we mainly use the number of real multiplications per super-symbol (RMpS) as the complexity metric for each realization of an architecture. A super-symbol denotes the combined output symbols of all digital subcarriers across one polarization at each time instance. While we limit the scope of this paper to a DSCM system with four subcarriers, this metric enables us to further compare the results with other single-carrier and DSCM transmission systems that operate at a similar baud rate and are tailored for the same throughput in future studies.

4. ANN structures for NLC in DSCM

4.1 Common-core (CC)

The first structure for joint NLC in DSCM is a fully-connected, black-box approach that contains only one ANN core. As depicted in Fig. 3(a), this single ANN core is tasked with providing nonlinear distortion estimates for all subcarriers of one polarization using a window of received symbols from all subcarriers in both polarizations. Note that employing a model with a common core (CC), which lacks any enforced structure that separates the iSPM and iXPM nonlinear contributions, can be seen as a double-edged sword. On one hand, it increases the number of training parameters compared to a specialized physics-informed ANN where a predetermined structure is enforced on the ANN architecture. On the other hand, by not imposing any structure on the construction of the network, we allow maximum entanglement of iSPM and iXPM features through the different layers of the ANN core. This can potentially lead to higher efficiency by allowing the network to avoid duplicating terms that can be shared within a single, fully-connected structure. However, there is always the possibility that the ANN core structure may not be inherently powerful enough for the underlying nonlinear mechanism to learn all the appropriate features, even when higher-complexity realizations are allowed. This could severely limit the performance, especially in the absence of adequate training data, and defeat the purpose.

Fig. 3. ANN-NLC structures: (a) common-core and (b) separate-core per subcarrier.

4.2 Separate-core (SC) per subcarrier

In order to obtain a subcarrier-based structure and parallelize the model, a separate ANN core is dedicated to generating nonlinear distortion estimates for each subcarrier output, resulting in a separate-core (SC) per-subcarrier architecture. This design is illustrated in Fig. 3(b). Note that, similar to CC, the ANN cores in SC still operate on input information from all subcarriers. The motivation here is to employ separate and smaller cores per subcarrier in order to be more effective in fine-tuning the model parameters. This is important since inner and outer subcarriers may experience different balances of iSPM and iXPM nonlinear distortions. Also, in terms of flexibility, in case some subcarriers are inactive due to network throughput demands, such as hitless capacity upgrades or P2MP scenarios, the parallel design of SC could be deployed more efficiently than the single connected-core architecture of CC. However, one potential drawback of this structure is that resource sharing between the equalization paths of different subcarriers is prevented.

4.3 Modular-I (M1)

In order to obtain more flexible ANN-NLC models, we move on from the black-box approach and incorporate deeper insights from the perturbation analysis of fiber nonlinearity. Specifically, the underlying mathematics behind the iSPM triplet coefficients in the perturbation analysis depends only weakly on the absolute position of a subcarrier in the spectrum [4].

Additionally, the iXPM nonlinearity mechanism relies on the relative position of the target and interfering subcarriers. Hence, only a small set of iSPM and iXPM cores needs to be trained, and multiple instances of the trained ANN cores can be deployed in the equalization path. Furthermore, smaller and more efficient networks can be deployed by involving only the iXPM contributions of the immediate neighboring subcarriers, where the iXPM contributions are strongest [4]. Figure 4 illustrates a set of ANN cores for the M1 design, where one iSPM and four iXPM cores are trained to model the intra- and inter-subcarrier nonlinearities for up to two neighboring subcarriers on each side. Note that the input to an iSPM core is a window of the target subcarrier symbols, while the iXPM cores employ symbols from both the target and interfering subcarriers.

Fig. 4. Structural design of ANN-NLC using Modular-I. (a) illustrates the trained cores and (b) illustrates the implementation for a 4-subcarrier system.

Let us take a look at an implementation of the M1 ANN equalizer based on the suggested trained modules for a DSCM system with four subcarriers. The block diagram of this modular ANN equalizer is depicted in Fig. 4(b), where four iSPM cores compensate the self-nonlinearities originating from each subcarrier. Moreover, the two inner and the two outer subcarriers additionally employ three and two iXPM cores, respectively. Note that ANN cores with the same color share the same layouts and weights, leading to more efficient training, specifically with limited data. Provided that the channel parameters, subcarrier bandwidth, and spacing remain the same, additional cores with the learned weights and biases from this example can be deployed for systems with a higher number of subcarriers. With a proper training strategy, the proposed structure allows us to separate the iSPM and iXPM contributions and informatively direct computational resources to the best route. This is evident in the numerical results, where we explore moving beyond iSPM compensation for various modular designs.

Another advantage of this modular design appears in certain scenarios, such as hitless capacity upgrades or P2MP operation, wherein certain subcarriers may be turned off. In this case, SC and especially CC models trained over a fully loaded system may not be efficiently utilized, as the statistics of the inputs to the ANN core(s) corresponding to the deactivated subcarrier(s) would be vastly different from the training stage. Additionally, it would be almost impossible to effectively identify and disable routes within the ANN that correspond to the absent subcarriers in order to save power or reduce penalty. However, a modular design can be readily reconfigured to accommodate such scenarios by deactivating the equalization paths corresponding to the absent subcarriers, leading to a flexible and power-efficient deployment.
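
The sketch below illustrates how such an M1 equalizer can be assembled for four subcarriers from one shared iSPM core and a few shared iXPM cores (weight sharing across instances corresponds to the same-color cores in Fig. 4). The `make_core` factory, the class name, and the simple summation of per-core estimates are illustrative assumptions; in practice each core would be an instance of the CNN+LSTM core of Section 3 with an input channel count matching the number of involved subcarriers.

```python
import torch
import torch.nn as nn

class ModularM1(nn.Module):
    """Modular-I equalizer sketch: shared iSPM core + shared iXPM cores per offset."""
    def __init__(self, make_core, n_subcarriers=4, offsets=(-2, -1, 1, 2)):
        super().__init__()
        self.n_sc = n_subcarriers
        self.ispm = make_core(n_sc_inputs=1)               # one shared iSPM core
        # one shared iXPM core per neighbour offset, reused for every subcarrier pair
        self.ixpm = nn.ModuleDict({str(o): make_core(n_sc_inputs=2) for o in offsets})

    def forward(self, sc_windows):
        # sc_windows[i]: (batch, 4, window) soft-symbol window of subcarrier i (I/Q, X/Y)
        estimates = []
        for i in range(self.n_sc):
            est = self.ispm(sc_windows[i])                 # intra-subcarrier contribution
            for off, core in self.ixpm.items():
                j = i + int(off)
                if 0 <= j < self.n_sc:                     # only existing neighbours contribute
                    pair = torch.cat([sc_windows[i], sc_windows[j]], dim=1)
                    est = est + core(pair)                 # accumulate iXPM contribution
            estimates.append(est)                          # nonlinear estimate for subcarrier i
        return estimates

# e.g. make_core = lambda n_sc_inputs: ANNCore(in_channels=4 * n_sc_inputs)
```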

4.4 Modular-II (M2)

The next step in the evolution of ANN-NLC for DSCM is rooted in two observations. First, the perturbation analysis [5,17,20] suggests that the iXPM perturbation coefficients $C_{m,n}^{(-\ell )}$ governing the interaction of subcarrier $i$ and its $\ell$-th neighbor on the right, $i+\ell$, are similar to those of subcarrier $i$ and its $\ell$-th neighbor on the left, $i-\ell$, provided that we employ a simple transformation, i.e.,

$$C_{m,n}^{(-\ell)} = {C_{{-}m,n}^{(\ell)}}^*,$$
where $m$ and $n$ are the symbol indices. Additionally, since these perturbation coefficients mainly rely on the relative position of the subcarriers, the similarity can be extended to subcarrier $i+\ell$ and its $\ell$-th neighbor on the left, $i$. Note that the iXPM$(+\ell )$ and iXPM$(-\ell )$ cores in M1 for subcarriers $i$ and $i+\ell$, respectively, are fed solely by inputs from these two subcarriers. This hints at potential computational savings by merging the iXPM$(+\ell )$ and iXPM$(-\ell )$ cores that operate on the same subcarriers in M1 into a super core iXPM$(\pm \ell )$, potentially obtaining a more efficient structure that preserves similar performance levels at a lower complexity.

The output features of these super-cores, along with the appropriate iSPM features, are passed to MLP modules prior to aggregation for each subcarrier. Note that the MLP layers are detached from the ANN cores in this design, and a set of $2\ell +1$ MLP modules is trained in this approach to model the integration of the iSPM features with up to $2\ell$ iXPM core features involving neighboring subcarriers. The trained MLP modules are appropriately instantiated in the inference path for each subcarrier. Figure 5 shows a block diagram of this model with four subcarriers and $\ell =2$.
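
A sketch of this idea is shown below: the shared iXPM$(\pm\ell)$ super-core features are computed once per subcarrier pair and consumed by the detached MLP heads of both involved subcarriers. The factory functions, class names, and the use of one head per subcarrier position (instead of explicitly enumerating the $2\ell+1$ head types) are simplifying assumptions for illustration.

```python
import torch
import torch.nn as nn

class ModularM2(nn.Module):
    """Modular-II sketch: merged iXPM super-cores with detached per-subcarrier MLP heads."""
    def __init__(self, make_feature_core, make_head, n_subcarriers=4, max_l=1):
        super().__init__()
        self.n_sc, self.max_l = n_subcarriers, max_l
        self.ispm = make_feature_core(n_sc_inputs=1)       # shared iSPM feature core
        # one merged iXPM(+/-l) super core per neighbour distance l
        self.super_cores = nn.ModuleDict({str(l): make_feature_core(n_sc_inputs=2)
                                          for l in range(1, max_l + 1)})
        # detached MLP heads (here one per subcarrier position in a fixed 4-SC layout)
        self.heads = nn.ModuleList([make_head(position=i) for i in range(n_subcarriers)])

    def forward(self, sc_windows):
        ispm_feats = [self.ispm(w) for w in sc_windows]    # per-subcarrier iSPM features
        pair_feats = {}
        for l in range(1, self.max_l + 1):
            for i in range(self.n_sc - l):
                pair = torch.cat([sc_windows[i], sc_windows[i + l]], dim=1)
                # computed once per pair, reused by both subcarriers i and i + l
                pair_feats[(i, i + l)] = self.super_cores[str(l)](pair)
        estimates = []
        for i in range(self.n_sc):
            feats = [ispm_feats[i]]
            for l in range(1, self.max_l + 1):
                if i - l >= 0:
                    feats.append(pair_feats[(i - l, i)])
                if i + l < self.n_sc:
                    feats.append(pair_feats[(i, i + l)])
            estimates.append(self.heads[i](torch.cat(feats, dim=-1)))
        return estimates
```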

Fig. 5. Modular-II design for DSCM ANN-NLC.

In summary, the potential efficiency advantage of M2 is twofold. First, merging cores that are believed to contain a significant amount of shared computation for feature generation can increase model efficiency. Second, reducing the number of distinct parameters of a network by replicating trained modules can greatly improve training efficiency and model generalization. Also, as mentioned before, the modular design provides additional flexibility in crafting more intelligent solutions for different network operation scenarios.

5. Numerical results

5.1 System model

The simulation setup includes typical Tx, channel, and Rx modules for a DSCM transmission scenario. To focus on fiber nonlinearity, we consider ideal electrical components and an ideal Mach-Zehnder modulator. Additionally, the DACs/ADCs are ideal, with no quantization or clipping effects. The dual-polarization fiber channel is modeled by the split-step Fourier method [21] with adaptive step size and a maximum nonlinear phase rotation of 0.05 degrees to ensure sufficient accuracy. At the Rx side, the output sequence from the carrier recovery (CR) is used to train and evaluate the nonlinear equalizer. Standard DSP algorithms are employed for detection and processing of the received signal at the Rx. The block diagram of this system is depicted in Fig. 6. Note that, to retain the ability of a conventional coherent receiver to correct correlated phase noise (which here originates from nonlinear propagation), we deployed the carrier recovery before the ANN-NLC module. This ensures that the linear equalization already provides the nonlinear phase compensation capability of a coherent receiver without a dedicated NLC module. Hence, the reported ANN-NLC gains are relative to the best linear performance.

Fig. 6. System model for DSCM system.

To evaluate and optimize the different algorithms, we focus on a single-channel DSCM system operating at 32 Gbaud with four subcarriers and a uniform 16QAM modulation format. The signal on each subcarrier is digitally generated using a root-raised-cosine pulse shape with a roll-off factor of $1/16$. The link consists of 40 spans of standard single-mode fiber of 80 km length, each followed by an optical amplifier with a $6$ dB noise figure. Furthermore, for most of the numerical results we consider a symmetric dispersion map, in which 50% of the total dispersion is digitally pre-compensated at the transmitter side. This in turn allows us to simplify the diagrams and avoid unnecessary complications at this stage. Section 6 is devoted to the extension of this design to other dispersion maps, where we provide ANN-NLC structures optimized for a post chromatic dispersion compensation (CDC) scenario. The training and evaluation of the models are performed using datasets obtained at $2$ dBm launch power. This is close to the optimal launch power when DBP at 2 Sa/sym with 1 and 2 steps per span is employed to benchmark these results. Note that for this setup, a Q-factor of $Q=7.88$ dB is obtained at the optimal launch power of $1$ dBm in the absence of fiber-nonlinearity compensation.

5.2 ANN optimization workflow

All the models here are trained and evaluated on simulation data using $2^{18}$ symbols per digital subcarrier. The training and evaluation data are generated from pseudo-random streams with different generator seeds using a permuted congruential generator (PCG64). Also, $20\%$ of the training dataset was set aside for validation of the model during the training process. The root mean squared error (RMSE) between the model outputs and the nonlinear error (the difference between the transmitted symbols and the received values) constitutes the loss, which is used in the back-propagation process to update the model coefficients. All models were trained using the Adam optimizer with a learning rate of $0.001$ for at least $200$ epochs, unless terminated by the early-stopping mechanism that tracks the validation loss and prevents over-fitting. We mainly used mini-batches of length $512$ in obtaining these results. Minor performance differences were observed when exploring mini-batch sizes as low as $128$ and as high as $2048$, provided that the learning rate and the number of epochs were optimized accordingly. Additionally, we employed a learning-rate scheduler that reduces the learning rate by $20\%$ when the loss stops decreasing for 10 epochs. For each model, the coefficients associated with the lowest validation loss across all training epochs were saved at the end of the training stage. Note that the code for this simulation setup, along with the ANN algorithms, is implemented in Python using the PyTorch library.
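
A condensed PyTorch sketch of this workflow is given below. The optimizer, learning rate, scheduler factor, and best-checkpoint selection follow the description above; the early-stopping patience of 10 epochs, the loader variables, and the function signature are assumptions made for illustration.

```python
import copy
import torch

def train(model, train_loader, val_loader, epochs=200, patience=10):
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    # cut the learning rate by 20% when the tracked loss plateaus for 10 epochs
    sched = torch.optim.lr_scheduler.ReduceLROnPlateau(opt, factor=0.8, patience=10)
    best_loss, best_state, stall = float("inf"), None, 0
    for epoch in range(epochs):
        model.train()
        for x, err in train_loader:                  # err = tx_symbols - rx_soft_symbols
            opt.zero_grad()
            loss = torch.sqrt(torch.mean((model(x) - err) ** 2))   # RMSE loss
            loss.backward()
            opt.step()
        model.eval()
        with torch.no_grad():                        # validation RMSE over the held-out 20%
            val = torch.sqrt(torch.mean(torch.cat(
                [(model(x) - err) ** 2 for x, err in val_loader]).flatten()))
        sched.step(val)
        if val < best_loss:                          # keep the best-validation checkpoint
            best_loss, best_state, stall = val.item(), copy.deepcopy(model.state_dict()), 0
        else:
            stall += 1
            if stall >= patience:                    # simple early stopping (assumed patience)
                break
    model.load_state_dict(best_state)
    return model
```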

In order to explore the performance versus complexity tradeoff, more than a thousand models of each design are trained. After training, each model is tested over different block sizes. Table 1 lists the ANN-core hyper-parameters and their sweeping ranges. The sweeping resolution of each parameter within each participating ANN core is individually adjusted for each model structure. We use scatter plots reflecting the performance-complexity of different realizations of each model based on a common test dataset obtained from a separate transmission simulation using noise and bit sequences generated by different random-number generator algorithms and seeds. The envelope associated with the best-performing models at various complexity constraints is generated in order to compare the different architectures.
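
As an aside, such a performance-complexity envelope can be obtained with a simple Pareto-front pass over the scattered (complexity, Q-factor) points; the helper below is purely illustrative post-processing, not the authors' code.

```python
def performance_envelope(points):
    """points: iterable of (rmps, q_db) tuples -> Pareto front sorted by complexity."""
    best_q = float("-inf")
    front = []
    for rmps, q in sorted(points):      # ascending complexity
        if q > best_q:                  # keep only realizations that improve the best Q so far
            front.append((rmps, q))
            best_q = q
    return front
```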

Table 1. List of hyper-parameters for ANN core that operates on a sequence length $T=2t+1$ with $t \in [5:40]$.

5.3 Numerical results comparison

Here, we provide a performance versus complexity tradeoff comparison of the various optimized ANN equalizers over different block sizes. Figure 7 illustrates the inference cost of the various models in terms of RMpS. From an ANN design point of view, it is important to allocate additional complexity efficiently in order to improve performance, since the majority of the models demonstrate subpar efficiency. For example, increasing the hidden size of the LSTM may not be an efficient strategy to improve performance if the filter-tap size $k$ is not large enough to capture the nonlinear memory.

Fig. 7. A comparison of performance as a function of RMpS among different explored ANN-NLC solutions for DSCM.

It can be seen that using a separate ANN core for each subcarrier did not significantly change the outcome of SC compared to CC. Their best performance remains around 8.8 dB, and the performance-complexity tradeoffs for these models remain very similar for all block sizes. One can clearly observe various advantages of the modular solutions compared to the black-box approaches represented by CC and SC. Both modular solutions offer a clear advantage in both the low- and high-complexity regions, while the M2 structure, specifically, demonstrates a superior performance-complexity tradeoff across all complexity regions among all structures. Note that the performance of iSPM-only compensation is capped around 8.6 dB. Employing additional cores to compensate for the inter-subcarrier nonlinearities due to the immediate neighboring subcarrier ($\ell =1$) on each side (iXPM1) can significantly increase the maximum performance to around 9 dB, unlocking 0.4 dB of gain compared to iSPM compensation at 2 dBm launch power. We further explored another scenario by incorporating the iXPM contributions of two subcarriers from each side. However, the results are omitted as we did not observe a meaningful additional performance gain in this scenario. This result is corroborated by the findings in Section 6, where we show the perturbation coefficients corresponding to the iXPM contributions of the second neighbors for this setup; the magnitude of these coefficients is around 10 dB lower than the iXPM contributions from the immediate neighbors.

Note that the best performance obtained from the modular solutions is generally 0.2 dB higher than CC and SC. This suggests that these solutions can learn more efficiently from limited training data due to a more generalized structure with fewer trainable parameters. The tradeoff between performance and complexity in the mid-tier performance region with Q around 8.6 dB is particularly noteworthy, where the non-modular designs can compete with M1. Note that this region is the onset of switching away from iSPM-only NLC to incorporating iXPM nonlinearities from the immediate neighboring subcarriers. This suggests that the CC and SC architectures can converge to moderately efficient structures by internally sharing the resources of iXPM compensation between neighboring subcarriers. This type of resource sharing is one of the main distinctive features of the M2 model compared to M1, which is reflected in its superior efficiency in this region.

In order to demonstrate the advantages of block-processing, performance versus complexity evaluations for different block sizes are illustrated in Fig. 8 for the M2 model as an example. A substantial complexity reduction for a very minimal performance loss can be obtained by parallelizing the trained ANN core and deploying the solution with a block size $N>1$, provided that the model is sufficiently generalized in the training stage. In the high-performance region ($Q>8.8$ dB), we can achieve a complexity reduction by a factor of $20$ for $N=1024$. However, the complexity advantage shrinks in lower performance regions (e.g., a factor of $5$ for $Q\sim 8.4$ dB), where the best models generally have a lower filter-tap size and incorporate less nonlinear memory.

Fig. 8. Impact of block-size on performance vs. complexity of the best M2 models.

Next, the performance envelopes for all models as a function of the number of training parameters are depicted in Fig. 9. The number of training parameters is related to the memory required to store and retrieve model parameters as the link configuration is modified over time. This metric also measures the efficiency of a model in providing a certain performance level with the fewest independent parameters, which is closely tied to the generalization of the ANN. For a mid-tier performance of around 8.6 dB, the modular solutions generally require approximately 2 to 4 times fewer parameters than CC and SC. Note that the CC and SC solutions in principle have access to all subcarrier information and are not limited to the iSPM+iXPM1 architectures of M1 and M2. However, this assumed advantage results in a significant loss for the CC and SC solutions when the number of training parameters is below 40,000. We attempted to close this performance gap by increasing the number of epochs for the non-modular solutions and further optimizing the learning rates, without much success. This may indicate that practical ANN design in the presence of various limitations and constraints for this problem is far from a plug-and-play exercise and requires careful design using insights from the physical model.

Fig. 9. A comparison of performance as a function of number of training parameters amongst different explored ANN-NLC solutions for DSCM.

Next, we explore the applicability of the proposed models to similar links operating at different optical launch powers. Figure 10 illustrates the performance as a function of the optical launch power, where multiple graphs are presented for the best models obtained under different complexity budget constraints. As stated earlier, all models were trained at $2$ dBm optical launch power. Note that the selected models from all structures demonstrate good generalization and can provide nonlinear performance gains over a wide range of launch powers, spanning from the linear regime to the deeply nonlinear regime. We provide DBP plots with different numbers of steps per span (StPS) to benchmark the performance of the various proposed ANN-NLC structures at different complexity levels. Unlike the ANN-NLC solutions that operate at the symbol rate, DBP operates at 2 Sa/sym to maintain its performance. Note that a complexity comparison with DBP in terms of RMpS is not performed here, since DBP faces implementation challenges beyond multiplication complexity, as mentioned in Section 1.

Fig. 10. Performance of different ANN-NLC solutions as a function of optical launch power for given complexity constraint budgets. DBP results with 1 and 2 steps per span are included in all panels as a performance reference.

Finally, Fig. 11 illustrates the constellation diagrams for all subcarriers with and without nonlinear compensation using M2. Note that the proposed ANN-NLC solution clearly improves the signal quality of all subcarriers without introducing constellation artifacts. Additionally, including a feed-forward carrier recovery (such as the maximum-likelihood (ML)-CR [22]), which is commonly part of the linear equalization, has a significant impact on the overall performance of the system. In this example, the ML-CR partially mitigates the nonlinear phase noise and improves the signal quality even without a dedicated nonlinear equalizer. This is evident by comparing the constellation diagrams for linear equalization with and without the ML-CR stage. The ANN-NLC improves the performance by addressing the remaining nonlinear phase and amplitude distortions.

Fig. 11. Impact of ML-CR and fiber nonlinearity compensation (M2 iSPM+iXPM1) on the signal constellation.

6. Impact of dispersion map

So far, we have shown the application of ANN-NLC equalizers in transmission scenarios with a symmetric dispersion map. As depicted in Fig. 12, the windows of symbols of interest from the target and interfering subcarriers for the iSPM and iXPM triplet features are symmetric around the reference symbols in a symmetric dispersion map. This is the main reason that symmetric windows of soft values are selected as inputs to the iSPM and iXPM cores in the previous designs. However, in the presence of an asymmetric dispersion map, such as post dispersion compensation, the regions of most significant iXPM features are neither symmetric nor centered around the reference symbol of the interfering subcarrier, as shown in Fig. 13. Hence, one needs to adjust the input features for each iXPM core according to the dispersion-induced group delay between the involved subcarriers. Another approach is to introduce delay lines at the input and output of the ANN equalizer and maintain a symmetric window of inputs for the ANN cores. Specifically, to ensure proper operation of the equalizer in this case, we introduce a progressive delay amounting to half of the dispersion-induced group delay between subcarriers prior to the ANN equalization. To reverse this impact, another delay line is added at the output of the ANN equalizer. Note that the window size for each iXPM core needs to be as large as the maximum group delay between the associated subcarriers. This ensures that the symbols that impact the target symbol are appropriately included. Figure 14 illustrates a block diagram of this solution.
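
The progressive delay and its inverse can be implemented as simple per-subcarrier symbol shifts, as in the sketch below; the `group_delay_syms` values (per-subcarrier dispersion-induced group delays in symbols, relative to a reference subcarrier) are assumed to be computed from the link parameters, and the circular shift stands in for a real delay line.

```python
import numpy as np

def apply_progressive_delay(subcarrier_symbols, group_delay_syms, undo=False):
    """Shift each subcarrier stream by half of its dispersion-induced group delay."""
    shifted = []
    for sc, tau in zip(subcarrier_symbols, group_delay_syms):
        shift = int(round(tau / 2))
        shifted.append(np.roll(sc, -shift if undo else shift))  # circular shift as a stand-in
    return shifted

# usage: align before the ANN equalizer, equalize, then undo the alignment
# aligned = apply_progressive_delay(sc_streams, gd_syms)
# equalized = ann_nlc(aligned)                  # hypothetical equalizer call
# restored = apply_progressive_delay(equalized, gd_syms, undo=True)
```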

Fig. 12. Magnitude of iSPM ($\ell =0$) and iXPM ($\ell \not =0$) perturbation coefficients $C_{m,n}^{(\ell )}$ for the DSCM simulation setup with sym-CDC: (a) $\ell =-2$, (b) $\ell =-1$, (c) $\ell =0$, (d) $\ell =1$, (e) $\ell =2$.

Fig. 13. Magnitude of iSPM ($\ell =0$) and iXPM ($\ell \not =0$) perturbation coefficients $C_{m,n}^{(\ell )}$ for the DSCM simulation setup with post-CDC: (a) $\ell =-2$, (b) $\ell =-1$, (c) $\ell =0$, (d) $\ell =1$, (e) $\ell =2$.

Fig. 14. Delay adjustment for post-CDC dispersion map.

We have modified the simulation setup to provide a performance comparison of a few ANN-NLC structures for the symmetric- and post-CDC scenarios. Similar trends are observed for the CC and M2 solutions with post-CDC in Fig. 15, attesting to the applicability of the proposed solution for addressing asymmetric dispersion maps. Note that similar performance gains are achieved by switching from iSPM to iSPM+iXPM1 nonlinear equalization for these schemes. Also, we observe that the complexity of all NLC solutions with post-CDC is higher than that of their respective counterparts with symmetric CDC for a given performance level. This can be attributed to the larger memory of the iSPM and iXPM nonlinearities in the link with post-CDC, which is corroborated by comparing the extent and magnitude of the perturbation coefficients in Fig. 12 and Fig. 13.

Fig. 15. Comparison on the impact of dispersion map on effectiveness of ANN-NLC using envelope associated to best performing models at different block-sizes.

7. Conclusion

In this work, we studied different ANN approaches for the compensation of intra-channel nonlinearities in DSCM systems. By training and evaluating various models over a comprehensive grid of parameters, we explored the performance-complexity tradeoff of each approach and discussed their scalability, potential, and weaknesses. Starting from black-box approaches to designing ANN models, we gradually moved towards modular designs inspired by the perturbation analysis of fiber nonlinearity. This approach proved to be more efficient in terms of training as well as inference complexity and model storage requirements. We further demonstrated a pragmatic approach to adapting the proposed solutions to links with asymmetric dispersion maps. While these networks were exclusively designed for fiber nonlinearity compensation, a similar approach can be further studied in the context of component nonlinearity compensation in DSCM systems.

Note that all these designs can be further optimized along other avenues. Notable approaches such as weight pruning and quantization, as well as a future extension to quantization-aware training in the form of quantized [12] and binary [23] neural networks, can be explored to drastically reduce the complexity of these models. Nevertheless, we believe that the presented study provides a fair comparison and a good starting point towards that goal by focusing on the macro design of ANN equalizers tailored to the characteristics of the fiber nonlinearity distortion mechanism in multi-subcarrier systems. Furthermore, for real-world application of the presented solutions, the models can be initially trained on simulation and offline data and then adapted to practical scenarios through transfer learning [24].

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Supplemental document

See Supplement 1 for supporting content (additional graphs omitted from the paper).

References

1. E. Ip and J. M. Kahn, “Compensation of dispersion and nonlinear impairments using digital backpropagation,” J. Lightwave Technol. 26(20), 3416–3425 (2008). [CrossRef]  

2. L. B. Du and A. J. Lowery, “Improved single channel backpropagation for intra-channel fiber nonlinearity compensation in long-haul optical communication systems,” Opt. Express 18(16), 17075–17088 (2010). [CrossRef]  

3. E. F. Mateo, F. Yaman, and G. Li, “Efficient compensation of inter-channel nonlinear effects via digital backward propagation in WDM optical transmission,” Opt. Express 18(14), 15144–15154 (2010). [CrossRef]  

4. A. Mecozzi and R.-J. Essiambre, “Nonlinear Shannon limit in pseudolinear coherent systems,” J. Lightwave Technol. 30(12), 2011–2024 (2012). [CrossRef]  

5. Z. Tao, L. Dou, W. Yan, L. Li, T. Hoshida, and J. C. Rasmussen, “Multiplier-free intrachannel nonlinearity compensating algorithm operating at symbol rate,” J. Lightwave Technol. 29(17), 2570–2576 (2011). [CrossRef]  

6. S. Zhang, F. Yaman, K. Nakamura, T. Inoue, V. Kamalov, L. Jovanovski, V. Vusirikala, E. Mateo, Y. Inada, and T. Wang, “Field and lab experimental demonstration of nonlinear impairment compensation using neural networks,” Nat. Commun. 10, 1–8 (2019). [CrossRef]  

7. C. Häger and H. D. Pfister, “Nonlinear interference mitigation via deep neural networks,” in Optical Fiber Communication Conference (Optical Society of America, 2018), paper W3A–4.

8. O. Sidelnikov, A. Redyuk, S. Sygletos, M. Fedoruk, and S. Turitsyn, “Advanced convolutional neural networks for nonlinearity mitigation in long-haul WDM transmission systems,” J. Lightwave Technol. 39(8), 2397–2406 (2021). [CrossRef]  

9. S. Deligiannidis, A. Bogris, C. Mesaritakis, and Y. Kopsinis, “Compensation of fiber nonlinearities in digital coherent systems leveraging long short-term memory neural networks,” J. Lightwave Technol. 38(21), 5991–5999 (2020). [CrossRef]  

10. P. J. Freire, Y. Osadchuk, B. Spinnler, A. Napoli, W. Schairer, N. Costa, J. E. Prilepsky, and S. K. Turitsyn, “Performance versus complexity study of neural network equalizers in coherent optical systems,” J. Lightwave Technol. 39(19), 6085–6096 (2021). [CrossRef]  

11. B. B. Hamgini, H. Najafi, A. Bakhshali, and Z. Zhang, “Application of transformers for nonlinear channel compensation in optical systems,” arXiv, arXiv:2304.13119 (2023). [CrossRef]  

12. I. Hubara, M. Courbariaux, D. Soudry, R. El-Yaniv, and Y. Bengio, “Quantized neural networks: Training neural networks with low precision weights and activations,” The J. Mach. Learn. Res. 18, 6869–6898 (2017).

13. J. W. Nevin, S. Nallaperuma, N. A. Shevchenko, X. Li, M. S. Faruk, and S. J. Savory, “Machine learning for optical fiber communication systems: An introduction and overview,” APL Photonics 6(12), 121101 (2021). [CrossRef]  

14. D. Krause, A. Awadalla, A. S. Karar, H. Sun, and K.-T. Wu, “Design considerations for a digital subcarrier coherent optical modem,” in Optical Fiber Communications Conference and Exhibition (2017), pp. 1–3.

15. M. Qiu, Q. Zhuge, M. Chagnon, Y. Gao, X. Xu, M. Morsy-Osman, and D. V. Plant, “Digital subcarrier multiplexing for fiber nonlinearity mitigation in coherent optical communication systems,” Opt. Express 22(15), 18770–18777 (2014). [CrossRef]  

16. P. K. A. Wai and C. R. Menyuk, “Polarization mode dispersion, decorrelation, and diffusion in optical fibers with randomly varying birefringence,” J. Lightwave Technol. 14(2), 148–157 (1996). [CrossRef]  

17. Y. Gao, J. C. Cartledge, A. S. Karar, S. S.-H. Yam, M. O’Sullivan, C. Laperle, A. Borowiec, and K. Roberts, “Reducing the complexity of perturbation based nonlinearity pre-compensation using symmetric EDC and pulse shaping,” Opt. Express 22(2), 1209–1219 (2014). [CrossRef]  

18. P. J. Freire, A. Napoli, B. Spinnler, N. Costa, S. K. Turitsyn, and J. E. Prilepsky, “Neural networks-based equalizers for coherent optical transmission: Caveats and pitfalls,” IEEE J. Sel. Top. Quantum Electron. 28(4), 1–23 (2022). [CrossRef]  

19. H. Ming, X. Chen, X. Fang, L. Zhang, C. Li, and F. Zhang, “Ultralow complexity long short-term memory network for fiber nonlinearity mitigation in coherent optical communication systems,” J. Lightwave Technol. 40(8), 2427–2434 (2022). [CrossRef]  

20. F. Frey, L. Molle, R. Emmerich, C. Schubert, J. K. Fischer, and R. F. Fischer, “Single-step perturbation-based nonlinearity compensation of intra-and inter-subcarrier nonlinear interference,” in European Conference on Optical Communication (IEEE, 2017), p. P1.SC3.53.

21. G. P. Agrawal, Nonlinear Fiber Optics, 2nd ed. (Academic, 1995).

22. X. Zhou, “An improved feed-forward carrier recovery algorithm for coherent receivers with m-qam modulation format,” IEEE Photonics Technol. Lett. 22(14), 1051–1053 (2010). [CrossRef]  

23. C. Yuan and S. S. Agaian, “A comprehensive review of binary neural network,” arXiv, arXiv:2110.06804 (2022). [CrossRef]  

24. P. J. Freire, D. Abode, J. E. Prilepsky, N. Costa, B. Spinnler, A. Napoli, and S. K. Turitsyn, “Transfer learning for neural networks-based equalizers in coherent optical systems,” J. Lightwave Technol. 39(21), 6733–6745 (2021). [CrossRef]  
