Joint intra and inter-channel nonlinear compensation scheme based on improved learned digital back propagation for WDM systems

Xinyu Chi; Chenglin Bai; Fan Yang; Qi Qi; Ruohui Zhang; Hengying Xu; Lishan Yang; Wanxiang Bi; Tianchi Chen; Shunchang Bai

doi:10.1364/OE.506995

1. Introduction

With the rapid development of the Internet, the explosive growth of network traffic places ever higher demands on the speed and capacity of optical fiber communication networks [1]. However, the capacity and transmission rate of optical fiber links are constrained by linear and nonlinear impairments, making these demands difficult to meet. Linear impairments, including chromatic dispersion (CD) and polarization-mode dispersion (PMD), can be effectively compensated by digital signal processing (DSP) algorithms, whereas the nonlinear impairment caused by the Kerr effect grows as the signal power and baud rate increase. In WDM systems in particular, nonlinear effects include not only self-phase modulation (SPM) within a channel but also cross-phase modulation (XPM) and four-wave mixing (FWM) from the other channels, and the nonlinear phase shifts caused by SPM and XPM severely distort the signal. In addition, an optical fiber communication system is neither purely linear nor purely nonlinear: linear impairments interact with nonlinear effects to some extent during transmission. Overcoming fiber nonlinearity is therefore both the key to optimizing system performance and a major difficulty [2].

Several effective nonlinear compensation algorithms have been proposed to alleviate the distortion caused by nonlinear effects. The digital back propagation (DBP) algorithm and its improved variants [3–6] alternately compensate CD and nonlinearity by solving the inverse signal propagation equation with the split-step Fourier method (SSFM). However, iterating DBP requires multiple Fourier transform pairs, and performance improves with the number of steps, so better performance comes at the cost of higher computational complexity. DBP is also a theoretical method that requires the parameters of the fiber link to be known, which poses great challenges when it is applied directly in practice. In addition, optical phase conjugation (OPC) [7], which compensates nonlinearity in the optical domain, nonlinear equalization based on the Volterra series [8], and nonlinear compensation based on perturbation theory [9,10] have also proved effective. However, OPC is costly and has low conversion efficiency in practical applications, which limits its performance. Because the Volterra-series method requires Fourier transform modules, its complexity increases with the accumulated CD. Perturbation-based compensation requires high computational complexity to reach the expected quantization accuracy.

In recent years, with the rapid development of machine learning, the powerful learning ability of neural networks has drawn worldwide attention. Neural networks can operate without much prior information about the link. Artificial neural networks (ANNs), convolutional neural networks (CNNs), and related models have therefore been introduced into fiber nonlinearity compensation to further improve system performance [11]. The recognition of the correlation among neighboring-symbol triplets has made memory neural networks a research hotspot. Long short-term memory (LSTM) networks and their variants [12–16], for example, can effectively compensate nonlinear impairments in coherent optical communication systems by memorizing the correlation between neighboring symbols. However, most of these neural-network-based methods are “black box” approaches that focus only on performance improvement, and their outputs and learning process are difficult to interpret. Researchers therefore combined theoretical models with neural networks and proposed the interpretable learned DBP (LDBP), which addresses the limitations of “black box” neural networks in nonlinear compensation. C. Häger et al. used deep neural networks to model the linear and nonlinear steps of DBP and conducted simulations in a single-channel single-polarization system [17]. Q. Fan et al. studied DBP based on deep neural networks in depth and alleviated nonlinear impairments in single-channel and WDM systems by optimizing the input parameters and the neural network structure [18]. D. Tang et al. combined DBP and nonlinear polarization crosstalk compensation (NPCC) with neural networks according to their physical meanings to address the nonlinear impairments in WDM systems [19]. T. Inoue et al. proposed an LDBP scheme that considers SPM and XPM to compensate the nonlinear distortion at a reasonable computational cost [20]. O. Sidelnikov et al. used a deep convolutional neural network to alleviate nonlinear distortions in long-haul fiber communication systems [21]. P. He et al. proposed an LDBP algorithm based on a layer-reduced neural network, which smoothed the power term in the nonlinear compensation model [22]. However, most of the above methods did not fully consider the complex correlation between linear and nonlinear impairments, and they ignored the disturbance of the signal nonlinearity caused by the intra-channel pulse-broadening effect and the inter-channel walk-off effect in WDM systems.

This article proposes a novel joint compensation scheme for both intra- and inter-channel nonlinearity based on an improved LDBP. In this scheme, CD is compensated by linear filters in a time-domain convolutional layer. According to the time-series and dispersion properties of the signal, the input features are adjusted so that the overlap-and-save method is appropriately combined with the pulse-broadening effect. Because the influence of CD on nonlinearity varies with factors such as fiber length and cannot be quantified, we improve the nonlinear compensation model to handle this uncertain interference. To account for the nonlinear interaction between neighboring symbols, we use the enhanced split-step Fourier method (ESSFM) [23] to improve the SPM compensation model, which effectively improves the accuracy of nonlinear compensation. At the same time, the XPM compensation model is improved by factorizing the walk-off effects between neighboring channels, which effectively addresses the asynchronous transmission of signal pulses.

The rest of the paper is organized as follows. Section 2 describes and analyzes the channel model, the principle of time-domain CD compensation (CDC), and the nonlinear compensation of the proposed scheme. Section 3 presents the simulation system, analyzes and discusses the corresponding results, and analyzes the computational complexity. Section 4 further verifies the effectiveness of the scheme through experiments. Section 5 summarizes the paper.

2. Principle of the improved LDBP

2.1. Channel model of PDM-WDM systems

For WDM systems, taking channel k as the target channel, considering both of its polarizations, and taking into account the influence of the neighboring channels, the signal propagation can be described by the following coupled nonlinear Schrödinger equations (NLSE) [24,25]:

$$\frac{\partial u_{kx}}{\partial z} = \underbrace{\left( -\frac{\alpha}{2} + \frac{j\beta_2}{2}\frac{\partial^2}{\partial t^2} + \frac{\beta_3}{6}\frac{\partial^3}{\partial t^3} \right)}_{\boldsymbol{D}} u_{kx} \underbrace{-\, j\left[ \gamma_{kk}\left( |u_{kx}|^2 + |u_{ky}|^2 \right) + \sum\limits_{n \ne k} \gamma_{nk}\left( 2|u_{nx}|^2 + |u_{ny}|^2 \right) \right]}_{\boldsymbol{N}} u_{kx} \tag{1}$$

$$\frac{\partial u_{ky}}{\partial z} = \underbrace{\left( -\frac{\alpha}{2} + \frac{j\beta_2}{2}\frac{\partial^2}{\partial t^2} + \frac{\beta_3}{6}\frac{\partial^3}{\partial t^3} \right)}_{\boldsymbol{D}} u_{ky} \underbrace{-\, j\left[ \gamma_{kk}\left( |u_{ky}|^2 + |u_{kx}|^2 \right) + \sum\limits_{n \ne k} \gamma_{nk}\left( 2|u_{ny}|^2 + |u_{nx}|^2 \right) \right]}_{\boldsymbol{N}} u_{ky} \tag{2}$$

The subscript n distinguishes different channels. $u_{kx}(z,t)$ and $u_{ky}(z,t)$ represent the x and y polarizations of the signal transmitted in channel k. D and N represent the linear and nonlinear parts of the transmission equation, respectively. $\beta_2$ and $\beta_3$ denote the second-order and third-order dispersion parameters, and $\alpha$ is the attenuation coefficient. $\gamma_{kk}$ is the nonlinear coefficient within the channel, and $\gamma_{nk}$ is the nonlinear coefficient between different channels. The first term in the nonlinear part N on the right-hand side of Eqs. (1) and (2) describes the nonlinear phase noise caused by SPM, and the second term describes the XPM-induced nonlinear phase noise. Because the nonlinear phase shift caused by SPM and XPM is the main cause of waveform distortion, the nonlinear polarization crosstalk induced by XPM is ignored in the above equations [20].
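To make the structure of the nonlinear operator N concrete, the following minimal Python sketch evaluates the bracketed term of Eq. (1) for the x polarization of a target channel at a single time instant. The function names, the `gamma[n][k]` table layout, and the short-step phase rotation (which neglects D over a step `dz`) are illustrative assumptions and are not taken from the paper.

```python
import cmath

def nonlinear_phase_x(u, k, gamma):
    """Bracketed term of N in Eq. (1) for the x polarization of channel k.

    u     : list of (u_x, u_y) complex samples, one pair per channel,
            all taken at the same time instant (hypothetical layout)
    gamma : gamma[n][k] nonlinear coefficients; gamma[k][k] is intra-channel
    """
    ukx, uky = u[k]
    # SPM term: gamma_kk (|u_kx|^2 + |u_ky|^2)
    spm = gamma[k][k] * (abs(ukx) ** 2 + abs(uky) ** 2)
    # XPM term: sum over n != k of gamma_nk (2|u_nx|^2 + |u_ny|^2)
    xpm = sum(
        gamma[n][k] * (2 * abs(unx) ** 2 + abs(uny) ** 2)
        for n, (unx, uny) in enumerate(u)
        if n != k
    )
    return spm + xpm

def nonlinear_step_x(u, k, gamma, dz):
    """Apply only N over a short step dz (D neglected, an assumption):
    u_kx -> u_kx * exp(-j * phase * dz)."""
    phase = nonlinear_phase_x(u, k, gamma)
    return u[k][0] * cmath.exp(-1j * phase * dz)
```

Because N only rotates the phase, the magnitude of the sample is unchanged by `nonlinear_step_x`; a compensation step would apply the rotation with the opposite sign.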

CD and the nonlinear phase shift caused by SPM and XPM are compensated by alternating linear and nonlinear compensation. Taking any three neighboring channels of the WDM system as an example, the corresponding architecture of the improved LDBP is shown in Fig. 1. The entire neural network structure is presented in Fig. 1(a). Note that a dispersion compensation layer and a nonlinear compensation layer together form one hidden layer of the neural network. The blue squares represent the dual-polarization signals of the different channels fed into the neural network. The green neurons denote the data passed to the nonlinear layer after time-domain CDC. The orange neurons represent the data passed to the next layer after nonlinear compensation. In the nonlinear operations, the purple lines denote the compensation of XPM effects between the target channel and the other channels, and the black lines denote the compensation of the SPM effect in each channel. After the alternating linear and nonlinear compensation is completed, the data is passed to further operations by the yellow and gray neurons, and the dark red neurons that follow them represent filters that compensate polarization-related nonlinear interactions. The yellow and gray neurons labeled with signals in the output layer are the signal sequences of the different channels obtained through learning and training.

Fig. 1. The architecture of improved LDBP. (a) The entire neural network structure; (b) Nonlinear compensation section.
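The alternating structure described above can be sketched in Python as follows. This is a structural illustration only, reduced to a single channel and a single polarization with an SPM-only nonlinear layer; the function names, the zero-padded filtering, and the use of one scalar coefficient per layer are assumptions, and in the actual scheme the filter taps and nonlinear parameters would be obtained by training.

```python
import cmath

def fir_filter(x, taps):
    """Same-length complex FIR filtering with zero padding
    (stands in for the time-domain CDC layer)."""
    half = len(taps) // 2
    out = []
    for i in range(len(x)):
        acc = 0j
        for j, h in enumerate(taps):
            idx = i + half - j
            if 0 <= idx < len(x):
                acc += h * x[idx]
        out.append(acc)
    return out

def nonlinear_layer(x, coeff):
    """Sample-wise phase rotation proportional to instantaneous power
    (SPM only; the sign is opposite to N in Eq. (1) for compensation)."""
    return [s * cmath.exp(1j * coeff * abs(s) ** 2) for s in x]

def ldbp_forward(x, layers):
    """Each hidden layer = one linear (CDC) layer followed by one nonlinear layer."""
    for taps, coeff in layers:
        x = nonlinear_layer(fir_filter(x, taps), coeff)
    return x
```

With a single unit tap and a zero nonlinear coefficient in every layer the network reduces to the identity, which is a convenient sanity check before training.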

Operation	Unified Expression (RMs)
Linear Layer	4(R + 1)N × 2
Nonlinear Layer	(4n + S + 17)N + 2N(log₂N-2) × 2
Polarization related filter	4(T + 1)×N × 2
Total	(N(4n + 8R + S + 25) + 4N(log₂N-2))N_layer+ 8(T + 1)N

Scheme	Unified Expression	RMs (×10⁸)
DBP [19]	8NN_spanN_step(log₂N + 2)	9.4372/4.7186
DCNN	2N(nS + 4R + 2n + 13)N_layer+ 8N(T + 1)	3.2873/3.2873
Improved LDBP	(N(4n + 8R + S + 25) + 4N(log₂N-2))N_layer+ 8(T + 1)N	2.9596/2.9596
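The unified expressions in the two tables above can be evaluated directly. The Python sketch below transcribes them as written; the function names are hypothetical, and the symbols N, n, R, S, T, N_layer, N_span, and N_step are treated as plain numeric inputs because their values are not given in this part of the text.

```python
from math import log2

def rm_dbp(N, n_span, n_step):
    """DBP: 8 N N_span N_step (log2 N + 2)."""
    return 8 * N * n_span * n_step * (log2(N) + 2)

def rm_dcnn(N, n, R, S, T, n_layer):
    """DCNN: 2N(nS + 4R + 2n + 13) N_layer + 8N(T + 1)."""
    return 2 * N * (n * S + 4 * R + 2 * n + 13) * n_layer + 8 * N * (T + 1)

def rm_improved_ldbp(N, n, R, S, T, n_layer):
    """Improved LDBP: (N(4n + 8R + S + 25) + 4N(log2 N - 2)) N_layer + 8(T + 1)N."""
    per_layer = N * (4 * n + 8 * R + S + 25) + 4 * N * (log2(N) - 2)
    return per_layer * n_layer + 8 * (T + 1) * N

def rm_improved_ldbp_by_rows(N, n, R, S, T, n_layer):
    """Same total, built from the per-operation rows of the first table."""
    linear = 4 * (R + 1) * N * 2
    nonlinear = (4 * n + S + 17) * N + 2 * N * (log2(N) - 2) * 2
    pol_filter = 4 * (T + 1) * N * 2
    return (linear + nonlinear) * n_layer + pol_filter
```

Summing the linear and nonlinear rows per layer and adding the polarization-related filter once reproduces the "Total" row, which confirms that the two tables are mutually consistent.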

Optics Express