Autoencoder assisted subcarrier optimization for nonlinear frequency division multiplexing

Peiyun Ge; Lixia Xi; Jiayun Deng; Jiacheng Wei; Shucheng Du; Xiaoguang Zhang

doi:10.1364/OE.500473

1. Introduction

According to Shannon’s theorems, in order to satisfy the increasing requirement of communication capacity, an available solution is to raise optical signal to noise ratio (OSNR) [1,2]. However, increasing the launch power to improve the OSNR will introduce nonlinear distortions due to the Kerr effect. Many techniques have been developed to overcome the nonlinear capacity limitation, such as digital backpropagation (DBP) [3,4], perturbation theory [5], optical phase conjugation [6], etc. However, these techniques treat nonlinearity as an undesired factor and mostly rely on approximate models derived from nonlinear Schrödinger equation (NLSE) [7,8], which means that they cannot completely eliminate nonlinear impairments. In recent years, an innovative technique called nonlinear frequency division multiplexing (NFDM) has attracted widespread attention [9]. The core of NFDM is the powerful mathematical tool named nonlinear Fourier transformation (NFT), which can linearize the nonlinear distance evolution of signal in the nonlinear spectrum defined by NFT [10]. Thus, it has the potential to realize nonlinear-free optical transmission.

As stated by NFT theory, the nonlinear spectrum of a given time domain signal consists of continuous and discrete part. According to the degree of freedom (DOF) used to encode the signal, NFDM can be divided into discrete spectral (DS) [11–15], continuous spectral (CS) [16–20] and full spectral (FS) [21–25] modulation, and the researches evolve from single polarization to dual polarization [26–32]. In this paper, we focus on dual-polarized CS-NFDM (DP-CS-NFDM), which has wider DOF than DS and less algorithmic complexity than FS [23,30,32]. CS modulation is implemented by nonlinear inverse synthesis (NIS) scheme, which encodes signals on the nonlinear spectrum and generates the corresponding time domain waveforms by inverse NFT (INFT). This scheme is analogous to orthogonal frequency division multiplexing (OFDM), and sinc function is used as subcarriers [16]. Except for sinc, other subcarriers such as root-raised-cosine (RRC) [33], Hermite-Gaussian (HG) [34,35] and faster than Nyquist (FTN) [36] subcarriers are reported, and show their advantages in different system conditions. Apparently, they are still an extension of linear systems, but may not be optimal options for nonlinear spectrum.

In recent years, applications of machine learning (ML) in optical communications have been widely discussed [37], as well as NFDM [38]. Various types of neural networks (NN), including artificial NN (ANN) [39–41], convolutional NN (CNN) [42–44] and recurrent NN (RNN) [45], have been used to equalize inter-carrier interference (ICI) and shorten guard interval. Nowadays, a learning model, called auto-encoder (AE), is innovatively applied in communication systems. An AE consists of an encoder and a decoder, and aims to recover input data, which resembles a communication system [46]. In [47], AE is firstly utilized in DS-NFDM to optimize the imaginary part of eigenvalues, radii of the constellations and phase difference between two solitons. This inspires the idea of optimizing subcarriers in CS-NFDM by AE.

In this paper, we present an AE assisted subcarrier optimization scheme for DP-CS-NFDM. We introduce an inter-polarized matrix at the transmitter side and jointly optimize the encoding and decoding matrices to make the received signal converge to the transmitted signal. This leads to a set of optimal sinc-like subcarriers naturally. We numerically verify the superiority of our scheme over standard sinc with linear minimum mean square error (LMMSE) equalization. We also analyze the features of the optimal subcarrier that account for the additional performance gain. Furthermore, we validate the generalizations of the optimized subcarriers to different modulation formats, transmission distances and bandwidths. Our proposed optimization scheme exhibits high robustness to variations in system parameters and offers a new perspective for designing subcarriers in NFDM.

The rest of this paper is organized as follows. Section 2 gives a brief introduction to the AE model. Section 3 first reviews the basis of NFT theory, and then explains the fundamental principles of NFDM and the AE assisted optimization scheme. Section 4 presents numerical verifications and further discussions. Section 5 concludes the work.

2. Brief review of AE

As is well known, a general communication system consists of five parts: information source, transmitter, channel, receiver and destination. In a coherent end-to-end optical communication system, the transmitter (Tx), the receiver (Rx) and their digital signal processing (DSP) work together to recover the signal at the receiver side with low error rate. Specially for NFDM, TxDSP encodes the signal on the nonlinear spectrum and generates a fiber-friendly time domain waveform by INFT. The fiber channel acts as an impairments source that degrades the transmission performance. In RxDSP, NFT decodes the nonlinear spectrum from the received waveform and then compensates for the channel response to recover the transmitted signal. The above process matches well with an AE.

AE is a special type of unsupervised learning model that uses the input data itself as supervision, forcing the output equal to the input [48]. In this way, the model is guided to establish a mapping relationship, and a reconstructed output of input is obtained. In [46], AE is first applied to the design of radio communication systems, also known as end-to-end optimization. Figure 1 illustrates a typical AE model. The encoder ${f(} \cdot {)}$ and decoder ${g(} \cdot {)}$ are trainable functions, while the channel response ${h(} \cdot {)}$ is the key component in optical communications, which is an untrainable function in the AE model. Therefore, an accurate channel model is the prerequisite of end-to-end optimization. For the DP-NFDM we studied in this paper, the numerical simulation channel model is the split step Fourier method (SSFM) base on the Manakov equation. The goal of AE is to jointly optimize ${f(} \cdot {)}$ and ${g(} \cdot {)}$ so that the output ${\mathbf{X}_{\mathbf{r}}}$ is close to the input $\mathbf{X}_{\mathbf{t}}$, and thus obtain the optimal channel-adapted encoder and decoder.

Fig. 1. A typical AE model for a communication system.

Download Full Size | PDF

Now, back to our concerned CS-NFDM, it encodes signals on the continuous spectrum by NIS. This process refers to OFDM, and the subcarriers used in linear systems are directly applied, which may not be optimal. To find an optimal set of subcarriers, we treat the NIS scheme as the encoder and decoder in an AE model. By making the received signals equal to the transmitted ones, a pair of optimal encoding and decoding subcarriers can be naturally obtained. In the next section, we will give a detailed introduction to the AE assisted subcarrier optimization scheme.

3. Basis of NFT theory and AE assisted optimized NFDM

3.1 Review of NFT theory

A DP signal transmitting along standard single mode fiber (SSMF) links follows the Manakov equation [49]

(1)$$\frac{\partial }{{\partial l}}{\mathbf Q}({\tau ,l} )={-} \frac{\alpha }{2}{\mathbf Q}({\tau ,l} )- j\frac{{{\beta _2}}}{2}\frac{{{\partial ^2}}}{{\partial {\tau ^2}}}{\mathbf Q}({\tau ,l} )+ j\frac{8}{9}\gamma {||{{\mathbf Q}({\tau ,l} )} ||^2}{\mathbf Q}({\tau ,l} )+ {N}({\tau ,l} ).$$

Here, ${\mathbf{Q}(\tau ,l)\ =\ [}{{Q}_{1}}{(\tau ,l)\;\ }{{Q}_{2}}{(\tau ,l)}{{]}^{T}}$ is the complex envelope of optical field, ${\mathbf{N}(\tau ,l)} =$ $[{{N}_{1}}{(\tau ,l)}\ {{N}_{2}}{(\tau ,l)}{{]}^{T}}$ is the noise vector, the subscripts denotes the two polarization components, ${\tau }$ and ${l}$ are retarded time and distance along fiber. ${\alpha }$, ${{\beta }_{2}}$ and ${\gamma }$ are attenuation, group velocity dispersion (GVD), and nonlinear coefficient respectively. The attenuation in Eq. (1) will destroy the integrability of Manakov equation, so the lossless path-averaged (LPA) model is used for approximation [50]. Meanwhile, normalized parameters are introduced as below

(2)$$\tilde{\gamma } = \gamma \frac{{1 - {e^{ - \alpha l}}}}{{\alpha l}},t = \frac{\tau }{{{T_n}}},{Z_n} = \frac{{2T_n^2}}{{|{{\beta_2}} |}},z = \frac{l}{{{Z_n}}},{P_n} = \frac{2}{{\frac{8}{9}\tilde{\gamma }{Z_n}}},{\mathbf q} = \frac{{{\mathbf Q}{e^{ - {{\alpha l} / 2}}}}}{{\sqrt {{P_n}} }}.$$

Omitting the noise component, the normalized form of Manakov equation is obtained

(3)$$\frac{{\partial {\mathbf q}({t,z} )}}{{\partial z}} = j\frac{{{\partial ^2}{\mathbf q}({t,z} )}}{{\partial {t^2}}} + 2j{||{{\mathbf q}({t,z} )} ||^2}{\mathbf q}({t,z} ).$$

Equation (3) is an integrable partial differential equation, its NFT is defined by solving the Zakharov–Shabat problem [51,52]

(4)$${{\mathbf v}_t} = \left[ {\begin{array}{ccc} { - j\lambda }&{{q_1}}&{{q_2}}\\ { - q_1^\ast }&{j\lambda }&0\\ { - q_2^\ast }&0&{j\lambda } \end{array}} \right]{\mathbf v}.$$

On the condition that $\mathbf{q}$ has vanishing boundary, Eq. (4) exists a pair of canonical eigenvector solutions as below

(5)$$\phi ({\lambda ,t} )= {e^{ - j\lambda t}}\left[ {\begin{array}{{c}} 1\\ 0\\ 0 \end{array}} \right],\bar{\phi }({\lambda ,t} )= {e^{j\lambda t}}\left[ {\begin{array}{{cc}} {\begin{array}{{c}} 0\\ 1\\ 0 \end{array}}&{\begin{array}{{c}} 0\\ 0\\ 1 \end{array}} \end{array}} \right],t \to - \infty$$

(6)$$\psi ({\lambda ,t} )= {e^{j\lambda t}}\left[ {\begin{array}{{cc}} {\begin{array}{{c}} 0\\ 1\\ 0 \end{array}}&{\begin{array}{{c}} 0\\ 0\\ 1 \end{array}} \end{array}} \right],\bar{\psi }({\lambda ,t} )= {e^{ - j\lambda t}}\left[ {\begin{array}{{c}} 1\\ 0\\ 0 \end{array}} \right],t \to + \infty.$$

Here, $\boldsymbol{\mathrm{\phi}} {(\lambda ,t)\ =\ [}\phi {(\lambda ,t)\;\ }\bar{\phi }{(\lambda ,t)}{]}^T$ and $\boldsymbol{\mathrm{\psi}} (\lambda ,t)\ =\ [\psi (\lambda ,t)\;\ \bar{\psi }(\lambda ,t){]}^{T}$ are 2 solution basis of the solution space. According to the inverse scattering theory, the scattering coefficients ${a(\lambda )}$ and ${\mathbf{b}(\lambda )\ =\ [b_1(\lambda )\;b_2(\lambda )}{]}^{T}$ are defined by

(7)$$\left[ {\begin{array}{ccc} {a(\lambda )}\\ {{b_1}(\lambda )}\\ {{b_2}(\lambda )} \end{array}} \right] = \mathop {\lim }\limits_{t \to + \infty } \left[ {\begin{array}{{ccc}} {{e^{j\lambda t}}}&{}&{}\\ {}&{{e^{ - j\lambda t}}}&{}\\ {}&{}&{{e^{ - j\lambda t}}} \end{array}} \right]\phi ({\lambda ,t} ).$$

In our studied CS modulation, ${\lambda } \in \mathbb{R}$, and they satisfy

(8)$${|{a(\lambda )} |^2} + {|{{b_1}(\lambda )} |^2} + {|{{b_2}(\lambda )} |^2} = 1.$$

In numerical computations, ${\lambda =\ }{{T}_{n}}{\pi f}$, where ${f}$ is the linear frequency. The process of solving the scattering coefficients is called NFT. Accordingly, given the scattering coefficients, the corresponding time domain waveform is determined, which known as INFT. In our numerical verifications, Ablowitz-ladik (AL) algorithm is used for (I)NFT operations [26]. As stated in NFT theory, the ${z}$-evolution of the scattering coefficients are written by

(9)$$\begin{aligned} a({\lambda ,z} )&= a({\lambda ,0} )\\ {\mathbf b}({\lambda ,z} )&= {\mathbf b}({\lambda ,0} ){e^{4j{\lambda ^2}z}} \end{aligned}.$$

As can be inferred from Eq. (9), the ${z}$-evolution of ${\mathbf{b}(\lambda )}$ manifests as a phase rotation, which is the theoretical basis of nonlinear free transmission in NFDM.

3.2 AE assisted optimized NFDM system

Figure 2(a) illustrates the setup of DP-NFDM, which consists of three stages. First, in TxDSP stage, signals are encoded on the ${\mathbf{b}(\lambda )}$, and the INFT generates the corresponding electrical time domain waveform. Then, the waveform is modulated on the optical carriers by a DP in-phase and quadrature modulator (DP-IQM), and transmitted through ${{N}_{s}}$ spans of lumped amplification (LA). The received optical signals are converted to electrical waveforms by the coherent receiver. Finally, in the RxDSP stage, the NFT recovers the ${\mathbf{b}(\lambda )}$ and then compensates for the channel response. It is worth mentioning that the channel of AE model should contain the DP-IQM, fiber link and coherent receiver in an actual optical communication system as shown in Fig. 2(a). We will explain the TxDSP, RxDSP and AE assisted optimization in detail below.

Fig. 2. (a) Setup of DP-NFDM; (b) Illustration of AE assisted end-to-end optimization scheme.

Download Full Size | PDF

The flowchart of TxDSP and RxDSP can be extracted in Fig. 2(b). To begin with, after serial-to-parallel conversion and symbol mapping, the DP signals can be expressed in vector form as follows

(10)$$\begin{array}{c} {\mathbf S}_{in}^X = \left[ {\begin{array}{{cccc}} {S_{in,0}^X}&{S_{in,1}^X}& \cdots &{S_{in,N - 1}^X} \end{array}} \right]\\ {\mathbf S}_{in}^Y = \left[ {\begin{array}{{cccc}} {S_{in,0}^Y}&{S_{in,1}^Y}& \cdots &{S_{in,N - 1}^Y} \end{array}} \right] \end{array}$$

where ${S}_{{in,\; n}}^{X}$ is an ${{N}_{c}}{\ \times 1}$ vector, corresponding to ${{N}_{c}}$ subcarriers, and represents ${n}$-th burst of transmitted ${x}$-polarized (Xpol) symbols. The same as ${S}_{{in,\; n}}^{Y}$ for ${y}$-polarization (Ypol). ${N}$ is the number of bursts in each simulation. Then they are encoded on the $\mathbf{b}(\lambda )$ through certain subcarriers, where sinc is mostly used and is realized by

(11)$$\begin{array}{c} {\mathbf U}_{in}^X(\lambda )= A\sum\limits_{k = 0}^{{N_c} - 1} {\left[ {\begin{array}{{cccc}} {S_{in,0}^{X,k}}&{S_{in,1}^{X,k}}& \cdots &{S_{in,N - 1}^{X,k}} \end{array}} \right]\textrm{sinc}\left[ {\frac{{\lambda {T_0}}}{{{T_n}}} + \left( {k - \frac{{{N_c} - 1}}{2}} \right)\pi } \right]} \\ {\mathbf U}_{in}^Y(\lambda )= A\sum\limits_{k = 0}^{{N_c} - 1} {\left[ {\begin{array}{{cccc}} {S_{in,0}^{Y,k}}&{S_{in,1}^{Y,k}}& \cdots &{S_{in,N - 1}^{Y,k}} \end{array}} \right]\textrm{sinc}\left[ {\frac{{\lambda {T_0}}}{{{T_n}}} + \left( {k - \frac{{{N_c} - 1}}{2}} \right)\pi } \right]} \end{array}.$$

Here, ${k}$ denotes the index of subcarriers, ${{T}_{0}}$ is the signal duration, and ${A}$ is a power control factor. In our numerical simulation, we set ${A = 1}{{0}^{\mathrm{\Omega /10}}}$ and by adjusting $\mathrm{\Omega }$ to adjust power, in which for every 0.5 increasing in $\mathrm{\Omega }$, the actual power goes up by approximately 1 dB. Meanwhile, to improve the accuracy of (I)NFT algorithm and avoid inter- burst interference, the oversampling of ${{R}_{0}}$ times and guard interval (GI) insertion of ${H - 1}$ times ${{T}_{0}}$ should be performed, which can be implemented by sampling the sinc function, that is, ${\lambda }$ is sampled from ${\ -\ \pi B}{{R}_{0}}{{T}_{n}}{/2}$ to ${\pi B}{{R}_{0}}{{T}_{n}}{/2}$ with ${{N}_{c}}{{R}_{0}}{H}$ equally spaced points. Then Eq. (11) can be expressed in matrix form

(12)$$\left[ {\begin{array}{{c}} {{\mathbf U}_{in}^X}\\ {{\mathbf U}_{in}^Y} \end{array}} \right] = A\left[ {\begin{array}{{cc}} {\mathbf F}&{\mathbf 0}\\ {\mathbf 0}&{\mathbf F} \end{array}} \right]\left[ {\begin{array}{{c}} {{\mathbf S}_{in}^X}\\ {{\mathbf S}_{in}^Y} \end{array}} \right]$$

where $\mathbf{F}$ is a matrix shaped ${(}{{N}_{c}}{\ \times }{{R}_{0}}{\ \times H)\ \times }{{N}_{c}}$, where each column is a shifted sinc subcarrier. To reduce the truncation error and ensure the vanishing boundary, we insert a large GI beforehand, and truncate the time domain waveform afterwards [53]. Hence, for a certain system with ${{N}_{c}}$ subcarriers and ${{R}_{0}}$, the dimension of $\mathbf{F}$ does not need to change according to bandwidth or distance which mainly determine the length of GI. Next, to overcome the energy barrier of ${\mathbf{b}(\lambda )}$ and reduce the actual GI, the DP-${{\varGamma }_{b}}$ transformation [28] and pre-dispersion compensation (PDC) [54] are operated in sequence. Thus, the ${\mathbf{b}(\lambda )}$ sent for INFT is given by

(13)$$\begin{array}{c} {{\mathbf b}_1}(\lambda )= {\mathbf U}_{in}^X\sqrt {\frac{{1 - \exp ({ - {{|{{\mathbf U}_{in}^X} |}^2} - {{|{{\mathbf U}_{in}^Y} |}^2}} )}}{{{{|{{U}_{in}^X} |}^2} + {{|{{U}_{in}^Y} |}^2}}}} \exp ({ - 2j{\lambda^2}Z} )\\ {{\mathbf b}_2}(\lambda )= {\mathbf U}_{in}^Y\sqrt {\frac{{1 - \exp ({ - {{|{{\mathbf U}_{in}^X} |}^2} - {{|{{\mathbf U}_{in}^Y} |}^2}} )}}{{{{|{{U}_{in}^X} |}^2} + {{|{{U}_{in}^Y} |}^2}}}} \exp ({ - 2j{\lambda^2}Z} )\end{array}$$

where ${Z}$ represents the normalized total distance. As mentioned above, after denormalizing the q(t) generated from INFT according to Eq. (2), we truncate the duration to the required length. Due to the PDC operation, the truncated time duration ${T}$ should satisfy [54]

(14)$$ T=T_0+T_{G I} \geq T_0+\pi B\left|\beta_2\right| L .$$

Here, ${B}$ is the signal bandwidth and ${L}$ is the transmission distance. For conveniently evaluating the GI duration, it often defines ${\eta =\ T/}{{T}_{0}}$. The bandpass filters (BPF) in TxDSP and RxDSP are used to limit the bandwidth in the training stage of AE, whose application will be discussed in the next section. The RxDSP is generally the inverse process of TxDSP. It is worth mentioning that the de-truncation is performed by adding zeros on both sides, making the result length equal to the original. Moreover, the spectrum decoding is executed by a matched filter, which can be written as

(15)$$\left[ {\begin{array}{{c}} {{\mathbf S}_{out}^X}\\ {{\mathbf S}_{out}^Y} \end{array}} \right] = \frac{1}{A}\left[ {\begin{array}{{cc}} {{{\mathbf F}^\dagger }}&{\mathbf 0}\\ {\mathbf 0}&{{{\mathbf F}^\dagger}} \end{array}} \right]\left[ {\begin{array}{{c}} {{\mathbf U}_{out}^X}\\ {{\mathbf U}_{out}^Y} \end{array}} \right]$$

where ${\dagger}$ represents Hermitian conjugate. As shown in Eq. (15), DP-NFDM uses sinc subcarriers for spectrum encoding and decoding independently for each polarization. Nevertheless, it might not be the best choice. First, the DP-${{\varGamma }_{b}}$ will impair the localizability of the time domain waveform, and produce tails on both sides [33]. But the truncation range is still selected based on the rectangular waveform, resulting in unwanted information loss. Moreover, [55] reports that even without polarization mode dispersion (PMD) and other polarization effects, the amplifier spontaneous emission (ASE) and processing noise will cause ICI within and between polarizations. Therefore, we extend the spectrum encoding process by introducing an inter- polarized component, replacing Eq. (12) with

(16)$$\left[ {\begin{array}{{c}} {{\mathbf U}_{in}^X}\\ {{\mathbf U}_{in}^Y} \end{array}} \right] = A\left[ {\begin{array}{{cc}} {{{\mathbf F}_{xx}}}&{{{\mathbf F}_{xy}}}\\ {{{\mathbf F}_{yx}}}&{{{\mathbf F}_{yy}}} \end{array}} \right]\left[ {\begin{array}{{c}} {{\mathbf S}_{in}^X}\\ {{\mathbf S}_{in}^Y} \end{array}} \right].$$

Accordingly, the spectrum decoding Eq. (15) in RxDSP is extended to

(17)$$\left[ {\begin{array}{{c}} {\mathbf{S}_{out}^X}\\ {{\mathbf S}_{out}^Y} \end{array}} \right] = \frac{1}{A}\left[ {\begin{array}{{cc}} {{\mathbf G}_{xx}^\dagger}&{{\mathbf G}_{yx}^\dagger}\\ {{\mathbf G}_{xy}^\dagger}&{{\mathbf G}_{yy}^\dagger} \end{array}} \right]\left[ {\begin{array}{{c}} {{\mathbf U}_{out}^X}\\ {{\mathbf U}_{out}^Y} \end{array}} \right].$$

Conveniently, we redefine the identifiers as

(18)$${\mathbf F} = \left[ {\begin{array}{{cc}} {{{\mathbf F}_{xx}}}&{{{\mathbf F}_{xy}}}\\ {{{\mathbf F}_{yx}}}&{{{\mathbf F}_{yy}}} \end{array}} \right],{\mathbf G} = \left[ {\begin{array}{{cc}} {{{\mathbf G}_{xx}}}&{{{\mathbf G}_{xy}}}\\ {{{\mathbf G}_{yx}}}&{{{\mathbf G}_{yy}}} \end{array}} \right].$$

It should be noted that the decoding matrix $\mathbf{G}$ is not equal to the encoding matrix $\mathbf{F}$. Otherwise, ICI will occur inevitably. Since linear correlation can be cancelled by a decorrelation matrix $\mathbf{W}$, the ICI can be equalized by pre-multiplying $\mathbf{W}$ after spectrum decoding. Therefore, we can get ${\mathbf{G}^{\dagger }} \approx \mathbf{W}{\mathbf{F}^{\dagger}}$. In fact, LMMSE is an effective way that determine the decorrelation matrix $\mathbf{W}$. We give a detailed discussion about this in Appendix. Now, we take Eq. (16) as an encoder, Eq. (17) as an decoder, and $\mathbf{F}$ and $\mathbf{G}$ are trainable complex valued matrices. Consequently, an AE model is formed naturally as shown in Fig. 2(b), and the training objective is to minimize the mean square error (MSE)

(19)$$Loss = \frac{1}{N}\frac{1}{{2{N_c}}}\sum\limits_n {({{{||{{\mathbf S}_{out}^X - {\mathbf S}_{in}^X} ||}^2} + {{||{{\mathbf S}_{out}^Y - {\mathbf S}_{in}^Y} ||}^2}} )}.$$

Algorithm 1. AE training procedure.

View Table | View all tables in this article

Then, when the loss function reaches convergence, a couple of optimal $\mathbf{F}$ and $\mathbf{G}$ is obtained. The above mentioned are our proposed AE assisted DP-NFDM subcarrier optimization scheme. We summarize the training procedure in Algorithm 1. As the training batches are created randomly in each iteration, AE can hardly overfit. Thus, AE has inherent strong generalization.

4. Numerical verification

4.1 System parameters and training process

The system parameters in numerical verifications are listed in Table. 1. Benefit by the automatic derivation mechanism of Pytorch 2.0, the optimization process is able to be performed. To ensure the loss function converges steadily, ${N}$ should not be too small. However, to store the gradient requires a large amount of memory resources. Due to our device limitation, the number of subcarriers ${{N}_{c}}$ is set as 64. Moreover, AE may exploit the remaining resources to improve the performance, so we must constrain the bandwidth during the training stage. Therefore, we set the 3 dB bandwidth of the BPF as 1.25 times the signal bandwidth, which guarantees a flat gain within the signal band.

Table 1. System parameters in numerical verification

View Table | View all tables in this article

During the training stage, we initialize subcarriers in $\mathbf{F}$ and $\mathbf{G}$ of intra- polarization as standard sinc, and zero the inter ones. ${N}$ is set as 300 in each iteration. We use the RMSprop optimizer with an initial learning rate ${\mu }$ of 5e-4 to update the encoder and decoder for 750 iterations in total. Besides, we pre-generated 2000 additional test bursts and evaluate the performance at the start of each iteration. We plot the training performances in Fig. 3. As can be seen in Fig. 3(a), the loss curves of training and test bursts closely match, indicating no overfitting. The tendency of Q-factor curve in Fig. 3(b) fits the loss curve, and exceeds the hard decision forward error correction (HD-FEC) threshold after about 300 iterations. Besides, in the training stage, the actual power is determined by ${A}$, amplitude of $\mathbf{S}_{{in}}^{{X/Y}}$ and the optimizing $\mathbf{F}$. Here, ${A}$ is determined by setting ${\Omega }$ as -7, which is constant during training stage, and the maximum amplitude of $\mathbf{S}_{{in}}^{{X/Y}}$ is $\sqrt {{34}} $ for 32QAM is implemented. Therefore, AE will adjust the energy of subcarriers in $\mathbf{F}$, and the actual launch power will slide to the optimal. Figure 3(b) illustrates this variation, which converges after about 400 iterations, decreasing from the original -4.78 dBm to -5.57 dBm.

Fig. 3. (a) Loss, (b) Q-factor and power as a function of iteration during training stage.

Download Full Size | PDF

4.2 Performance of optimized subcarriers

The encoding and decoding subcarriers obtained by AE are shown in Fig. 4. Due to the symmetry between two polarizations, we only display that of ${x}$-polarization. Figure 4(a)(b) show the encoding and decoding subcarriers respectively. The peaks of optimized ${\mathbf{F}_{{xx}}}$ are below 1, while the corresponding ${\mathbf{G}_{{xx}}}$ is larger, because of the high initial power in training stage, as pointed out in Fig. 5(a). This phenomenon coincides the power curve shown in Fig. 3(b). The 3 pairs of dash lines indicate the bandwidth of signal, 3 dB BPF and 3 dB OBPF from the center to the ends respectively. Apparently, the falloff at the 3 dB BPF bandwidth reflects the limited available bandwidth for spectrum encoding and decoding. As the training bursts and ASE noise are randomly created, AE will generate noise-like values outside the available bandwidth as shown in Fig. 4. To further analyze the characteristics of the optimized subcarriers, we filter $\mathbf{F}$ and $\mathbf{G}$ by a same BPF, which means that we keep the band within the green dash lines. We name the resulting subcarriers as AE-sinc subcarriers in the rest of this paper.

Fig. 4. AE optimized subcarriers: (a)(b) are the intra- polarizaed components of $\mathbf{F}$ and $\mathbf{G}$, and (c)(d) are the inter- ones. The transparent blue dashed lines are standard sinc subcarriers, while the colorful ones are optimized subcarriers. The red, green and blue dash lines presents the bandwidth of signal, BPF and OBPF respectively.

Download Full Size | PDF

Fig. 5. (a) Q-factor performance for different encoding and decoding strategies; (b) BER of different subcarriers at the optimal launch powers.

Download Full Size | PDF

We exhibit the Q-factor versus launch power for different encoding and decoding strategies in Fig. 5(a). For AE-sinc, we obtain the optimal point directly from the training stage, and vary ${A}$ while keeping $\mathbf{F}$ and $\mathbf{G}$ constant to obtain the other points, as done for the other strategies. Compared to the standard sinc and LMMSE equalization, the optimal launch power increases by 1.12 dB. Meanwhile, AE-sinc achieves a Q-factor gain of 1.54 dB and 0.62 dB over these two methods, respectively, exceeding the HD-FEC threshold. Figure 5(b) depicts the bit error rate (BER) of each subcarrier at the optimal launch powers. Due to truncation, the outer sinc subcarriers have poor BER performance, especially the right ones, which are also affected by the cumulative error of numerical (I)NFT algorithm. LMMSE equalization alleviates these effects, but the outer subcarriers still perform slightly worse than the inner ones. On the contrary, the AE-sinc shows flat BER performance across different subcarriers, and the BER is lower than LMMSE overall. In the next section, we will try to explain these phenomena.

4.3 Characteristics of subcarriers optimized by AE

To figure out how the AE-sinc leads to its performance improvements, we discuss its characteristics here. Since the out-of-band of AE-sinc have been analyzed and filtered previously, we mainly focus on the in-band and the inter- polarized components of AE-sinc. In advance, we make an assumption that the subcarriers optimized by AE are an optimal value close to the initial subcarriers under the current system parameters. This implies that the obtained AE-sinc is sinc-like. Hence, we can use the standard sinc as a reference for analysis.

In order to investigate the unflatten gain within the signal band observed in Fig. 4(a)(b), we retrain the model under three other scenarios: (1) back-to-back (BTB) without truncation; (2) BTB with truncation; (3) 960 km transmission without truncation. Figure 6(a-c) plot the obtained AE-sinc in each scenario. Conveniently, we mark the scenario in previous sections as Scenario 4, which is 960 km transmission with truncation. Upon the assumption above, the four AE-sinc subcarriers are optimal for their respective scenarios. Focusing on the Scenario 4, which is the most practical one, we notice three features: (1) lower peaks on the left and higher peaks on the right; (2) lower side lobes within signal band, but higher within 3 dB band of BPF; (3) higher central peaks than the outer ones. We can infer their benefits by comparing the unique variable involved in the other three scenarios.

Fig. 6. Obtained AE-sinc in 3 scenarios: (a) BTB w/o truncation; (b) BTB w/ truncation; (c) 960 km transmission w/o truncation. The dash lines have the same meaning as that in Fig. 4.

Download Full Size | PDF

As shown in Fig. 6(a), in Scenario 1, the left peaks of AE-sinc are slightly lower and the outer ones are a bit higher. Considering that Scenario 1 only involves the (I)NFT inaccuracy, which is transitive and proportional to high energy [56], it can be easily deduced that this energy distribution achieves a flat inaccuracy among different ${\lambda }$. Take ${\mathbf{F}_{{xx}}}$ for instance, where INFT is performed, the inverse AL algorithm operates from right to left, which is the same direction as the error propagation. Therefore, by reducing the energy of the left subcarriers, the total error of them will be reduced. Then, it leads to flat performance among subcarriers.

In Scenario 2 shown in Fig. 6(b), truncation is introduced with respect to Scenario 1. Compared with Fig. 6(a), it is obvious that the side lobes within the signal band are well suppressed. Figure 7(a) shows the time domain waveform of sinc subcarriers, where some information may be lost due to the truncation of the tails. Figure 7(b) compares the synthesized waveforms in Scenario 1 and 2, indicating that the tails of Scenario 1 are clearly higher than those of Scenario 2. It is well known that the ${\mathbf{b}(\lambda )}$ of a rectangle waveform is approximate to sinc [10], but the nonlinear transformation ${{\varGamma }_{b}}$ will magnify the side lobes of sinc, which have lower energy. That is why undesired tails are observed after ${{\varGamma }_{b}}$. Therefore, suppressing the side lobes can make the ${\mathbf{b}(\lambda )}$ more approximate to sinc after ${{\varGamma }_{b}}$ and improve the localizability of the corresponding time domain waveform.

Fig. 7. Time domain waveform of (a) sinc and (b) AE-sinc in Scenario 1 and 2; synthesized ${{b}_{1}}{(\lambda )}$ of (c) sinc and (d) AE-sinc in Scenario 3 and 4, (e)(f) are the corresponding time domain waveform respectively. The green and pink dash lines are useful and truncated time window respectively. (g) Q-factor performance of AE-sinc (Scenario 4) and sinc under different ${\eta }$.

Download Full Size | PDF

Figure 6(c) shows the AE-sinc obtained in Scenario 3, which introduces ASE noise based on Scenario 1. However, there is no significant difference from Fig. 6(a), except for the increased side lobes between the signal band and 3 dB band of BPF. It can be explained by the analytical model of noise density of ${\mathbf{b}(\lambda )}$, where the noise power on ${\mathbf{b}(\lambda )}$ has a negative correlation with the signal power [55]. As AE considers this band available, it chooses to increase the corresponding power spectral density (PSD) to counteract the noise PSD of ${\mathbf{b}(\lambda )}$. Figure 7(c)(d) show that compared with sinc, the synthesized ${\mathbf{b}(\lambda )}$ of sinc and AE-sinc in Scenario 3 and 4 have broader bands.

Now, we return to Scenario 4. The first two features mentioned above have appeared commonly in previous scenarios, but the lower peaks of outer subcarriers are unique. Considering that the truncation may cut off the broadened tails after PDC, we attribute it to different delay among subcarriers. As we know, the essence of dispersion is different propagation constant for different spectral components, which results in a time delay to the non-zero frequency subcarriers. Since the outer subcarriers are far from zero frequency, their delays are even longer. Therefore, together with the tails generated by ${{\varGamma }_{b}}$, the truncation operation will affect the outer subcarriers more than the inner ones. It leads to the BER distribution of sinc subcarriers shown in Fig. 5(b). Thus, the AE-sinc in Scenario 4 is a deliberate design choice to minimize the useful energy that is truncated. Figure 7(e)(f) show the time domain waveform of sinc and AE-sinc after PDC in Scenario 3 and 4. As can be seen, the waveform of Scenario 4 is more concentrated within the truncation window. The fluctuations outside the window can be mainly attributed to the randomness of training stage. To further demonstrates the inhibition of the tails of AE-sinc, we exhibit the Q-factor performance under different ${\eta }$ in Fig. 7(g). As can be seen, the Q-factor improvements of AE-sinc is about 0.31 dB when ${\eta }$ increases from the theoretical value (${\eta } \approx {1}{.26}$) to 2, while 0.87 dB improvements for standard sinc. On the other hand, when ${\eta }$ is smaller than the theoretical value, AE-sinc degrades significantly, as more energy of tails moves into the truncation window. It further proves that AE-sinc has shorter tails than sinc.

At last, we exhibit the correlation matrices of ${\mathbf{F}_{{xx}}}$, ${\mathbf{F}_{{xy}}}$, ${\mathbf{G}_{{xx}}}$ and ${\mathbf{G}_{{xy}}}$ in Fig. 8, to inspect the characteristics of the inter- polarized components of AE-sinc shown in Fig. 4(c)(d). As shown in Fig. 8(a)(b), AE-sinc induces weak correlation among adjacent subcarriers in both the encoder and decoder. Compared with the diagonal elements, there exists about -10 dB energy transfer. Figure 8(c)(d) exhibit the correlation of ${\mathbf{F}_{{xy}}}$ and ${\mathbf{G}_{{xy}}}$, where almost no correlation exists among adjacent subcarriers, but strong diagonal elements are observed. Focusing on the value of diagonal ones, it suggests that AE-sinc transfers about -9 dB energy between two polarizations in TxDSP.

Fig. 8. Correlation among AE-sinc subcarriers of (a) ${\mathbf{F}_{{xx}}}$, (b) ${\mathbf{G}_{{xx}}}$, (c) ${\mathbf{F}_{{xy}}}$ and (d) ${\mathbf{G}_{{xy}}}$. We take the logarithm of their magnitudes.

Download Full Size | PDF

4.4 Generalization validation

One of the major challenges for ML applications in optical communications is the generalization ability. To evaluate the generalization performance of AE-sinc trained in the system listed in Table. 1, we vary one parameter at a time to ensure the unique variable, while keeping the dimension of $\mathbf{F}$ and $\mathbf{G}$ unchanged. The parameters that validated involve modulation format, distance and bandwidth. That means although the AE-sinc is trained under the condition of 32QAM, 12${\ \times }$80 km and 16 GHz, it is utilized to perform 16/64QAM or 10${\ \times }$80 km and 14${\ \times }$80 km or 24/32 GHz transmission. In each case, GI is set as theoretical threshold given by Eq. (14). Figure 9(a) illustrates the Q-factor versus launch power for different modulation formats. AE-sinc achieves a Q-factor improvement of 0.85 dB and 0.62 dB for 16/32QAM compared with sinc equalized by LMMSE. For 64QAM, the Q-factor improvement is 0.40 dB and exceeds the soft decision FEC (SD-FEC) threshold. Then, Fig. 9(b) presents the validation results of distance, where AE-sinc obtains about 0.6 dB Q-factor gain in the validated distances and surpasses the HD-FEC threshold in the extended transmission of 1120 km. Finally, Fig. 9(c)(d) illustrate the validation of bandwidth of 24 GHz and 32 GHz respectively. It is evident that the generalization performance of AE-sinc worsens as the bandwidth deviates from the trained condition. This can be attributed to two reasons. First, the ICI is more sensitive to bandwidth and $\mathbf{F}$ and $\mathbf{G}$ no longer match well, so LMMSE brings considerable improvements for AE-sinc. Secondly, the AE-sinc is trained on the condition of 16 GHz, while the validated 24 GHz and 32 GHz have longer pulse broadening due to dispersion. Therefore, the suppression of outer subcarriers obtained in 16 GHz does not suit the validated scenario. In summary, our proposed AE assisted subcarrier optimization scheme exhibits excellent generalization for modulation format and distance, while that for bandwidth near the training condition.

Fig. 9. Generalization validation of (a) modulation format, (b) transmission distance, and (c)(d) bandwidth.

Download Full Size | PDF

5. Conclusion

In this paper, we propose and numerically verify an AE assisted subcarrier optimization scheme for DP-NFDM. Taking into account the inter-polarization correlation in DP-NFDM, we introduce an inter-polarized component to the spectrum encoding and decoding process. The resulting AE-sinc subcarriers surpass HD-FEC threshold in a 960 km transmission system and outperform sinc by 1.54 dB in Q-factor. Furthermore, by analyzing the characteristics of AE-sinc, we observe that AE tends to suppress side lobes and outer subcarriers to mitigate the effects of truncation and balance the transitive error of numerical (I)NFT by adjust the energy higher at right. In addition, the generalization validation shows that AE-sinc obtained in the training stage has excellent robustness to modulation format and transmission distance, while to bandwidth near the training condition. Our work provides a new perspective for subcarrier design in NFDM in the future.

Appendix: LMMSE for DP-NFDM

As we know, the ASE and processing noise cause correlation among the nonlinear spectrum components which should be originally orthogonal, leading to ICI. This phenomenon is effectively reduced by applying ${b}$-modulation in SP-NFDM. As derived in [57], under the condition of long-haul and wide band transmission, the approximate statistic model of ASE noise is

(20)$$\begin{array}{rl} {b_{out}}(\lambda )&= {b_{in}}(\lambda )+ {{\cal N}}(\lambda )\\ E[{{{\cal N}}(\lambda ){{{\cal N}}^\ast }({\lambda^{\prime}} )} ]&= 2\pi DZ({1 - {{|{{b_{in}}(\lambda )} |}^2}} )\delta ({\lambda - \lambda^{\prime}} )\\ E[{{{\cal N}}(\lambda ){{\cal N}}({\lambda^{\prime}} )} ]&= 0 \end{array}.$$

Here, ${{b}_{{in}}}{(\lambda )}$ and ${{b}_{{out}}}{(\lambda )}$ are the transmitted and received ${b}$ coefficients respectively, and ${D}$ is normalized ASE noise PSD. Equation (20) manifests that there is almost no correlation among the nonlinear spectrum components. However, it is not accurate for narrow bandwidth systems, and ICI arises. Therefore, [58] proposed the LMMSE equalization for narrow bandwidth SP-NFDM. Recently, [55] reported the noise model for DP-NFDM, which is given by

(21)$$ \begin{aligned} \mathbf{b}_{\text {out }}(\lambda) & =\mathbf{b}_{i m}(\lambda)+\mathcal{N}(\lambda) \\ E\left[\mathcal{N}(\lambda) \mathcal{N}^{\dagger}\left(\lambda^{\prime}\right)\right] & =2 \pi D Z\left[\begin{array}{cc} 1-\left|b_{i n, 1}(\lambda)\right|^2-\frac{\left|b_{i n, 2}(\lambda)\right|^2}{2} & -\frac{b_{i n, 1}(\lambda) b_{i n, 2}^*(\lambda)}{2} \\ -\frac{b_{i n, 2}(\lambda) b_{i n, 1}^*(\lambda)}{2} & 1-\left|b_{i n, 2}(\lambda)\right|^2-\frac{\left|b_{i n, 1}(\lambda)\right|^2}{2} \end{array} \delta\left(\lambda-\lambda^{\prime}\right)\right. \\ E\left[\mathcal{N}(\lambda) \mathcal{N}^T\left(\lambda^{\prime}\right)\right] & =0 \end{aligned} .$$

As can be seen, even without polarization effects, there exists correlation between inter- polarizations in long-haul and wide band transmissions. It suggests that LMMSE can be utilized for decorrelation. In our work, the LMMSE is performed after spectrum decoding, where the decoded symbols are represented by

(22)$${{\mathbf S}_{out}} = \left[ {\begin{array}{{c}} {{\mathbf S}_{out}^X}\\ {{\mathbf S}_{out}^Y} \end{array}} \right].$$

The corresponding transmitted symbols are

(23)$${{\mathbf S}_{in}} = \left[ {\begin{array}{{c}} {{\mathbf S}_{in}^X}\\ {{\mathbf S}_{in}^Y} \end{array}} \right].$$

We treat ${\mathbf{S}_{{in}}}$ and ${\mathbf{S}_{{out}}}$ as the expected and measured values of LMMSE. Thus, the equalization matrix $\mathbf{W}$ and the bias vector ${h}$ are calculated by

(24)$$ \begin{aligned} C_{\text {in}, \text {out}} & =\left(\mathbf{S}_{\text {in}}-\overline{\mathbf{S}}_{\text {in}}\right)\left(\mathbf{S}_{\text {out}}-\overline{\mathbf{S}}_{\text {out}}\right)^{\dagger} \\ C_{\text {out}, \text {out}} & =\left(\mathbf{S}_{\text {out}}-\overline{\mathbf{S}}_{\text {out}}\right)\left(\mathbf{S}_{\text {out}}-\overline{\mathbf{S}}_{\text {out}}\right)^{\dagger} \\ \mathbf{W} & =C_{\text {in,out}} C_{\text {out}, \text {out }}^{-1} \\ h & =\overline{\mathbf{S}}_{\text {in }}-\mathbf{W} \overline{\mathbf{S}}_{\text {out }} \end{aligned} .$$

Here, the bars over the characters denote the expectation. Consequently, the equalized signal vectors are given by

(25)$$ \mathbf{S}_{L M M S E}=\mathbf{W} \mathbf{S}_{out }+h $$

Its effectiveness for wide band transmission system has been demonstrated in previous numerical verifications (Fig. 5 and Fig. 9), where 6000 bursts are used to estimate the $\mathbf{W}$, and 4000 bursts for test. Furthermore, according to [55], the ASE noise of ${\mathbf{b}(\lambda )}$ is modeled as zero mean complex Gaussian. Therefore, the bias vector ${h}$ is almost negligible. As a result, the spectrum decoding in Eq. (15) can be jointly represented by

(26)$${{\mathbf S}_{LMMSE}} \approx {\mathbf W}\left[ {\begin{array}{{cc}} {{{\mathbf F}^\dagger}}&{\mathbf 0}\\ {\mathbf 0}&{{{\mathbf F}^\dagger}} \end{array}} \right]\left[ {\begin{array}{{c}} {{\mathbf U}_{out}^X}\\ {{\mathbf U}_{out}^Y} \end{array}} \right] = {\mathbf G}\left[ {\begin{array}{{c}} {{\mathbf U}_{out}^X}\\ {{\mathbf U}_{out}^Y} \end{array}} \right].$$

As stated by the minimal MSE principle, $\mathbf{G}$ is the optimal linear decoder. Thus, $\mathbf{F}$ and $\mathbf{G}$ form a pair of encoder and decoder subcarriers. Nevertheless, when we jointly perform end-to-end optimization, we will get a pair of $\mathbf{F}$ and $\mathbf{G}$ at the same time without calculating $\mathbf{W}$. Therefore, only by comparing AE-sinc with LMMSE, the advantages of the optimized encoder can be highlighted.

Funding

National Natural Science Foundation of China (62171048); Fundamental Research Funds for the Central Universities (2020XD-A05-3).

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Reference

1. A. D. Ellis, M. E. McCarthy, M. A. Z. Al Khateeb, M. Sorokina, and N. J. Doran, “Performance limits in optical communications due to fiber nonlinearity,” Adv. Opt. Photon. 9(3), 429–503 (2017). [CrossRef]

2. X. Liu, “Evolution of Fiber-Optic Transmission and Networking toward the 5 G Era,” iScience 22, 489–506 (2019). [CrossRef]

3. E. Ip and J. M. Kahn, “Compensation of Dispersion and Nonlinear Impairments Using Digital Backpropagation,” J. Lightwave Technol. 26(20), 3416–3425 (2008). [CrossRef]

4. E. Ip, “Nonlinear Compensation Using Backpropagation for Polarization-Multiplexed Transmission,” J. Lightwave Technol. 28(6), 939–951 (2010). [CrossRef]

5. Z. Tao, L. Dou, W. Yan, L. Li, T. Hoshida, and J. C. Rasmussen, “Multiplier-Free Intrachannel Nonlinearity Compensating Algorithm Operating at Symbol Rate,” J. Lightwave Technol. 29(17), 2570–2576 (2011). [CrossRef]

6. X. Liu, A. Chraplyvy, P. Winzer, R. Tkach, and S. Chandrasekhar, “Phase-conjugated twin waves for communication beyond the Kerr nonlinearity limit,” Nat. Photonics 7(7), 560–568 (2013). [CrossRef]

7. K. V. Peddanarappagari and M. Brandt-Pearce, “Volterra series transfer function of single-mode fibers,” J. Lightwave Technol. 15(12), 2232–2241 (1997). [CrossRef]

8. A. Vannucci, P. Serena, and A. Bononi, “The RP Method: A New Tool for the Iterative Solution of the Nonlinear Schrödinger Equation,” J. Lightwave Technol. 20(7), 1102–1112 (2002). [CrossRef]

9. S. K. Turitsyn, J. E. Prilepsky, S. T. Le, S. Wahls, L. L. Frumin, M. Kamalian, and S. A. Derevyanko, “Nonlinear Fourier transform for optical data processing and transmission: advances and perspectives,” Optica 4(3), 307–322 (2017). [CrossRef]

10. M. I. Yousefi and F. R. Kschischang, “Information Transmission Using the Nonlinear Fourier Transform, Part I-III,” IEEE Trans. Inform. Theory 60(7), 4312–4328 (2014). [CrossRef]

11. Z. Dong, S. Hari, T. Gui, K. Zhong, M. I. Yousefi, C. Lu, P.-K. A. Wai, F. R. Kschischang, and A. P. T. Lau, “Nonlinear Frequency Division Multiplexed Transmissions Based on NFT,” IEEE Photon. Technol. Lett. 27(15), 1621–1623 (2015). [CrossRef]

12. T. Gui, T. H. Chan, C. Lu, A. P. T. Lau, and P.-K. A. Wai, “Alternative Decoding Methods for Optical Communications Based on Nonlinear Fourier Transform,” J. Lightwave Technol. 35(9), 1542–1550 (2017). [CrossRef]

13. T. Gui, C. Lu, A. P. T. Lau, and P. K. A. Wai, “High-order modulation on a single discrete eigenvalue for optical communications based on nonlinear Fourier transform,” Opt. Express 25(17), 20286–20297 (2017). [CrossRef]

14. G. Zhou, T. Gui, C. Lu, A. P. T. Lau, and P.-K. A. Wai, “Improving Soliton Transmission Systems Through Soliton Interactions,” J. Lightwave Technol. 38(14), 3563–3572 (2020). [CrossRef]

15. G. Zhou, L. Sun, C. Lu, and A. P. T. Lau, “Multi-symbol digital signal processing techniques for discrete eigenvalue transmissions based on nonlinear Fourier Transform,” J. Lightwave Technol. 39(17), 5459–5467 (2021). [CrossRef]

16. J. E. Prilepsky, S. A. Derevyanko, K. J. Blow, I. Gabitov, and S. K. Turitsyn, “Nonlinear inverse synthesis and eigenvalue division multiplexing in optical fiber channels,” Phys. Rev. Lett. 113(1), 013901 (2014). [CrossRef]

17. S. T. Le, I. D. Philips, J. E. Prilepsky, P. Harper, N. J. Doran, A. D. Ellis, and S. K. Turitsyn, “First experimental demonstration of nonlinear inverse synthesis transmission over transoceanic distances,” in 2016 Optical Fiber Communications Conference and Exhibition (OFC) (2016), pp. Tu2A.1.

18. S. T. Le and H. Buelow, “64 × 0.5 Gbaud Nonlinear Frequency Division Multiplexed Transmissions With High Order Modulation Formats,” J. Lightwave Technol. 35(17), 3692–3698 (2017). [CrossRef]

19. I. Tavakkolnia and M. Safari, “Signaling on the continuous spectrum of nonlinear optical fiber,” Opt. Express 25(16), 18685–18702 (2017). [CrossRef]

20. S. T. Le, K. Schuh, F. Buchali, and H. Buelow, “100 Gbps b-modulated Nonlinear Frequency Division Multiplexed Transmission,” in 2018 Optical Fiber Communications Conference and Exhibition (OFC) (2018), pp. W1G.6.

21. I. Tavakkolnia and M. Safari, “Signalling over nonlinear fibre-optic channels by utilizing both solitonic and radiative spectra,” in 2015 European Conference on Networks and Communications (EuCNC) (2015), pp. 103–107.

22. S. T. Le, V. Aref, and H. Buelow, “Nonlinear signal multiplexing for communication beyond the Kerr nonlinearity limit,” Nat. Photonics 11(9), 570–576 (2017). [CrossRef]

23. V. Aref, S. T. Le, and H. Buelow, “Modulation Over Nonlinear Fourier Spectrum: Continuous and Discrete Spectrum,” J. Lightwave Technol. 36(6), 1289–1295 (2018). [CrossRef]

24. J. Wei, L. Xi, X. Zhang, J. Deng, R. Zhang, S. Du, W. Zhang, and X. Zhang, “Improvement for a full-spectrum modulated nonlinear frequency division multiplexing transmission system,” Opt. Express 30(17), 31195–31208 (2022). [CrossRef]

25. R. Zhang, L. Xi, J. Wei, J. Deng, S. Du, W. Zhang, X. Zhang, and X. Xiao, “Optimal Design of Eigenvalues for the Full-Spectrum Modulated Nonlinear Frequency Division Multiplexing Transmission System,” IEEE Photonics J. 15(3), 1–8 (2023). [CrossRef]

26. J. W. Goossens, M. I. Yousefi, Y. Jaouen, and H. Hafermann, “Polarization-division multiplexing based on the nonlinear Fourier transform,” Opt. Express 25(22), 26437–26452 (2017). [CrossRef]

27. W. A. Gemechu, T. Gui, J.-W. Goossens, M. Song, S. Wabnitz, H. Hafermann, A. P. T. Lau, M. I. Yousefi, and Y. Jaouen, “Dual Polarization Nonlinear Frequency Division Multiplexing Transmission,” IEEE Photon. Technol. Lett. 30(18), 1589–1592 (2018). [CrossRef]

28. X. Yangzhang, V. Aref, S. Le, H. Buelow, D. Lavery, and P. Bayvel, “Dual-Polarization Non-Linear Frequency-Division Multiplexed Transmission With b-Modulation,” J. Lightwave Technol. 37(6), 1570–1578 (2019). [CrossRef]

29. X. Yangzhang, S. T. Le, V. Aref, H. Buelow, D. Lavery, and P. Bayvel, “Experimental Demonstration of Dual-Polarization NFDM Transmission With b-Modulation,” IEEE Photon. Technol. Lett. 31(11), 885–888 (2019). [CrossRef]

30. F. D. Ros, S. Civelli, S. Gaiarin, E. P. D. Silva, N. D. Renzis, M. Secondini, and D. Zibar, “Dual-Polarization NFDM Transmission With Continuous and Discrete Spectral Modulation,” J. Lightwave Technol. 37(10), 2335–2343 (2019). [CrossRef]

31. X. Zhang, L. Xi, J. Wei, S. Du, W. Zhang, J. Li, and X. Zhang, “Nonlinear frequency domain PMD modeling and equalization for nonlinear frequency division multiplexing transmission,” Opt. Express 29(18), 28190–28201 (2021). [CrossRef]

32. X. Zhang, S. Du, L. Xi, J. Wei, J. Deng, R. Zhang, M. Hu, W. Zhang, and X. Zhang, “Full-Spectrum INFT Algorithm for Dual-Polarization NFDM Transmission,” J. Lightwave Technol. 41(10), 3017–3025 (2023). [CrossRef]

33. S. Wahls, “Generation of Time-Limited Signals in the Nonlinear Fourier Domain via b-Modulation,” in 2017 European Conference on Optical Communication (ECOC) (2017), pp. 1–3.

34. M. Balogun and S. Derevyanko, “Hermite-Gaussian Nonlinear Spectral Carriers for Optical Communication Systems Employing the Nonlinear Fourier Transform,” IEEE Commun. Lett. 26(1), 109–112 (2022). [CrossRef]

35. M. Balogun and S. Derevyanko, “Enhancing the Spectral Efficiency of Nonlinear Frequency Division Multiplexing Systems via Hermite-Gaussian Subcarriers,” J. Lightwave Technol. 40(18), 6071–6077 (2022). [CrossRef]

36. X. Zhang, P. Sun, L. Xi, Z. Zheng, S. Du, J. Wei, Y. Wu, and X. Zhang, “Nonlinear-frequency-packing nonlinear frequency division multiplexing transmission,” Opt. Express 28(10), 15360–15375 (2020). [CrossRef]

37. J. W. Nevin, S. Nallaperuma, N. A. Shevchenko, X. Li, M. S. Faruk, and S. J. Savory, “Machine Learning for Optical Fiber Communication Systems: An Introduction and Overview,” APL Photonics 6(12), 121101 (2021). [CrossRef]

38. L. Xi, J. Wei, and W. Zhang, “Applications of Machine Learning on Nonlinear Frequency Division Multiplexing Optic-Fiber Communication Systems,” in 2021 IEEE 9th International Conference on Information, Communication and Networks (ICICN) (2021), pp. 190–194.

39. O. Kotlyar, M. Pankratova, M. Kamalian-Kopae, A. Vasylchenkova, J. E. Prilepsky, and S. K. Turitsyn, “Combining nonlinear Fourier transform and neural network-based processing in optical communications,” Opt. Lett. 45(13), 3462–3465 (2020). [CrossRef]

40. J. Wei, L. Xi, X. Zhang, J. Deng, S. Du, X. Zhang, W. Zhang, and X. Xiao, “Probabilistic shaping and neural network-based optimization for a nonlinear frequency division multiplexing system,” Opt. Lett. 46(15), 3697–3700 (2021). [CrossRef]

41. X. Chen, X. Fang, and F. Zhang, “100 Gbaud PDM 16QAM NFDM transmission with neural network-based equalization,” Opt. Fiber Technol. 78, 103329 (2023). [CrossRef]

42. E. V. Sedov, P. J. Freire, V. V. Seredin, V. A. Kolbasin, M. Kamalian-Kopae, I. S. Chekhovskoy, S. K. Turitsyn, and J. E. Prilepsky, “Neural networks for computing and denoising the continuous nonlinear Fourier spectrum in focusing nonlinear Schrödinger equation,” Sci. Rep. 11(1), 22857 (2021). [CrossRef]

43. W. Q. Zhang, T. H. Chan, and S. Afshar, “Direct decoding of nonlinear OFDM-QAM signals using convolutional neural network,” Opt. Express 29(8), 11591–11604 (2021). [CrossRef]

44. W. Q. Zhang, T. H. Chan, and S. A. Vahid, “Serial and parallel convolutional neural network schemes for NFDM signals,” Sci. Rep. 12(1), 7962 (2022). [CrossRef]

45. O. Kotlyar, M. Kamalian-Kopae, M. Pankratova, A. Vasylchenkova, J. E. Prilepsky, and S. K. Turitsyn, “Convolutional long short-term memory neural network equalizer for nonlinear Fourier transform-based optical transmission systems,” Opt. Express 29(7), 11254–11267 (2021). [CrossRef]

46. T. O’Shea and J. Hoydis, “An Introduction to Deep Learning for the Physical Layer,” IEEE Trans. Cogn. Commun. Netw. 3(4), 563–575 (2017). [CrossRef]

47. S. Gaiarin, F. D. Ros, R. T. Jones, and D. Zibar, “End-to-End Optimization of Coherent Optical Communications Over the Split-Step Fourier Method Guided by the Nonlinear Fourier Transform Theory,” J. Lightwave Technol. 39(2), 418–428 (2021). [CrossRef]

48. U. Michelucci, “An Introduction to Autoencoders,” arXiv, arXiv:2201.03898 (2022). [CrossRef]

49. G. P. Agrawal, Nonlinear Fiber Optics (Fifth Edition) (Academic, 2013).

50. S. T. Le, J. E. Prilepsky, and S. K. Turitsyn, “Nonlinear inverse synthesis technique for optical links with lumped amplification,” Opt. Express 23(7), 8317–8328 (2015). [CrossRef]

51. S. Civelli, Nonlinear frequency-division multiplexing: theoretical aspects, numerical algorithms, and experimental demonstration, Ph.D. thesis (Sant'Anna School of Advanced Studies, 2019).

52. X. Yangzhang, Study of Non-linear Optical Fibre Communications, Ph.D. thesis (University College London, 2020).

53. J. Wei, L. Xi, X. Zhang, S. Du, W. Zhang, and X. Zhang, “Design and Optimization of the Parameters in the Waveform Shaping Stage of the Transmitter in the NFDM System,” in 26th Optoelectronics and Communications Conference (2021), pp. W4B.8.

54. S. T. Le, V. Aref, and H. Buelow, “High Speed Precompensated Nonlinear Frequency-Division Multiplexed Transmissions,” J. Lightwave Technol. 36(6), 1296–1303 (2018). [CrossRef]

55. S. A. Derevyanko and J. E. Prilepsky, “Channel model for the dual-polarization b-modulated nonlinear frequency-division multiplexing optical transmission systems,” Opt. Express 31(12), 19686–19702 (2023). [CrossRef]

56. S. Wahls, S. Chimmalgi, and P. J. Prins, “Wiener-Hopf Method for b-Modulation,” in 2019 Optical Fiber Communications Conference and Exhibition (OFC) (2019), pp. W2A.50.

57. S. Derevyanko, M. Balogun, O. Aluf, D. Shepelsky, and J. E. Prilepsky, “Channel model and the achievable information rates of the optical nonlinear frequency division-multiplexed systems employing continuous b-modulation,” Opt. Express 29(5), 6384–6406 (2021). [CrossRef]

58. Q. Zhang and F. R. Kschischang, “Correlation-Aided Nonlinear Spectrum Detection,” J. Lightwave Technol. 39(15), 4923–4931 (2021). [CrossRef]

Parameters		Value	Parameters		Value
Central frequency $λ_{c}$		1550.12 nm	Normalized time $T_{n}$		1 ns
Bandwidth $B$		16 GHz	Number of spans $N_{s}$		12
Subcarrier quantity $N_{c}$		64	Span length $L_{s}$		80 km
Modulation format		32QAM	GVD $β_{2}$		-21.54 ps²/km
Oversampling rate $R_{0}$		4	Attenuation $α$		0.2 dB/km
$H$		4	Nonlinear coefficient $γ$		1.3 /W/km
$η$		1.26	Noise Figure (NF) of EDFA		5 dB
BPF	Type	Gaussian	Optical BPF (OBPF)	Type	Gaussian
	Order	10		Order	10
	3 dB bandwidth	1.25 $B$		3 dB bandwidth	1.75 $B$

Parameters		Value	Parameters		Value
Central frequency $λ_{c}$		1550.12 nm	Normalized time $T_{n}$		1 ns
Bandwidth $B$		16 GHz	Number of spans $N_{s}$		12
Subcarrier quantity $N_{c}$		64	Span length $L_{s}$		80 km
Modulation format		32QAM	GVD $β_{2}$		-21.54 ps²/km
Oversampling rate $R_{0}$		4	Attenuation $α$		0.2 dB/km
$H$		4	Nonlinear coefficient $γ$		1.3 /W/km
$η$		1.26	Noise Figure (NF) of EDFA		5 dB
BPF	Type	Gaussian	Optical BPF (OBPF)	Type	Gaussian
	Order	10		Order	10
	3 dB bandwidth	1.25 $B$		3 dB bandwidth	1.75 $B$

Autoencoder assisted subcarrier optimization for nonlinear frequency division multiplexing

Abstract

1. Introduction

2. Brief review of AE

3. Basis of NFT theory and AE assisted optimized NFDM

3.1 Review of NFT theory

3.2 AE assisted optimized NFDM system

4. Numerical verification

4.1 System parameters and training process

4.2 Performance of optimized subcarriers

4.3 Characteristics of subcarriers optimized by AE

4.4 Generalization validation

5. Conclusion

Appendix: LMMSE for DP-NFDM

Funding

Disclosures

Data availability

Reference

Data availability

Cited By

Figures (9)

Tables (2)

Equations (26)

Optics Express