End-to-end learning strategy based on a frequency domain feature decoupling network emulator with joint probabilistic shaping and equalization for a 300-Gbit/s OAM mode division multiplexing transmission

Qi Xu; Ran Gao; Zhaohui Cheng; Fei Wang; Yi Cui; Yi Cui; Yi Cui; Fuling Yang; Fuling Yang; Fuling Yang; Zhipei Li; Huan Chang; Jie Liu; Dong Guo; Lei Zhu; Xiaolong Pan; Qi Zhang; Qi Zhang; Qi Zhang; Qinghua Tian; Qinghua Tian; Qinghua Tian; Xin Huang; Jinghao Yan; Lin Jiang; Xiangjun Xin

doi:10.1364/OE.519842

1. Introduction

The increasing popularity of AI-centric applications such as ChatGPT has imposed high-capacity requirements on IM/DD transmission for short-reach optical interconnections [1,2]. As there is a need to break through the Shannon limit on the capacity of a single-mode optical fiber (SMF), orbital angular momentum (OAM) mode division multiplexing (MDM) has been developed as an innovative multiplexing technology for enhancing capacity [3–10]. OAM-MDM IM/DD transmission has achieved ultra-high-capacity transmission over different distances by leveraging a ring-core fiber (RCF) with weak inter-MG coupling; for example, researchers have used a 20-Gbaud 3D CAP-64 architecture for OAM-MDM transmission over 2.3 km [3], a three-mode multiplexed PAM8 transmission for IM/DD OAM-MDM over 2.6 km [4], and a 4.32 Tbit/s PAM8 transmission for IM/DD OAM-MDM over 2 km [5].

However, in short-reach IM/DD OAM-MDM systems, significant nonlinear impairments arise from the optoelectronic devices used, such as Mach-Zehnder modulators (MZMs), spatial light modulators (SLMs) and photodiodes (PDs). Furthermore, the nonlinear distortion of each of the different OAM modes affects the other modes, due to the mode coupling in OAM-MDM transmission. Hence, OAM-MDM shows strong complex nonlinear impairment, and this is particularly prominent for multiple OAM mode transmission systems. For conventional block-structured OAM-MDM systems, AI-based compensation methods have achieved remarkable results following the optimization of independent blocks, such as probabilistic shaping (PS) [11], linear equalization [12] and nonlinear equalization [3–5]. However, although the methods described above can effectively improve the performance of OAM-MDM systems, it is still uncertain whether individually optimized blocks can lead to global optimization of an IM/DD OAM-MDM system due to the presence of mode coupling and complex nonlinear impairment [13]. Hence, the development of optimal modulation and equalization schemes for OAM-MDM systems remains a persistent challenge.

End-to-end (E2E) learning is an emerging technology based on an autoencoder (AE) that provides intelligent E2E optimization according to the impairment of a communication system [14–18]. An E2E strategy can achieve global optimization of joint modulation and equalization, and can compensate for signal impairment by treating the entire communication system as an AE, which offers great performance improvements for optical communications. Nan Chi et al. proposed a novel joint geometric shaping (GS) and post-equalization scheme for a fiber-THz integrated communication system [17]. Jianyang Shi et al. presented an E2E learning strategy to achieve joint pre-equalization and post-equalization for a visible light communication system [18]. Accurate estimation of the OAM-MDM channel is crucial for E2E PS or equalization. In conventional optical communication systems, deep learning strategies are usually adopted to provide accurate channel models, such as conditional generative adversarial networks (CGANs) [19,20], heterogeneous neural network networks [21,22] or a three-subnetwork architecture [23]. However, the channel model of the input signal exhibits different characteristics at different frequencies, due to the complex nonlinear impairments to OAM-MDM transmission. For example, the nonlinear impairment caused by optoelectronic devices is mainly seen at high frequencies [24], while the linear impairment caused by mode coupling is mainly observed at low frequency [25]. Channel modelling strategies based on conventional machine learning always build a mixed model of the transmission channel for different frequencies, resulting an inaccurate channel model for an OAM-MDM IM/DD system.

In this paper, we propose a scheme based on a frequency domain feature decoupling network (FDFDnet) and E2E learning for joint PS and equalization of an OAM-MDM IM/DD system. Our FDFDnet-based E2E learning scheme can accurately build a complex nonlinear model of an OAM-MDM system by separating the signal into different frequency domain features. Secondly, a FDFDnet-based E2E strategy for joint PS and equalization based on the CAP-32 modulation format is presented to compensate the signal impairment. An experiment is carried out to verify the effectiveness of the proposed method in an OAM-MDM IM/DD transmission with three modes; the results demonstrate that the proposed method can build an accurate channel model that can significantly mitigate the nonlinear impairments of the OAM-MDM IM/DD transmission. Thus, the proposed scheme is a promising contender for OAM-MDM transmission in ultra-high-capacity inter-data center interconnects.

2. End-to-end deep learning of joint probabilistic shaping and equalization for an OAM-MDM system using FDFDnet emulator

For a conventional block-structured OAM-MDM system, AI-based methods are commonly implemented by optimizing independent blocks. However, these individually optimized blocks cannot lead to the global optimization of the OAM-MDM system, and as a result, the development of optimal modulation and equalization schemes for OAM-MDM systems remains a challenge. In order to achieve an accurate channel response for an OAM-MDM system, a novel E2E learning strategy for joint PS and equalization based on an FDFDnet emulator is proposed for OAM-MDM transmission with CAP-32 modulation. The E2E learning scheme proposed in this paper is shown in Fig. 1.

Fig. 1. Diagram showing the proposed E2E learning strategy based on a FDFDnet emulator for an OAM-MDM system: (a) actual three-mode OAM-MDM system; (b) proposed OAM-MDM system based on AE.

Download Full Size | PDF

Figure 1(a) depicts a three-mode OAM-MDM system consisting of a transmitter, channel and receiver. In the proposed E2E learning scheme for joint PS and equalization, the whole OAM-MDM system is considered as an AE, with the transmitter regarded as the AE encoder and the receiver as the AE decoder. The channel is simulated with our FDFDnet emulator. The FDFDnet emulator separates the signal into different frequency domain features rather than building a mixed model for the transmission channel for different frequencies, meaning that it can accurately model the channel response of the OAM-MDM system even though the received signal is subject to mode coupling and complex nonlinear impairments. With the trained FDFDnet emulator, gradients can be processed from the AE decoder to the AE encoder, thereby achieving joint training of the transmitter and receiver. In this way, the AE encoder achieves the optimal probability distribution and the AE decoder achieves equalization. As shown in Fig. 1(b), the proposed optimal modulation and equalization scheme based on the FDFDnet emulator is realized through E2E learning, which can bring great performance improvements to an OAM-MDM system.

2.1 FDFDnet-based channel modelling strategy for an OAM-MDM system

In an IM/DD OAM-MDM system, the transmitted CAP-32 symbol sequence can be represented as $x(n) = [{{x_1},{x_2}, \cdot{\cdot} \cdot {x_n}} ],{x_n} \in ( - 1,1)$. The CAP-32 symbol sequence received by the OAM-MDM system is distorted due to the mode coupling and complex nonlinear impairment, and can be represented as $y(n) = [{{y_1},{y_2}, \cdot{\cdot} \cdot {y_n}} ]$.

Before simulating the channel response in the OAM-MDM, the transmitted and received CAP-32 signals are pre-processed as shown in Fig. 2. The current symbol ${x_i}(i = 1, \ldots n)$ is wrapped with its L preceding and L succeeding symbols to form a feature vector ${X_i} = [{{x_{i - L}}, \cdot{\cdot} \cdot ,{x_{i - 1}},{x_i}, \cdot{\cdot} \cdot ,{x_{i + L}}} ]$. The transmitted time-series signal $x(n) = [{{x_1},{x_2}, \cdot{\cdot} \cdot {x_n}} ]$ is transformed into a matrix composed of $(n - 2L)$ feature vectors with a length of $({2L + 1} )$. The feature vector matrix can be expressed as

(1)$$X = \left( {\begin{array}{ccc} {x_1^{}}& \ldots &{x_{2L + 1}^{}}\\ \vdots & \ddots & \vdots \\ {x_{n - 2L}^{}}& \cdots &{x_n^{}} \end{array}} \right) = \left( \begin{array}{l} {X_1}\\ \vdots \\ {X_{n - 2L}} \end{array} \right)$$

Fig. 2. Data preprocessing.

Download Full Size | PDF

The received signal ${y_i}(i = 1,2, \ldots n)$ corresponding to the feature vector ${X_i}(i = 1, \ldots ,n - 2L)$ is taken as real data, and can be expressed as

(2)$$Y = \left( \begin{array}{l} {y_{L + 1}}\\ \vdots \\ {y_{n - L}} \end{array} \right) = \left( \begin{array}{l} {Y_1}\\ \vdots \\ {Y_{n - 2L}} \end{array} \right). $$

The feature vector X and real data Y are combined to form the dataset $D = \{{X,Y} \}$. The dataset D is then divided into two parts to form the training dataset ${D_T} = \{{{X_T},{Y_T}} \}$ and the testing dataset ${D_P} = \{{{X_P},{Y_P}} \}$, where ${X_T} = [{{X_1},{X_2} \cdots {X_T}} ]$, ${Y_T} = [{{Y_1},{Y_2} \cdots {Y_T}} ]$, ${X_P} = [{{X_1},{X_2} \cdots {X_P}} ]$ and\[{Y_P} = [{{Y_1},{Y_2} \cdots {Y_P}} ]\]$({P + T = n - 2L} )$.

The structure of the FDFDnet emulator is depicted in Fig. 3(a). It consists of three blocks: an input layer, two fully connected layers, and an output layer. Each block represents a frequency domain of the OAM-MDM signal (Block 1, 2, 3 represent the low, middle, and high frequencies, respectively). The first two blocks also include a frequency low cut (FLC) pooling layer. The training dataset ${D_T}$ is applied to the FDFDnet emulator to determine all of the parameters. The training data is first passed to the input layer of the Block 1 in the form of a feature vector ${X_i}({i = 1,2, \ldots T} )$ of the training data set ${D_T}$.

Fig. 3. Schematic diagram of the FDFDnet emulator: (a) structure of Block 1; (b) structure of Block 2; (c) structure of Block 3.

Download Full Size | PDF

In the second step, in the Block 1, the input vectors ${X_i}$ with the length of $(2L + 1)$ are converted into the frequency domain with a fast Fourier transform (FFT). The low-frequency part is separated in the FLC pooling layer, and is expressed as

(3)$${F_{low}} = Cu{t_{low}}({FFTshift({FFT({{X_i}} )} )} ), $$

where ${F_{low}}$ is the low-frequency vector output by the FLC pooling layer, with a length of $({2L + 1} )/3$. $Cu{t_{low}}$ represents the extraction of low-frequency feature vectors in the FLC pooling layer. The low-frequency signal is separated by setting the medium-frequency and high-frequency signals as zero. Hence, Block 1 focuses on learning features at low frequency.

In the third step, in Block 1, the two fully connected layers are used to perform two regression calculations on ${F_{low}}$ with the weight vectors $({{W_1},{W_2}} )$ and the bias vectors $({{b_1},{b_2}} )$, and this is followed by a $ReLU ({} )$ activation function for each fully connected layer. The output layer is used to perform a linear calculation to generate two output vectors with the weight vector $({{W_3}} )$ and the bias vector $({{b_3}} )$. The whole process can be expressed as

(4)$${Y_1} = {W_3}({\textrm{Re LU} ({{W_2}(\textrm{Re LU} ({W_1} \times {F_{low}} + {b_1}) + {b_2}} )} )+ {b_3}$$

(5)$${\widehat y_1} = {Y_1}(1,\textrm{ }2L + 1)$$

(6)$${y_1} = {Y_1}(2L + 1,\textrm{ }2L + 2)$$

The dimensions of ${W_1}$, ${b_1}$, ${W_2}$, ${b_2}$, ${W_3}$ and ${b_3}$ are the same as the number of neurons in the fully connected layers. ${Y_1}$ is the output vector of Block 1, and has a length of $({2L + 2} )$. ${\widehat y_1}$ is the reconstructed vector of ${F_{low}}$, with length $({2L + 1} )$. ${y_1}$ is the emulated signal based on the low-frequency feature vector. ${\widehat y_1}$ is calculated to avoid repeated training of signal that have been learned in subsequent blocks, while the ${y_1}$ is added to the output of the subsequent blocks to form the final emulated signal.

In the fourth step, the input of Block 2 is obtained by subtracting the reconstructed vector ${\widehat y_1}$ from the input vector ${X_i}$ of Block 1, which can be expressed as

(7)$${x_2} = {X_i} - {\widehat y_1}$$

where ${x_2}$ represents the input of Block 2. Equation (7) can be used to separate low-frequency signal from the input vector ${X_i}$ to avoid the need for repeated training.

In the fifth step, in Block 2, the input vector ${x_2}$ is taken using the same calculation process as for Block 1. Block 2 can then be used to extract the medium-frequency signals, which can be expressed as

(8)$${Y_2} = {W_6}({\textrm{Re LU} ({{W_5}(\textrm{Re LU} ({W_4}(Cu{t_{mid}}({FFTshift({FFT({{x_2}} )} )} )+ {b_4}))) + {b_5}} )} )+ {b_6}$$

(9)$${\hat{y}_2} = {Y_2}(1,\textrm{ }2L + 1)$$

(10)$${y_2} = {Y_2}(2L + 1,\textrm{ }2L + 2)$$

where ${Y_2}$ is the output vector of Block 2 with a length of $({2L + 2} )$; ${\hat{y}_2}$ is the reconstructed vector of medium frequency with a length of $({2L + 1} )$; and ${y_2}$ is the emulated signal based on the medium-frequency feature vector. $Cu{t_{mid}}$ represents the extraction of the medium-frequency feature vector in the FLC pooling layer. The dimensions of ${W_4}$, ${b_4}$, ${W_5}$, ${b_5}$, ${W_6}$ and ${b_6}$ are the same as the number of neurons in the fully connected layers.

In the sixth step, the input to Block 3 is obtained by subtracting the reconstructed vector ${\widehat y_2}$ from the input vector ${x_2}$ of Block2, which can be expressed as

(11)$${x_3} = {x_2} - {\widehat y_2}, $$

where ${x_3}$ represents the input to Block 3, which retains only the high-frequency signal. Block 3 can therefore separate the medium-frequency signals from the input vector ${x_2}$ to avoid the need for repeated training.

The two fully connected layers are then used to perform two regression calculations on ${x_3}$ with the weight vectors $({{W_7},{W_8}} )$ and the bias vectors $({{b_7},{b_8}} )$, and this is followed by a $ReLU ({} )$ activation function for each fully connected layer. The output layer is then used to perform a linear calculation with the weight vector $({{W_9}} )$ and the bias vector $({{b_9}} )$, without an activation function. The whole process can be expressed as

(12)$${y_3} = {W_6}(\textrm{Re LU} ({W_5} \times {x_3} + {b_5}) + {b_6}, $$

where ${y_3}$ is the emulated signal based on the high-frequency feature vector. The dimensions of ${W_7}$, ${b_7}$, ${W_8}$, ${b_8}$, ${W_9}$ and ${b_9}$ are the same as the number of neurons in the fully connected layers.

Finally, the emulated signal $\hat{Y}$ of the input sample ${X_i}$ output from the FDFDnet emulator can be expressed as

(13)$$\hat{Y} = {y_1} + {y_2} + {y_3}. $$

After obtaining the estimated value $\hat{Y}$, FDFDnet applies the mean squared error (MSE) function to calculate the loss throughout the training process. The Adam gradient descent function is employed to iteratively train the model parameters $({{W_1},{b_1},{W_2},{b_2},{W_3},b{}_3,{W_4},{b_4},{W_5},{b_5},{W_6},{b_6},{W_7}}$,${{b_7},{W_8},{b_8},{W_9},{b_9}} )$ through the use of backpropagation. The distortion symbols corresponding to the testing set ${X_P} = [{{X_1},{X_2} \cdots {X_P}} ]$ can be simulated by using the forward propagation process of the FDFDnet emulator after the training process.

The combination of mode coupling in OAM-MDM with nonlinear impairment from optoelectronic devices typically results in strong complex nonlinear damage, which poses a challenge for traditional channel emulators based on a mixed model for different frequencies. However, since the FDFDnet emulator separates the signal into different frequency domain features rather than building a mixed model for the transmission channel for different frequencies, it can accurately model the channel response of the OAM-MDM system even though the received signal is subject to mode coupling and complex nonlinear impairment. Our FDFDnet emulator can obtain an accurate model from the different frequency domain features, thus forming an effective OAM-MDM transmission channel emulator for strong complex nonlinearity.

2.2 Joint probabilistic shaping and equalization for an OAM-MDM system based on AE

In this section, an AE-based scheme for joint optimization of PS and equalization for an OAM-MDM system with CAP-32 modulation is introduced, as depicted in Fig. 4. The system is composed of an AE encoder, a mapper, CAP modulation, a FDFDnet emulator, CAP demodulation, and an AE decoder. Both the encoder and decoder are neural networks (NNs) composed of fully connected layers.

Fig. 4. Diagram showing our AE scheme for PS and equalization in an OAM-MDM system. (a) AE Encoder, (b) Mapper, (c) FDFDnet emulator, (d) AE Decoder.

Download Full Size | PDF

At the transmitter, the AE encoder first generates a vector of probabilities ${P_M} = ({{p_1}, \ldots ,{p_M}} )$ to denote the probability for each constellation point ${C_M} = ({{c_1}, \ldots {c_M}} )$ and $M = {2^m}$ represents the modulation order, as shown in Fig. 4(a).

Next, as shown in Fig. 4(b), the constellation point probability ${P_M}$ and the batch size S are input to a sampler. In order to find the corresponding symbol number ${Q_M} = ({{q_1}, \ldots {q_M}} )$ for each constellation point, algorithm 2 from [26] is applied. It is implemented by minimizing the information entropy according to the constellation point probability ${P_M}$ and batch size S. The symbol number ${Q_M}$ are then converted to the binary representation to get the bit vector ${b_k} = ({{b_{1,k}}, \ldots {b_{m,k}}} ),k = 1, \ldots S$; these are encoded to the corresponding one-hot vector ${V_S} = ({v_1}, \ldots {v_S})$ of size $M \times S$, which is not zero at the index corresponding to the binary-to-integer conversion of the bit vector. Then, to consider the non-uniform constellation point probability, the constellation points ${C_M} = \{{{c_1}, \ldots {c_M}} \}$ are normalized based on the probability ${P_M}$, as follows:

(14)$${\hat{c}_i} = {c_i}/\sqrt {\sum\nolimits_{i = 1}^M {{p_i}{c_i}^2} }, $$

where ${\hat{c}_i}$ represents the normalised constellation point, and ${\hat{C}_M} = \{{{{\hat{c}}_1}, \ldots {{\hat{c}}_M}} \}$. To generate a symbol sequence K consistent with probability ${P_M}$, we take the dot product of a one-hot vector ${V_S}$ with ${\hat{C}_M}$, which can be expressed as

(15)$$K = {V_S} \times {\hat{C}_M}. $$

Next, the symbol sequence $K = \{{{k_1}, \ldots {k_S}} \}$ is upsampled and filtered using two shaping filters for the CAP modulation, and the CAP signal $X = \{{{x_1}, \ldots {x_S}} \}$ is generated.

At the channel, as shown in Fig. 4(c), the CAP signal $X = \{{{x_1}, \ldots {x_S}} \}$ is then fed into the FDFDnet emulator to generate a received signal $Y = \{{{Y_1}, \ldots {Y_S}} \}$, which contains complex nonlinear impairment.

At the receiver, the impaired signal $Y = \{{{Y_1}, \ldots {Y_S}} \}$ is filtered using two shaping filters and then downsampled to generate the CAP demodulated signal $E = \{{{E_1}, \ldots {E_S}} \}$. Finally, as shown in Fig. 4(d), the decoder maps the CAP demodulated signal $E = \{{{E_1}, \ldots {E_S}} \}$ to a probability vector ${R_j}(j = 1, \ldots S)$.

In order to train E2E AE model accurately, a modified loss function based on the generalized mutual information (GMI) is used, which can be expressed as [15]

(16)$$\ell = \frac{1}{S}\sum\nolimits_{i = 1}^S {[{ - {V_i}\log ({R_i})} ]} - H({{P_M}} ), $$

where $H({{P_M}} )$ is the entropy of ${P_M}$. In the E2E AE model, $\ell $ is used to calculate the loss for the training process, and the Adam gradient descent function is applied to optimize ${P_M}$ and the AE decoder using backpropagation. The optimal probability distribution ${P_M}$ and equalizer are found when the loss function $\ell $ converges. Thus, the AE can achieve joint PS and equalization for an OAM-MDM system through the use of the FDFDnet emulator.

3. Experimental

3.1 Experimental setup

As shown in Fig. 5, to verify the accuracy of the proposed FDFDnet emulator, an experiment was carried out on a 300 Gbit/s OAM-MDM IM/DD transmission with three OAM modes over a 10 km RCF. At the transmitter, a pseudo-random bit sequence with the length of ${2^{18}}$ was generated and mapped onto a CAP-32 symbol sequence. The electrical signal was passed through the arbitrary waveform generator (AWG) and electrical amplification (EA) and then modulated by the MZM onto a double-sideband optical signal generated by a laser with a wavelength of 1550.12 nm. The optical signal was then divided into three branches using three optical couplers (OCs), and amplified using three erbium-doped fiber amplifiers (EDFAs). The three branches of the optical signal were delayed separately using a 10 m SMF for decorrelation of the data modes, and were simultaneously transmitted into space through a polarisation controller (PC), collimator (Col.), and linear polarizer (LP). Three SLMs were then used to modulate the three optical signals into three OAM modes ($l = 2,3,4$). In this way, 300Gbit/s transmission could be realized with a 20 GBaud CAP-32 signal per OAM mode in the OAM-MDM IM/DD transmission. The three optical signals in the OAM modes were then combined using polarization beam combiners (PBC) and a beam combiner (BC). Finally, the combined optical signal was converted to circular polarization via a quarter-wave plate (QWP) and transmitted through a 10 km RCF.

Fig. 5. Experimental setup for the E2E OAM-MDM system.

Download Full Size | PDF

In Fig. 5, inset (i) shows the cross section of the RCF, and insets (ii)–(iv) show the intensity profiles of the three OAM modes ($l = 2,3,4$), respectively. At the receiver, the multiplexed OAM mode was divided into three beams by two BSs, and these were then converted into Gaussian beams via three vortex phase plates (VPPs). The three Gauss beams were coupled to the SMFs through the Col., and converted into electronic signal via three PDs. The electrical signal was recorded using a real-time oscilloscope at 256GSa/s, and processed by offline DSP including resampling, synchronisation, CAP demodulation, AE-optimized equalization and BER calculation. Each set of 11 adjacent symbols of the electrical signal was input to the FDFDnet emulator as a feature vector ${X_i}$. The corresponding symbol after synchronization was used as real data ${Y_i}$ to sent to the FDFDnet emulator. The feature vector ${X_i}$ and real data ${Y_i}$ were used as the dataset for training the emulator. In the experiment, L was set to five, and the batch size and learning rate of emulator were set to 512 and 1e-3, respectively, for the three OAM modes.

In order to verify the performance of our FDFDnet emulator, we compared it with a traditional CGAN emulator. In addition, as the number of blocks in FDFDnet represent the number of frequency domains considered, we tested the impact of the number of blocks by using only two blocks, to simulate low and high frequencies, in a model called the FDFDnet-2 emulator. As the number of blocks increases, it can lead to a significant increase in the complexity of the emulator. The same dataset was used for training the FDFDnet-2 and CGAN emulators. The trained emulators were then integrated with the AE to obtain the optimal probability distribution and equalization for the OAM-MDM. The pseudo-random bit sequence was mapped onto a CAP-32 signal sequence using a distribution matcher according to the learned optimal probability distribution [26]. The CAP-32 signal was transmitted via AWG and modulated by an optical carrier with the MZM. The electric signal was then recorded and processed vis offline DSP, including resampling, synchronization, CAP demodulation, AE-optimized equalization and BER calculation. The values of the BER for the FDFDnet, FDFDnet-2 and CGAN emulators were compared to verify the effectiveness of the proposed scheme.

3.2 Experimental results and analysis

3.2.1 Performance of the FDFDnet emulator for the OAM-MDM system

In these experiments, the OAM-MDM system channels were simulated using the FDFDnet, FDFDnet-2 and CGAN emulators. The accuracy of each emulator was verified by comparing its output with the real OAM-MDM system channel signal. Figures 6(a), (b) and (c) illustrate the frequency domain emulated results trained with different emulators when the voltage peak-to-peak (Vpp) is 350 mV and received optical power (ROP) is 0 dBm. Compared with the FDFDnet-2 and CGAN emulator, the results from the FDFDnet emulator show higher consistency with the real OAM-MDM channel output. The fading trends of FDFDnet and FDFDnet-2 at high frequencies are more similar to the real channel output signal than that of the CGAN emulator.

Fig. 6. Channel output waveforms in the frequency domains based on the OAM-MDM system channel, CGAN emulator and FDFD emulator for (a) $l = 2$, (b) $l = 3$, (c) $l = 4$.

Download Full Size | PDF

To quantitatively represent the channel modelling effect of the real OAM-MDM system using the FDFDnet, FDFDnet-2 and CGAN emulators, we compared the normalized MSE (NMSE) and BER performance. Figures 7(a), (b) and (c) show the NMSE and BER performance for the signals generated by the FDFDnet, FDFDnet-2 and CGAN emulators, respectively, at different ROPs and for different OAM modes ($l = 2,3,4$). Compared with the traditional CGAN emulator, the maximum improvements in the modelling accuracy of the FDFDnet and FDFDnet-2 emulator are 30.8%, 26.3%, 31%, 11.5%, 10.1% and 9.6% at an ROP of −3dBm for the three modes, respectively. The average BER errors between the signal generated by the FDFDnet emulator and the real received signal were only 8%, 7.6% and 8.9% for different OAM modes ($l = 2,3,4$), respectively. The average BER errors between the signal generated by the FDFDnet-2 emulator and the real received signal were 15.3%, 14.5% and 15.8%. These were much lower than the values of 20.5%, 21.9% and 19.6% found for the CGAN emulator. This is because the FDFDnet-2 emulator only divides the complex nonlinear impairment into two parts to train the transmission channel model, while the FDFDnet emulator divides the complex nonlinear impairment into three parts, thereby achieving high accurate modeling of the transmission channel. This indicates that the proposed FDFDnet emulator is feasible for use in modelling an OAM-MDM system channel with complex nonlinear impairment.

Fig. 7. NMSE and BER value for the signal amplitude versus ROP for (a) $l = 2$, (b) $l = 3$, (c) $l = 4$ in an OAM MDM IM/DD transmission system over a 10 km RCF.

Download Full Size | PDF

The complexity of the proposed FDFDnet emulator was also compared with that of the CGAN emulator. The number of real multiplications carried out by the CGAN emulator can be expressed by [19]

(17)$$C{C_{CGAN}} = ({d_1} + z){k_1} + {k_1}{k_2} + {k_2}{d_2}$$

where ${d_1}$ denotes the length of the feature vector, and ${d_2}$ denotes the length of the generated vector. z is the length of the noise vector, which in this case is 10. ${k_1}$ denotes the number of neurons for the first hidden layer, ${k_2}$ denotes the number of neurons for the second hidden layer.

The complexity of the FDFDnet emulator can be expressed as [27]

(18)$$C{C_{FDFDnet}} = \frac{{11}}{6}{d_1} \times {h_1} + 3{h_1}{h_2} + {h_2}(2{d_1} + 3{d_2})$$

where ${h_1}$ denotes the number of neurons in the first hidden layer of each block, ${h_2}$ denotes the number of neurons in the second hidden layer of each block.

The complexity of the two emulators with similar NMSE performance can then be calculated based on the above parameters for different values of the ROP. As shown in Tables 1, 2 and 3, compared with the conventional CGAN emulator, the maximal reductions in complexity achieved by the FDFDnet emulator are 59.5%, 50.4%, and 52.8%, respectively, at an ROP of −3dBm, for the three modes. This indicates that the proposed FDFDnet emulator has both high performance and low complexity.

Table 1. Experimental results for the complexity of the CGAN and FDFDnet emulators for l = 2

View Table | View all tables in this article

Table 2. Experimental results for the complexity of the CGAN and FDFD emulators for l = 3

View Table | View all tables in this article

Table 3. Experimental results for the complexity of the CGAN and FDFD emulators for l = 4

View Table | View all tables in this article

3.2.2 Performance validation of AE-based joint PS and equalization for OAM-MDM system

In these experiments, the AE-based joint PS and equalization scheme was applied to the OAM-MDM system with the parameters shown in Table 4. We present the BER results based on the AE implementation of the FDFDnet, FDFDnet-2 and CGAN emulators with different values of Vpp for the three OAM modes in Figs. 8, 9 and 10. As can be seen from Figs. 8(a), 9(a) and 10(a), with an increase in the Vpp, the BER values for the AE-based joint PS and equalization scheme for the FDFDnet, FDFDnet-2 and CGAN emulators all decreases at first and then increases. Figures 8(b), 9(b) and 10(b) show that the entropy of the PS for the FDFDnet, FDFDnet-2 and CGAN emulators also increases at first and then decreases as Vpp increases. This is because with an increase in Vpp, the nonlinear effect of the system first decreases and then increases.

Fig. 8. Measured values of BER versus Vpp for $l = 2$ in an IM-DD OAM-MDM transmission system over a 10 km RCF.

Download Full Size | PDF

Fig. 9. Measured values of BER versus Vpp for $l = 3$ in an IM-DD OAM-MDM transmission system over a 10 km RCF.

Download Full Size | PDF

Fig. 10. Measured values of BER versus Vpp for $l = 4$ in an IM-DD OAM-MDM transmission system over a 10 km RCF.

Download Full Size | PDF

Table 4. Parameters of the AE

View Table | View all tables in this article

The AE-based joint PS and equalization scheme for the FDFDnet emulator yields better BER performance than for the FDFDnet-2 and CGAN emulators for various values of Vpps. Furthermore, as shown in Fig. 11, the values of entropy achieved by the AE for the FDFDnet emulator are 4.4, 4.42, 4.39, 4.7, 4.65, 4.72, 4.59, 4.62, 4.56, which are lower than those of 4.47, 4.52, 4.46, 4.73, 4.71, 4.76, 4.64, 4.68, 4.66 obtained for the FDFDnet-2 emulator and also lower than those of 4.5, 4.56, 4.48, 4.75, 4.74, 4.77, 4.65, 4.7, 4.68 obtained for the CGAN emulator at the same Vpp for three OAM modes. This is because the FDFDnet-2 emulator and CGAN emulator build a hybrid model for complex nonlinear impairment at different frequencies in the OAM-MDM system. However, the FDFDnet emulator can build complex nonlinear model by separating signal into different frequency domain features, and can therefore simulate the OAM-MDM transmission channel model accurately.

Fig. 11. Entropy of the trained PS-based FDFDnet, FDFDnet-2 and CGAN emulators for CAP-32 versus VPP for (a) $l = 2$, (b) $l = 3$,(c) in an IM-DD OAM-MDM transmission system over a 10 km RCF.

Download Full Size | PDF

Figure 12 illustrates the BER performance for different OAM modes ($l = 2,3,4$) versus ROP for the proposed FDFDnet, FDFDnet-2 and CGAN emulators when the Vpp is 300 mV. The measured ROP ranges from −7dBm to 3dBm. It is clear that the proposed FDFDnet-based joint PS and equalization scheme outperforms the other methods. Compared with the CAP-32 baseline scheme, the FDFDnet-based joint PS and equalization scheme achieves improvements in the receiver sensitivity of 5.5 dB, 5.1 dB, 5.3 dB for different OAM modes ($l = 2,3,4$) at the 20% FEC limit, respectively. Compared with the FDFDnet-2-based joint PS and equalization scheme, the FDFDnet-based joint PS and equalization scheme achieves improvements in receiver sensitivity of 2 dB, 2.5 dB, 2.9 dB for different OAM modes ($l = 2,3,4$) at the 20% FEC limit, respectively. Compared with the CGAN-based scheme, the FDFDnet-based scheme yields improvements in receiver sensitivity of 3 dB, 2.5 dB, 2.2 dB for different OAM modes ($l = 2,3,4$) at the 20% FEC limit, respectively. These results validate the effectiveness of the proposed FDFDnet-based joint PS and equalization scheme for mitigating nonlinear impairment in OAM-MDM transmission systems.

Fig. 12. Measured values of BER versus ROP for (a) $l = 2$, (b) $l = 3$,(c) $l = 4$ in an IM-DD OAM-MDM transmission system over a 10 km RCF.

Download Full Size | PDF

4. Conclusion

This paper has proposed an E2E learning scheme with joint PS and equalization based on an FDFDnet emulator for an OAM-MDM IM/DD system. As the channel model of the damaged signal exhibits different characteristics at different frequencies, it is difficult for traditional channel emulators to build a mixed model for the transmission channel for the different frequencies of OAM-MDM transmission. However, our FDFDnet emulator can accurately build a complex nonlinear model of an OAM-MDM system by separating the signal into different frequency domain features. Our experimental results show that the proposed FDFDnet emulator outperforms the conventional CGAN emulator, with improvements in modelling accuracy of 30.8%, 26.3% and 31% for OAM modes $l = 2,3,4$, respectively. Furthermore, an AE-based joint PS and equalization scheme for the FDFDnet emulator has been presented. In terms of the receiver sensitivity, the proposed FDFDnet emulator with E2E scheme outperforms the CGAN emulator by 3, 2.5, 2.2 dBm, the FDFDnet-2 emulator by 2, 2.5, 2.9 dBm, and the real channel by 5.5, 5.1, and 5.3 dBm for the three OAM modes, respectively. These experimental results demonstrate that the proposed AE-based joint PS and equalization scheme with the FDFDnet emulator is a promising candidate for OAM-MDM optical fiber communication.

Funding

National Key R&D Program of China from Ministry of Science and Technology (2019YFA0706300); Beijing Municipal Natural Science Foundation (L232049); Joint Fund Project of National Natural Science Foundation of China (U22A2087); National Outstanding Youth Science Fund Project of National Natural Science Foundation of China (62022016); National Natural Science Foundation of China (62105026, 62205023); Fundamental Research Funds for the Central Universities; Beijing Municipal Natural Science Foundation (4222075).

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. P. Carbone, G. Dan, J. Gross, et al., “NeuroRAN: rethinking virtualization for AI-native radio access networks in 6 G,” arXiv, arXiv:2104.08111 (2021). [CrossRef]

2. N. Chi, Y. Zhou, Y. Wei, et al., “Visible light communication in 6G: advances, challenges, and prospects,” IEEE Veh. Technol. Mag. 15(4), 93–102 (2020). [CrossRef]

3. Y. Cui, R. Gao, Q. Zhang, et al., “Hidden conditional random field-based equalizer for the 3D-CAP-64 transmission of OAM mode-division multiplexed ring-core fiber communication,” Opt. Express 31(18), 28747–28763 (2023). [CrossRef]

4. S. Zhou, X. Liu, R. Gao, et al., “Adaptive Bayesian neural networks nonlinear equalizer in a 300-Gbit/s PAM8 transmission for IM/DD OAM mode division multiplexing,” Opt. Lett. 48(2), 464–467 (2023). [CrossRef]

5. F. Wang, R. Gao, Z. Li, et al., “OAM mode-division multiplexing IM/DD transmission at 4.32 Tbit/s with a low-complexity adaptive-network-based fuzzy inference system nonlinear equalizer,” Opt. Lett. 49(3), 430–433 (2024). [CrossRef]

6. G. Li, N. Bai, N. Zhao, et al., “Space-division multiplexing: the next frontier in optical communication,” Adv. Opt. Photon. 6(4), 413–487 (2014). [CrossRef]

7. B. J. Puttnam, G. Rademacher, and R. S. Luís, “Space-division multiplexing for optical fiber communications,” Optica 8(9), 1186–1203 (2021). [CrossRef]

8. S. Randel, R. Ryf, A. Sierra, et al., “6×56-Gb/s mode-division multiplexed transmission over 33-km few-mode fiber enabled by 6×6 MIMO equalization,” Opt. Express 19(17), 16697–16707 (2011). [CrossRef]

9. J. Tu, J. Li, T. Wen, et al., “OAM-SDM solution toward a submarine cable,” J. Lightwave Technol. 41(7), 1963–1973 (2023). [CrossRef]

10. M. Zhuang, J. Tu, D. Wang, et al., “Multi-step-index fiber with a large number of weakly coupled OAM mode groups for IM/DD systems in data centers: design, fabrication, and characterization,” Opt. Lett. 48(22), 6036–6039 (2023). [CrossRef]

11. Q. Xu, R. Gao, H. Chang, et al., “Bayesian generative adversarial network emulator based end-to-end learning strategy of the probabilistic shaping for OAM mode division multiplexing IM/DD transmission,” Opt. Express 31(24), 40508–40524 (2023). [CrossRef]

12. S. Wang, R. Gao, X. Xin, et al., “Adaptive Bayes-Adam MIMO equalizer with high accuracy and fast convergence for orbital angular momentum mode division multiplexed transmission,” J. Lightwave Technol. 41(15), 5026–5036 (2023). [CrossRef]

13. Z. Li, J. Shi, Y. Zhao, et al., “Deep learning based end-to-end visible light communication with an in-band channel modeling strategy,” Opt. Express 30(16), 28905–28921 (2022). [CrossRef]

14. O. Timothy and J. Hoydis, “An introduction to deep learning for the physical layer,” IEEE Trans. Cogn. Commun. Netw. 3(4), 563–575 (2017). [CrossRef]

15. St. Maximilian, F. Aoudia, and J. Hoydis, “Joint learning of geometric and probabilistic constellation shaping,” 2019 IEEE Globecom Workshops (GC Wkshps).IEEE, 2019.

16. V. Aref and M. Chagnon, “End-to-end learning of joint geometric and probabilistic constellation shaping,” in Optical Fiber Communication Conference (OFC) 2022, (Optica Publishing Group, 2022), p. W4I.3.

17. Z. Li, C. Huang, J. Jia, et al., “Deep learning-based end-to-end bit-wise autoencoder for G-Band fiber-terahertz integrated DFT-S-OFDM communication system,” in Optical Fiber Communication Conference (OFC) 2023, (Optica Publishing Group, 2023), p. W2B. 26.

18. J. Shi, Z. Li, J. Jia, et al., “Waveform-to-Waveform End-to-End Learning Framework in a Seamless Fiber-Terahertz Integrated Communication System,” J. Lightwave Technol. 41(8), 2381–2392 (2023). [CrossRef]

19. Y. Hang, Z. Niu, and S. Xiao, “Fast and accurate optical fiber channel modeling using generative adversarial network,” J. Lightwave Technol. 39(5), 1322–1333 (2021). [CrossRef]

20. B. Karanov, M. Chagnon, and V. Aref, “Concept and experimental demonstration of optical IM/DD end-to-end system optimization using a generative model,” in Optical Fiber Communication Conference (OFC) 2020, (Optica Publishing Group, 2020), p. Th2A. 48.

21. Y. Zhao, P. Zou, W. Yu, et al., “Two tributaries heterogeneous neural network based channel emulator for underwater visible light communication systems,” Opt. Express 27(16), 22532–22541 (2019). [CrossRef]

22. J. Shi, W. Niu, Z. Li, et al., “Optimal adaptive waveform design utilizing an end-to-end learning-based pre-equalization neural network in an UVLC System,” J. Lightwave Technol. 41(6), 1626–1636 (2023). [CrossRef]

23. J. Cai, Z. Li, and N. Chi, “Physical prior inspired ensemble learning enables effective channel estimation of underwater visible light communication,” Opt. Express 31(10), 16148–16161 (2023). [CrossRef]

24. W. Yan, B. Liu, L. Li, et al., “Nonlinear distortion and DSP-based compensation in metro and access networks using discrete multi-tone,” European Conference and Exhibition on Optical Communication. Optica Publishing Group, 2012.

25. X. Jin, A. Gomez, K. Gomez, et al., “Mode coupling effects in ring-core fibers for space-division multiplexing systems,” J. Lightwave Technol. 34(14), 3365–3372 (2016). [CrossRef]

26. P. Schulte and G. Böcherer, “Constant composition distribution matching,” IEEE Trans. Inform. Theory 62(1), 430–434 (2016). [CrossRef]

27. Cristian Challu, K. G. Olivares, B. N. Oreshkin, et al., “Nhits: Neural hierarchical interpolation for time series forecasting,” Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 37. No. 6. 2023.

ROP (dBm)	NMSE	Hyperparameter ( $h_{2}$ )	Multiplications (CGAN; FDFDnet)	Reduction
−3	0.023	(256,256,72,72)	86528; 35064	59.5%
−2	0.022	(192,192,64,64)	52608; 29632	43.7%
−1	0.022	(150,150,52,52)	34800; 22204	36.2%
0	0.021	(130,130,48,48)	27560; 19920	27.7%
1	0.021	(110,110,42,42)	21120; 16674	21.1%
2	0.023	(92,92,36,36)	16008; 13644	14.8%

ROP (dBm)	NMSE	Hyperparameter ( $k_{1}, k_{2}, h_{1}, h_{2}$ )	Multiplications (CGAN; FDFDnet)	Reduction
−3	0.025	(228,228,72,72)	70680; 35064	50.4%
−2	0.024	(190,190,66,66)	51680; 30954	40.1%
−1	0.024	(148,148,50,50)	34040; 21050	38.2%
0	0.023	(128,128,46,46)	26880; 18814	30.1%
1	0.022	(110,110,40,40)	21120; 15640	26.0%
2	0.025	(85,85,32,32)	14195; 11744	17.3%

ROP (dBm)	NMSE	Hyperparameter ( $k_{1}, k_{2}, h_{1}, h_{2}$ )	Multiplications (CGAN; FDFDnet)	Reduction
−3	0.025	(256,256,80,80)	86528; 40880	52.8%
−2	0.023	(180,180,65,65)	47160; 30290	35.8%
−1	0.025	(160,160,60,60)	38720; 27060	30.1%
0	0.024	(140,140,55,55)	31080; 23980	22.8%
1	0.026	(92,92,36,36)	16008; 13644	14.8%
2	0.026	(80,80,36,36)	12960; 11744	9.4%

Parameters	Values
Encoder layers	3
Decoder layers	3
Activation function	Relu
Batch size	8192
Learning rate	1e-3
Optimizer	Adam
Epoch	1000

ROP (dBm)	NMSE	Hyperparameter ( $h_{2}$ )	Multiplications (CGAN; FDFDnet)	Reduction
−3	0.023	(256,256,72,72)	86528; 35064	59.5%
−2	0.022	(192,192,64,64)	52608; 29632	43.7%
−1	0.022	(150,150,52,52)	34800; 22204	36.2%
0	0.021	(130,130,48,48)	27560; 19920	27.7%
1	0.021	(110,110,42,42)	21120; 16674	21.1%
2	0.023	(92,92,36,36)	16008; 13644	14.8%

End-to-end learning strategy based on a frequency domain feature decoupling network emulator with joint probabilistic shaping and equalization for a 300-Gbit/s OAM mode division multiplexing transmission

Abstract

1. Introduction

2. End-to-end deep learning of joint probabilistic shaping and equalization for an OAM-MDM system using FDFDnet emulator

2.1 FDFDnet-based channel modelling strategy for an OAM-MDM system

2.2 Joint probabilistic shaping and equalization for an OAM-MDM system based on AE

3. Experimental

3.1 Experimental setup

3.2 Experimental results and analysis

3.2.1 Performance of the FDFDnet emulator for the OAM-MDM system

3.2.2 Performance validation of AE-based joint PS and equalization for OAM-MDM system

4. Conclusion

Funding

Disclosures

Data availability

References

Data availability

Cited By

Figures (12)

Tables (4)

Equations (19)

Optics Express