Reproduction of mode-locked pulses by spectrotemporal domain-informed deep learning

Jingxuan Sun; Zhen Liu; Yiqing Shu; Yiqing Shu; Jianqing Li; Weicheng Chen; Weicheng Chen

doi:10.1364/OE.501721

1. Introduction

Fiber lasers stimulate diverse industrial applications due to their prominent advantages, such as their high output intensity [1], high pulse repetition frequency [2], narrow pulse width [3] and narrow linewidth [4]. The investigation of the generation [5,6] and dynamics [7,8] of mode-locked soliton pulses in fiber lasers is of great significance and has attracted many researchers’ attention. Since artificial intelligence (AI) technology has been widely introduced into optics, the subject termed “intelligent photonics” has emerged [7–11]. AI methods, including evolutionary algorithms (EAs), genetic algorithms (GAs), neural networks (NNs), deep learning (DL) and human-like algorithms (HLAs), have been extensively employed for automatically triggering mode-locked pulse generation. In general, AI-induced mode locking is achieved with assistance from a single information domain, such as a second harmonic generation (SHG) trace [12], radio frequency (RF) [13] or spectrum [14]. However, these approaches suffer from slow optimization speeds and limited parameter control. On the other hand, manipulating the characteristics of mode-locked pulses based on AI technology has been another popular topic in the field of fiber optics. AI technology has been used to generate various mode-locked patterns, such as dissipative solitons [15] and breathing solitons [16], and manipulate pulse parameters based on the HLA-based Rosenbrock search method as human-like behavior [17]. Additionally, AI technology can dynamically control the dynamics of mode-locked pulses by optimal strategies [18,19]. However, the aforementioned manipulations were fulfilled by utilizing a single formation domain, such as spectrum information or temporal signals.

Although AI technology has made great achievements in triggering mode locking and manipulating mode-locked pulse parameters, the repeatability of the reproduction of unique pulse states is not available for the abovementioned studies. Considering that one frequency component of an arbitrary pulse wave function is denoted as $\omega _{_{1}}$, the computation of $\omega _{_{1}}$ can be expressed as:

(1)$$\frac{\mathrm{d}\phi_{_{1}}}{\mathrm{d} t}=\dfrac{\mathrm{d}\left(\omega_{_{1}} t + \varphi_{_{1}}\right)}{\mathrm{d} t}=\omega_{_{1}},$$

where $\phi _{_{1}}$ and $\varphi _{_{1}}$ are the time-varying and initial phases, respectively. Eq. (1) can be solved as:

(2)$$\phi_{_{1}}=\int \omega_{_{1}}\mathrm{d} t=\omega_{_{1}}t+C,$$

where $C$ denotes an arbitrary constant. However, a specific solution depending on the value of $\varphi _{_{1}}$ becomes unattainable in the absence of additional constraint conditions. Even if the spectra of two pulse states appear to be identical, their pulse patterns, including intensity, phase and profile, may not be completely identical. Relying solely on the spectral domain as the criterion is inadequate for evaluating the reproduction of mode-locked pulses. Likewise, relying solely on the temporal domain as a reproduction criterion is also inadequate since the same pulse shape can correspond to different spectral profiles.

Hence, we propose a novel spectrotemporal domain-informed deep learning algorithm for mode-locked pulse reproduction. This algorithm combines pulse information in both the spectral and temporal domains, providing a more comprehensive criterion for accurate reproduction. We first trained a fully connected neural network (FCNN) to describe the mode-locked fiber laser by using the parameters of a self-developed automated polarization controller (PC) as inputs and the predicted spectrotemporal data as outputs. After training the FCNN, we could further optimize the polarization settings within the cavity by employing the backpropagation (BP) and gradient descent (GD) algorithms. Then, the loss function, consisting of both the autocorrelation trace and spectrum as the temporal and spectral information domains, respectively, was designed for optimizing the reproduction process. Furthermore, we introduced a novel single-layer perceptron model to retrieve the initial phase distribution of the reproduced pulses. Thus, the uniqueness of the reproduced results can be demonstrated by the retrieval of the phase information. The reproduction matching similarity defined by the spectrotemporal MSE was $3.99\times {10}^{-5}$. Our work can stimulate the potential application of intelligent photonics in the ultrafast laser industry.

2. Principle of reproduction

The reproduction workflow can be summarized as follows: Firstly, a comprehensive data set including spectra and autocorrelation data for every PC setting is collected. Subsequently, an FCNN is iteratively trained with the collected data set until the loss error arrives at a predefined threshold. Then, a nonlinear mapping between the PC parameters and experimental outputs is established through the well-trained FCNN. Finally, the FCNN has an ability of manipulating the PC to experimentally reproduce the targeted pulse state by utilizing the trained mapping.

2.1 Training for the laser model

An FCNN was constructed to simulate an experimental fiber laser system to establish the mathematical relationship between the PC parameters and soliton states. As depicted in Fig. 1(b), the FCNN employs 2 parameters, i.e., the rotation angle $\theta$ and the squeeze level $\phi$ of the self-developed automated PC, in the input layer, represented as $\left (\theta,\phi \right )^\top$, where $\top$ indicates the transpose of the vector. The squeeze level is defined by the pressure on the optical fiber in the PC and measured by a stress sensor. The FCNN incorporates 3 hidden layers in an inverted pyramid shape, with the hyperbolic tangent function $\tanh \left (x\right )$ as the nonlinear activation function for better performance. All hidden layers together are essentially a function $\mathbf {F}$. The output layer is designed for predicting the spectrum data $\widehat {A}$ and the autocorrelation trace data $\widehat {I}$. Therefore, the mathematical relationship between inputs and outputs can be represented as follows:

(3)$$\mathbf{F}\left(\begin{matrix}\theta\\\phi\\\end{matrix}\right)=\left(\begin{matrix}\widehat{I}\left(\theta,\phi\right)\\\widehat{A}\left(\theta,\phi\right)\\\end{matrix}\right).$$

Fig. 1. The reproduction process of spectrotemporal domain-informed deep learning. (a) Digital control system. (b) Well-trained FCNN. (c) Matching the optimized process between the experimental results and the targeted soliton state. (d) Experimental setup.

Download Full Size | PDF

The FCNN is trained with experimental data. The well-trained FCNN depicted in Fig. 1(b) can provide a well-fitted mathematical relationship between the fiber laser output and the PC settings, e.g., autocorrelation trace and the corresponding PC parameters.

2.2 Training for mode-locked pulse reproduction

After laser model training, the subsequent step is target reproduction training. The desired PC parameters are searched by GD. The well-trained FCNN is considered a special activation function. It should be noted that the manipulation of PC parameters for reproducing targeted pulses cannot be achieved without the mapping of PC parameters and laser output provided by FCNN, BP and GD. The input layer is then transformed into the layer to be optimized. As shown in Fig. 1(b), the reproduction procedure starts with the random initialization of the PC parameters $\left (\widehat {\theta },\widehat {\phi }\right )^\top$. Then, by inputting the initial PC parameters into the well-trained FCNN, the FCNN will generate the predicted output of both spectral and temporal information. On the other hand, as shown in Fig. 1(a), the PC parameters will also be transmitted to a programmable logic controller (PLC) to control the servo-based automated PC1 shown in Fig. 1(d) for manipulating the experimental laser system. To reduce the random fluctuations induced by the environment during the processes of data collection and model training, as illustrated in Fig. 1(c), the spectrotemporal outputs are also measured and involved in the model. Therefore, a new loss function is constructed by combining the FCNN prediction data, real-time experimental data and target mode-locked pulse data $\left (I,A\right )^\top$ in the spectrotemporal domains. As a result, a spectrotemporal domain-informed deep learning model is constructed to manipulate the reproduction of targeted solitons.

The optimization process is based on the GD algorithm, which can be mathematically represented as follows:

(4)$$ \theta := \theta-\eta\dfrac{\partial L_{_{\mathrm{S-T}}(\theta, \phi)}}{\partial \theta},$$

(5)$$\phi := \phi-\eta\dfrac{\partial L_{_{\mathrm{S-T}}(\theta, \phi)}}{\partial \phi},$$

where $L_{_{\mathrm {S-T}}}$ and $\eta$ represent the spectrotemporal joint loss function and the learning rate (LR) that determines the speed of descent, respectively. “:=” denotes the assignment symbol. Since the GD algorithm is an iterative optimization algorithm, the right-hand sides of both Eq. (4) and Eq. (5) are calculated and then assigned to the respective left-hand sides in each iteration. After adequate training iterations, the best predicted result is defined as:

(6)$$\{\theta^{*}, \phi^{*}\} = {arg}\mathop{\min}_{\theta,\phi}L_{_{\mathrm{S-T}}}(\theta,\phi),$$

where $\theta ^\ast$ and $\phi ^\ast$ are the optimal solutions for the rotation angle $\theta$ and squeeze level $\phi$ of the PC, respectively, and ${arg}\mathop {\min }_{\theta,\phi }L_{_{\mathrm {S-T}}}(\theta,\phi )$ represents the values of arguments $\theta$ and $\phi$ that minimize the function $L_{_{\mathrm {S-T}}}$. To calculate the gradient of $L_{_{\mathrm {S-T}}}$, the well-trained FCNN is involved as an intermediary, facilitating the implementation of BP for gradient computation. BP has been widely used to optimize physical NNs [20] and photonic NNs [11], and its process can be briefly represented as follows:

(7)$$\dfrac{\partial L_{_{\mathrm{S-T}}}(\theta,\phi)}{\partial \theta} = \dfrac{\partial L_{_{\mathrm{S-T}}}(\theta,\phi)}{\partial \mathbf{F}_{_{\mathrm{W-T}}}} \dfrac{\partial \mathbf{F}_{_{\mathrm{W-T}}}}{\partial \theta},$$

(8)$$\dfrac{\partial L_{_{\mathrm{S-T}}}(\theta,\phi)}{\partial \phi} = \dfrac{\partial L_{_{\mathrm{S-T}}}(\theta,\phi)}{\partial \mathbf{F}_{_{\mathrm{W-T}}}} \dfrac{\partial \mathbf{F}_{_{\mathrm{W-T}}}}{\partial \phi},$$

where $\mathbf {F}_{_{\mathrm {W-T}}}$ is the equivalent equation of the well-trained FCNN.

The optimizer depicted in Fig. 1 is developed by incorporating the aforementioned processes and algorithms. Through deductive reasoning and mathematical verification, we have established the mathematical operation of our proposed DL model. Thus, we confidently employ our model to reproduce designated mode-locked pulses.

2.3 Criteria for soliton reproduction

Generally, loss functions in deep learning, such as the mean absolute error (MAE), MSE and Huber loss, are used to evaluate the reproduction performance. Due to the lack of physical background, however, these loss functions cannot capture the inherent features presented in optical spectra and autocorrelation traces. Given the inherent distinctions between spectra and autocorrelation traces, it becomes imperative to employ distinct loss functions to optimize the network’s performance in terms of convergence speed, prediction accuracy and generalization capacity. Therefore, we propose constructing loss functions with physical properties to evaluate the matching quality of mode-locked pulse reproduction.

For the spectral criterion, we propose a novel algorithm, termed “optical spectrum matching absolute deviation variance (OSM-ADV)”. The OSM-ADV loss function, denoted by $S^{2}_{_{\mathrm {OSM-ADV}}}$, is specifically designed to capture key features of optical spectra, such as the positions and spike amplitudes of Kelly sidebands. Mathematically, $S^{2}_{_{\mathrm {OSM-ADV}}}$ is calculated as follows:

(9)$$S^{2}_{_{\mathrm{OSM-ADV}}}(\theta, \phi) = \sigma^{2}\left(\left|\left|\widehat{A}(\theta, \phi)-A\right|\right|\right) = \dfrac{1}{n-1}\displaystyle\sum_{i=1}^{n}\left(\left|\left|\widehat{A}(\theta, \phi)-A\right|\right|_{_{i}}-\overline{\left|\left|\widehat{A}(\theta, \phi)-A\right|\right|}\right)^{2},$$

where $\sigma ^2$, $\widehat {A}\left (\theta,\phi \right )$ and $A$ denote the variance operator, predicted and target spectra, respectively, and $i$ denotes the index of data in the absolute error vector $\left |\left |\widehat {A}\left (\theta,\phi \right )-A\right |\right |$. Compared with the commonly used MSE loss, OSM-ADV offers clearer physical interpretations because it effectively captures the stability of the absolute deviation of spectral amplitudes between the targeted and predicted spectra. By quantifying this stability, OSM-ADV can effectively account for preserving distinct features, e.g., Kelly sidebands, and identifying the whole spectral profile, e.g., spectral distribution features. Crucially, every step in the OSM-ADV calculation is differentiable, including the absolute deviation, squaring, mean and variance between the targeted and measured spectra. Consequently, the entire OSM-ADV loss function is differentiable with respect to $L_{_{\mathrm {S-T}}}\left (\theta,\phi \right )$, which enables the gradient calculation of Eq. (6) during carrying out BP and GD processes.

For the temporal criterion, we utilize the modified Pearson’s correlation coefficient, denoted as modified Pearson’s $r$, as the loss function for temporal pulse reproduction. Mathematically, Pearson’s $r$ is defined as:

(10)$$r_{_{\widehat{I},I}}(\theta, \phi) = \dfrac{\mathrm{Cov}\left[\widehat{I}(\theta, \phi), I\right] }{\sigma\left[\widehat{I}(\theta, \phi)\right] \sigma(I)},$$

where $\mathrm {Cov}$, $\sigma$, $\widehat {I}\left (\theta,\phi \right )$ and $I$ represent the covariance operator, standard deviation operator, predicted and targeted autocorrelation traces, respectively. Pearson’s $r$, which is a linear correlation analysis algorithm, can evaluate the linear correlation between the predicted and targeted autocorrelation traces of the mode-locked pulse. The range of Pearson’s $r$ is from $-1$ to $1$. The values of $-1$ and $1$ indicate a perfect negative and positive linear correlation, respectively, while $0$ represents uncorrelation. To align with the optimized endpoint of the neural network’s loss function, we modified Pearson’s $r$ as follows to improve the evaluation ability in positive correlation between predicted and target autocorrelation traces:

(11)$$\tilde{r}_{_{\widehat{I},I}}(\theta, \phi) =\left|\left|r(\theta, \phi) -1\right|\right|.$$

In conclusion, the OSM-ADV and the modified Pearson’s ${\tilde {r}}_{_{\hat {I},I}}$ loss functions provide a more comprehensive and physically meaningful evaluation of the similarity between the predicted results and the targeted information. Then, the whole spectrotemporal loss function is defined as:

(12)$$L_{_{\mathrm{S-T}}}(\theta,\phi) = C_{_{1}} S^{2}_{_{\mathrm{OSM-ADV}}}(\theta, \phi)+C_{_{2}} \tilde{r}_{_{\widehat{I},I}}(\theta, \phi),$$

where both $C_{_{1}}$ and $C_{_{2}}$ are normalized coefficients of $S^{2}_{_{\mathrm {OSM-ADV}}}(\theta, \phi )$ and ${\tilde {r}}_{_{\hat {I},I}}\left (\theta,\phi \right )$, respectively, for balancing the loss value. If the spectral reference baseline is set at $-m$ dBm in logarithmic coordinate (equivalent to $10^{-m/10}$ mW in linear coordinate) and the target spectrum is set as $X$ in linear coordinate, the absolute error data alternates between 0 and $X-10^{-m/10}$ repeatedly. The variance of the absolute error can reach the maximum value of $\sigma ^{2}(X-10^{-m/10})$. Therefore, we set $C_{_{1}}$ and $C_{_{2}}$ at $1/2$ and $1/\sigma ^{2}(X-10^{-m/10})$, respectively, to balance the magnitudes of the spectrotemporal domain loss functions. If the two loss components are imbalanced, the network may predominantly optimize the information domain with a higher normalized coefficient. Therefore, the balance of these coefficients is essential for the performance of capturing the desired features in both domains.

By incorporating these loss functions into the target reproduction training, our proposed model not only enables the evaluation of specific data differences but also captures a broader range of similarities in the overall trends for achieving more accurate and reliable pulse reproduction. Simultaneously, our spectrotemporal loss functions can be universally applied to all types of mode-locked lasers. These loss functions can effectively evaluate the reproduction performance of various output states.

3. Experimental reproduction of the targeted soliton

The experimental setup, depicted in Fig. 1(d), features a ring cavity mode locked by the nonlinear polarization rotation (NPR) technique. The laser is composed of a 4-meter segment of commercial erbium-doped fiber (EDF, YOFC EDF1022) pumped by a 980-nm laser, incorporating a 200-meter segment of conventional single-mode fiber (SMF) to accumulate nonlinear phases of the pulses for facilitating NPR mode-locking. The NPR mechanism is realized by strategically placing a pair of PC3 and PC4 and a single polarization-dependent isolator (PD-ISO) within the ring cavity, serving as both a mode locker and a filter. Additionally, a Lyot filter, consisting of a 0.25-meter segment of polarization-maintaining fiber (PMF) and two polarization controllers (PC2 and automated PC1), is integrated with the NPR-based filtering effects through a shared PD-ISO. The characteristics of mode-locked pulses can be manipulated by adjusting polarization settings induced by PCs in our NPR-based fiber laser. We aim to fully automatically activate mode-locking and search the target in fiber lasers by adjusting PCs assisted by our FCNN model. On the other hand, the composite filtering composed of NPR filtering and Lyot filtering can help generate diverse pulse patterns. Therefore, we practice reproducing various output states with our FCNN model by using composite filtering, which will validate the model’s ability to distinguish between different pulse states. The total length of the cavity is approximately 207.3 meters. The output of the laser system is obtained from the 10% port of a Y-type coupler. By configuring specific polarization settings and reaching a threshold pump power of 53 mW, we could achieve a self-started mode-locked soliton with inherent Kelly sidebands, which served as the target pulse to be faithfully reproduced.

We employed an OSA (Yokogawa AQ6375B) and an autocorrelator (Femtochrome FR-103XL) with an oscilloscope (Teledyne LeCroy WaveMaster 813Zi-B) for monitoring spectrotemporal signal information. The resolutions of the OSA and the autocorrelation were kept at the highest precision of 0.01 nm and 5 fs during the measurement processes, respectively. The oscilloscope as a screen was used to display the autocorrelation trace of the signal. Real-time data collection was facilitated using a MacBook laptop equipped with an Apple Silicon M1 CPU and 16 GByte of memory. The experimentally acquired data were utilized to construct a comprehensive spectrotemporal database as the training data set for our FCNN. The laptop then exports the predictions of our deep learning model to the PLC shown in Fig. 1(a), enabling the adjustment of the automated PC1 shown in Fig. 1(d) to ensure that the experimental spectrotemporal output aligns with the designated target for the precise reproduction of the mode-locked soliton state through the aforementioned automatic control algorithm.

We commenced the training for the laser model by collecting spectra and autocorrelation traces within the PC1 adjustment ranges of $0^\circ \le \theta \le {150}^\circ$ and $-5^\circ \le \phi \le 5^\circ$, respectively, with a step size of 1$^{\circ }$ for each degree of freedom. The acquired data were divided into a training data set (80%) used for training the FCNN and a test data set (20%) employed to evaluate the training performance. To address the issue of overfitting in deep learning, we employed dropout regularization during the training process. By randomly deactivating the neurons in the FCNN with a typical probability $p$ value of 0.5 [21], we aimed to prevent the model from relying too heavily on specific features or memorizing the training data [22]. This technique helps promote better generalization and reduces the risk of overfitting, enhancing the model’s ability to perform well on unseen data. The training of the FCNN was terminated when the training error reached a value below ${10}^{-7}$. The FCNN achieved a prediction accuracy of 99.0237% for the training data set and 95.3450% for the testing data set, indicating a well-trained FCNN.

Figures 2(a) and (d) illustrate the evolution of the 49 optimal cases in the process of searching and reproducing the targeted mode-locked pulses. Figure 2(a) depicts the evolution of the spectra during the search and reproduction processes, exhibiting a clear trend from longer wavelengths to shorter wavelengths due to the target spectrum being biased at the short wavelength band centered at 1542 nm. Figure 2(d) depicts the corresponding evolution of the autocorrelation traces, demonstrating the progression of the pulse peak intensity from weak to strong and the pulse width from broad to narrow. The provided target for pulse reproduction is represented by the red boxes or curves in Figs. 2(a), (b), (c) and (d), while the deep blue boxes or curves depict the tracked and captured experimental results through our proposed algorithm. These two algorithms employed in our system have a unique advantage in accurately evaluating spectrotemporal domain information, which are much different from the reported algorithms in the field of AI. Therefore, we additionally calculated the MSE loss to facilitate intuitive comparison during the optimization process, as shown in Fig. 2(e). The minimum MSE is $3.99 \times 10^{-5}$.

Fig. 2. The reproduction of a conventional soliton. (a) Spectral matching process. (b) Spectra. (c) Autocorrelation traces. (d) Temporal matching process. (e) Error curve.

Download Full Size | PDF

Figures 2(b) and (c) present the spectra and autocorrelation traces, respectively, corresponding to PC1 state index numbers of 0, 20, 40 and 49. When the index number is 0, the spectrum already exhibits Kelly sidebands, and the autocorrelation trace indicates the generation of a soliton. Since the central wavelength and intensity of the autocorrelation trace differ from the target, our reproduction system does not consider it as the final result. When the index number is 20, the central wavelength of the experimentally obtained soliton has been manipulated to a spectral region close to the target. Because there are differences in the spectral profile, spectrum width, pulse width and pulse intensity, it is still not identified as the target. At index number 40, the obtained pulse exhibits a high similarity with the target in terms of the spectral trend, central wavelength and pulse width. However, its MSE loss, as shown in Fig. 2(e), is only $7.31\times {10}^{-5}$, resulting in the tracked result not being considered the best result. This indicates that our proposed OSM-ADV algorithm can capture subtle differences in the spectra, while Pearson’s correlation coefficient, as the pulse similarity analysis algorithm, can capture differences in pulse intensity. Finally, the reproduction process halts at index number 49, corresponding to the 237th epoch. The subsequent 500 epochs can be regarded as the optimal reproduction results with a high degree of similarity in the whole spectral and temporal pulse parameters. Specifically, the spectrotemporal MSE loss is $3.99\times {10}^{-5}$ for the best reproduction.

To verify the validity of our model, we further reproduced another pulse pattern, dual-wavelength solitons. As shown in Fig. 3(a), The autocorrelation trace of dual-wavelength solitons has a small pedestal similar to that of noise-like pulses. However, due to the constraint of the spectral information, our model did not misjudge noise-like signals as reproducible objects. Then, the autocorrelation trace and spectrum of reproduced dual-wavelength solitons in orange is shown in Fig. 3, which have a much high similarity with the target in blue. The reproduction loss defined by MSE is $8.23\times 10^{-5}$. Thus, two different reproduction cases in Figs. 2 and 3 show that the validity and accuracy of our model.

Fig. 3. The reproduction of dual-wavelength solitons. (a) Autocorrelation trace. (b) Optical spectrum.

Download Full Size | PDF

4. Discussion

To verify the physical completeness of our reproduction approach, we will demonstrate how to achieve the reproduction of authentic mode-locked pulses. In accordance with the Fourier theorem, an arbitrary mode-locked pulse can be defined in a complex exponential summation form. As shown in the stereogram in Fig. 4(b), the best matched output mode-locked pulse, denoted as $\boldsymbol{\varPsi}(t)$ (depicted as the deep blue time-intensity graph in Fig. 4(b)), is actually composed of a linear superposition of monochromatic signals with different frequencies (represented by a series of sinusoidal curves in Fig. 4(b) ranging from purple to red in spectral order). Consequently, $\boldsymbol{\varPsi}(t)$ can be expressed as:

(13)$$\boldsymbol{\varPsi}(t)=\sum_{k=1}^{N}A_{_{k}} \exp\left[i\left(2\pi\nu_{_{k}}t+\varphi_{_{k}}\right)\right].$$

Fig. 4. The structure of PPA. (a) The physics-informed phase retrieval process. (b) The data-informed pulse output state measurement process.

Download Full Size | PDF

As depicted in Fig. 4(b), Eq. (13) characterizes the temporal amplitude $\boldsymbol{\varPsi}(t)$ of the mode-locked optical pulse field with $N$ distinct frequency components, in which each component is defined by individual amplitudes $A_{_{k}}$, frequencies $\nu _{_{k}}$ and initial phase $\varphi _{_{k}}$. The distribution of $A_{_{k}}^2$ at $\nu _{_{k}}$ measured by the OSA represents the spectral information of the pulse. The initial phase $\varphi _{_{k}}$ of each frequency component corresponds to the horizontal offset of that component’s signal, ranging from 0 to $2\pi$. For a single monochromatic sinusoid signal, $\varphi _{_{k}}$ determines the positions of its peaks and valleys. When the phases cyclically vary as the frequency increases or decreases, the superposition of these frequency components can produce high-intensity peaks at specific positions, thereby achieving mode-locked pulses.

As illustrated in Fig. 4(b), we performed an autocorrelation measurement to obtain the temporal data of $\boldsymbol{\varPsi}(t)$. The autocorrelation trace represents the self-convolution of $\boldsymbol{\varPsi}(t)$ and is obtained using an autocorrelator. An oscilloscope was also involved to measure the autocorrelation trace $I(t)$, which can be represented as:

(14)$$I(t) = \left|\left|\int_{-\infty}^{+\infty}\boldsymbol{\varPsi}(\tau-t)\boldsymbol{\varPsi}^{*}(\tau)\mathrm{d}\tau\right|\right|^{2}.$$

We can therefore obtain the information of $\boldsymbol{\varPsi}(t)$ from $I(t)$. Thus far, all items in Eq. (13) have been revealed except for the initial phase $\varphi _{_{k}}$ for each frequency component. However, the direct experimental measurement of $\varphi _{_{k}}$ faces challenges. Insufficient phase information can lead to incomplete pulse reproduction. To address this issue, we propose a novel single-layer perceptron model, referred to as the phase perceptron algorithm (PPA).

As shown in Fig. 4(a), we constructed the PPA model by using 6666 sampling points of spectral data. The model’s weights were set as the frequency $\omega _{_{k}}=2\pi \nu _{_{k}}$, and the corresponding scaling factor was set as the amplitude $A_{_{k}}$. The biases were optimized by setting them as the parameter column $\varphi _{_{k}}$. However, the mathematical expression for the pulse phase cannot be directly deduced in terms of the spectra and autocorrelation traces. We can numerically calculate the phase distribution by an iterative phase-retrieval algorithm. The phase-retrieval process was experimentally carried out as follows: the phase-retrieval algorithm was to first introduce an initial phase for constructing an optical pulse field based on the experimentally achieved spectral data, and then to compare with the error of the similarity between the theoretically calculated autocorrelation trace and the experimental result. When the error arrived at a given target, the iterative calculation process would be terminated. The perceptron output, $\boldsymbol{\varPsi}(t)$, reconstructs the mode-locked pulse signal, which can be described by Eq. (13). To measure the discrepancy between the predicted and actual autocorrelation traces, we constructed a loss function based on the temporal loss proposed in Eq. (11), which is the modified Pearson’s ${\tilde {r}}_{_{\hat {I},I}}$. We used the complex exponential $\exp \left (ix\right )$ as the nonlinear activation function and employed BP and GD to optimize the parameter column $\varphi _{_{k}}$ according to the loss function. The optimization process enabled our proposed PPA model to achieve phase retrieval. And we also incorporated a dropout regularization into the PPA framework. Specifically, we randomly deactivated 20% neurons of PPA every iteration and recover their avidities subsequently. This innovative approach serves to mitigate the impact of high-amplitude components on the final results, thereby guaranteeing equal involvement of all spectral components. After optimization, we obtained the retrieved phase distribution schematically represented in Fig. 4(b).

By feeding with the experimental autocorrelation trace data and optical spectrum data of the best matched result, PPA is capable of retrieving the corresponding initial phase distribution of the measured mode-locked soliton. It is crucial to clarify that while frequency-resolved optical gating (FROG) is primarily employed to retrieve the temporal phase distribution, PPA is specifically designed for the retrieval of the initial spectral phase distribution.

According to Fig. 5(b), PPA directly introduces the best reproduced spectral data as the initial invariant parameter of the perceptron. On the other hand, as shown in Fig. 5(a), the corresponding autocorrelation trace data are introduced into the perceptron as fitting objects. Therefore, the orange, green and blue curves in Fig. 5(i) show the autocorrelation signals reconstructed from the same optical spectrum data (Fig. 5(b)) after phase distribution retrieval (Figs. 5(c), (d) and (e)) using PPA. Simultaneously, Figs. 5(c), (d) and (e) show periodic features in the initial phase distributions, which indicate the generation of mode-locked pulses. The initial phases determine the positions of spectral peaks and valleys within the mode-locked pulses. Figures 5(f), (g) and (h) show zoom-in images of Figs. 5(c), (d) and (e) for the initial phase periods of 0.298 nm, 0.833 nm and 1.000 nm, respectively. In other words, Figs. 5(c), (d) and (e) accordingly contain 84, 30, and 25 times of initial spectral phase periods. Figure 5(i) shows three different PPA-retrieved autocorrelation traces, which have evident differences in phase periodicity and spectral phase profile due to different training iterations. The red, orange, green and blue dashed lines in Fig. 5(i) represent the symmetry axes of the experimental autocorrelation trace, best PPA-reconstructed autocorrelation signal, and other PPA-reconstructed autocorrelation cases of 1 and 2, respectively. Notably, the best reconstructed autocorrelation signal is consistent with the targeted soliton in terms of the envelope profile and symmetry axis, thereby achieving reconstruction.

Fig. 5. Results of the PPA. (a) The best experimental reproduced autocorrelation trace. (b) The best experimental reproduced optical spectrum. (c) The best retrieved phase distribution. (d) and (e) Retrieved phase distributions with the same spectral information. (f), (g), and (h) Details of the retrieved initial phase distribution. (i) Comparison between the reconstructed autocorrelation signals and the best reproduced autocorrelation trace.

Download Full Size | PDF

It is worth noting that PPA operates without specific constraints. The fidelity of the phase retrieval will be enhanced for larger sampling points from the experimental instrument, higher data smoothness and reduced data noise. In essence, PPA is not limited to any particular pulse type and can effectively perform phase retrieval as long as autocorrelation data and spectral information are accessible. However, our model is only valid for the reproduction of a single polarized pulse state currently. According to a pulsation operation containing a series of pulses with different polarization states, our model is not accordingly trained for effectively reproducing mode-locked pulsation operation.

5. Conclusion

We have successfully reproduced targeted mode-locked solitons by spectrotemporal domain-informed deep learning. A robust reproduction approach is proposed, i.e., combining the OSM-ADV algorithm in the spectral domain and the modified Pearson’s ${\tilde {r}}_{_{\hat {I},I}}$ in the temporal domain, for evaluating the similarity of the spectra and the autocorrelation trace, respectively. The experimental results demonstrate perfect reproduction, with a total MSE of $3.99\times {10}^{-5}$. Furthermore, our designed PPA model enables the precise retrieval of the phase distribution of the measured pulses by using experimental optical spectra and autocorrelator traces, which validates the physical completeness of our proposed deep learning reproduction algorithm. Our research contributes to a deep understanding of the dynamic properties of mode-locked pulses, enabling advancements in various fields, including ultrafast laser technology, optical communication and nonlinear optics.

Funding

Department of Education of Guangdong Province (2021ZDJS105); National Natural Science Foundation of China (62375053); Research Fund of Department of Science and Technology of Guangdong Province (2020B1212030010).

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. D. Strickland, “Nobel lecture: Generating high-intensity ultrashort optical pulses,” Rev. Mod. Phys. 91(3), 030502 (2019). [CrossRef]

2. Y. K. Chembo, D. Brunner, M. Jacquot, et al., “Optoelectronic oscillators with time-delayed feedback,” Rev. Mod. Phys. 91(3), 035006 (2019). [CrossRef]

3. C. Markos, J. C. Travers, A. Abdolvand, et al., “Hybrid photonic-crystal fiber,” Rev. Mod. Phys. 89(4), 045003 (2017). [CrossRef]

4. S. S. Harilal, M. C. Phillips, D. H. Froula, et al., “Optical diagnostics of laser-produced plasmas,” Rev. Mod. Phys. 94(3), 035002 (2022). [CrossRef]

5. M. Rowley, P.-H. Hanzard, A. Cutrona, et al., “Self-emergence of robust solitons in a microcavity,” Nature 608(7922), 303–309 (2022). [CrossRef]

6. J. S. Peng, S. Boscolo, Z. H. Zhao, et al., “Breathing dissipative solitons in mode-locked fiber lasers,” Sci. Adv. 5(11), eaax1110 (2019). [CrossRef]

7. R. Iwami, T. Mihana, K. Kanno, et al., “Controlling chaotic itinerancy in laser dynamics for reinforcement learning,” Sci. Adv. 8(49), eabn8325 (2022). [CrossRef]

8. T. K. Zhou, W. Wu, J. Z. Zhang, et al., “Ultrafast dynamic machine vision with spatiotemporal photonic computing,” Sci. Adv. 9(23), eadg4391 (2023). [CrossRef]

9. G. Carleo, I. Cirac, K. Cranmer, et al., “Machine learning and the physical sciences,” Rev. Mod. Phys. 91(4), 045002 (2019). [CrossRef]

10. G. Wetzstein, A. Ozcan, S. Gigan, et al., “Inference in artificial intelligence with deep optics and photonics,” Nature 588, 39–47 (2020). [CrossRef]

11. S. Pai, Z. H. Sun, T. W. Hughes, et al., “Experimentally realized in situ backpropagation for deep learning in photonic neural networks,” Science 380(6643), 398–404 (2023). [CrossRef]

12. U. Andral, R. Si Fodil, F. Amrani, et al., “Fiber laser mode locked through an evolutionary algorithm,” Optica 2(4), 275–278 (2015). [CrossRef]

13. U. Andral, J. Buguet, R. Si Fodil, et al., “Toward an autosetting mode-locked fiber laser cavity,” J. Opt. Soc. Am. B 33(5), 825–833 (2016). [CrossRef]

14. D. G. Winters, M. S.. Kirchner, S. J. Backus, et al.Electronic initiation and optimization of nonlinear polarization evolution mode-locking in a fiber laser,” Opt. Express 25(26), 33216–33225 (2017). [CrossRef]

15. Z. Li, S. S. Yang, Q. Xiao, et al., “Deep reinforcement with spectrum series learning control for a mode-locked fiber laser,” Photonics Res. 10(6), 1491–1500 (2022). [CrossRef]

16. X. Q. Wu, J. S. Peng, S. Boscolo, et al., “Intelligent breathing soliton generation in ultrafast fiber lasers,” Laser Photonics Rev. 16, 2100191 (2022). [CrossRef]

17. G. Q. Pu, L. L. Yi, L. Zhang, et al., “Intelligent programmable mode-locked fiber laser with a human-like algorithm,” Optica 6(3), 362–369 (2019). [CrossRef]

18. G. Q. Pu, L. L. Yi, L. Zhang, et al., “Intelligent control of mode-locked femtosecond pulses by time-stretch-assisted real-time spectral analysis,” Light: Sci. Appl. 9(1), 13 (2020). [CrossRef]

19. Q. Q. Yan, Q. H. Deng, J. Zhang, et al., “Low-latency deep-reinforcement learning algorithm for ultrafast fiber lasers,” Photonics Res. 9(8), 1493–1501 (2021). [CrossRef]

20. L. G. Wright, T. Onodera, M. M. Stein, et al., “Deep physical neural networks trained with backpropagation,” Nature 601(7894), 549–555 (2022). [CrossRef]

21. N. Srivastava, G. Hinton, A. Krizhevsky, et al., “Dropout: A simple way to prevent neural networks from overfitting,” J. Mach. Learn. Res. 15, 1929–1958 (2014). [CrossRef]

22. G. E. Hinton, N. Srivastava, A. Krizhevsky, et al., “Improving neural networks by preventing co-adaptation of feature detectors,” arXiv, arXiv:1207.0580 (2012). [CrossRef]

Reproduction of mode-locked pulses by spectrotemporal domain-informed deep learning

Abstract

1. Introduction

2. Principle of reproduction

2.1 Training for the laser model

2.2 Training for mode-locked pulse reproduction

2.3 Criteria for soliton reproduction

3. Experimental reproduction of the targeted soliton

4. Discussion

5. Conclusion

Funding

Disclosures

Data availability

References

Data availability

Cited By

Figures (5)

Equations (14)

Optics Express