Deep learning control of THz QCLs

Benedikt Limbacher; Benedikt Limbacher; Sebastian Schoenhuber; Sebastian Schoenhuber; Martin Alexander Kainz; Martin Alexander Kainz; Nicolas Bachelard; Aaron Maxwell Andrews; Aaron Maxwell Andrews; Hermann Detz; Hermann Detz; Gottfried Strasser; Gottfried Strasser; Juraj Darmo; Karl Unterrainer; Karl Unterrainer

doi:10.1364/OE.430679

1. Introduction

Classical simulations, usually based on assumptions and simplifications, are employed in order to map complex systems to an analytical model. Real world problems are rarely completely mapped or only with great computational effort. In a digital experiment, where the physical component is replaced by a predictive model, no assumptions are required, which enables it to map the real world behavior, including effects that would be neglected in simulations. In contrast to classical simulations, which require details about the experiment geometry, boundary conditions and a sound mathematical description of the involved physics, but not a single measurement, the machine-learning based digital experiment only requires a measured data set of the parameter space.

Machine learning based solutions in photonics are rapidly gaining momentum. They are successfully used for controlling and modelling lasers [1,2], all-optical object recognition [3,4] and neuromorphic photonics [5,6]. Machine learning is successfully employed for designing photonic structures [7]. In conjunction with the development of Terahertz (THz) Quantum Cascade Lasers (QCLs) it was for example recently shown that machine learning methods can complement and perhaps even replace the finite-elements method for certain problems [8]. Another area of photonics, where machine learning is already successfully used in ultra-fast photonics [9].

An example of a complex system, which is difficult to map by using classical simulations are THz QCLs, which are efficient and powerful sources of coherent terahertz light [10]. Recently thermo-electrically cooled terahertz quantum cascade lasers [11] with operating temperatures up to 250 K [12] have been presented, allowing for portable systems. A sub-class of these lasers are the Terahertz Quantum Cascade Random Lasers (QCRLs) [13]. State-of-the-art QCRLs exhibit broadband, multi-mode terahertz emission with a collimated far field, making them ideal candidates for spectroscopic applications [14–16]. Molecular absorption lines result from transitions between vibrational and rotational states of molecules and lie within the THz spectral region. Molecules such as HCN or NO$_{2}$ exhibit their rotational transitions at 2.30 THz and 2.32 THz [17] and hence can be identified by THz spectroscopy.

Contrary to conventional Quantum Cascade (QC) resonator structures, such as ridge- or micro-disk resonators, which feature periodic Fabry-Perot and whispering gallery modes, the random laser is based on feedback by multiple scattering at scattering centers that are randomly distributed among the active region [18]. Considering the passive resonator only, such a cavity is broadband and not frequency selective, since the optical feedback is quasi-continuously broadband [19].

However, emission spectra of the THz QCRL exhibit discrete laser lines only. This is due to non-linear interaction between individual lasing modes that arise from the multiple scattering. The modes compete for the available gain in the spatial ("spatial hole burning") and the spectral ("mode competition") domains, which selects a set of modes.

The tuning of QCLs has previously been achieved by tuning the cavity refractive index via temperature [10], employing external cavities [20–23], and using micro-mechanical systems [24,25]. Recently published results show, that QCRLs may also be tuned by illumination with near infrared light (NIR) [26]. The interplay between the modes can be influenced by spatially shaping NIR illumination of the QCRL and thus the scattering conditions of the resonator structure. By optical control of the QCRL, a specific laser mode can be selectively enhanced until a single-mode spectrum is achieved. However, this iterative procedure requires repeated feedback from a FTIR spectrometer, which is fed into an optimization algorithm to shape the NIR modulation pattern. A typical optimization routine requires measuring dozens of spectra, which is very time consuming. In fact, every single desired spectrum requires remeasuring a large amount of spectra to achieve the desired output, hence the field of practical applications is limited.

To overcome this vexing limitation of the truly broadband random resonator, we demonstrate predictive deep learning models, which once trained are able to precisely predict the QCRL’s behavior. The rapid and instantaneous feedback of the predictive models significantly accelerates possible applications, such as the generation of designed output spectra. To modulate the QCRL towards specific output spectra, tuning of the hyperparameter space is necessary. While this process is inconvenient and time consuming for real experiments, as it requires restarting the experiment for each set of parameters, the digital experiment easily enables the tuning of the optimization process due to its high speed. The digital experiment allows the modulation of any NIR modulation pattern, thus enabling remote adaption of the output spectra.

For the fast generation of the desired THz spectra, we make use of a direct predictive approach by applying a convolutional neural network (CNN) [27]. For the exploration of boundaries of the physical system, an iterative approach demonstrates the potential of the digital experiment.

These results pave the way for real-time emission spectrum modulation of QCRLs with high flexibility, which enables advanced spectroscopic applications.

2. Methods

2.1 General

The THz beam is emitted from a QCRL cryogenically cooled to 10 K. The active region design is based on a three-well phonon depopulation, bound-to-continuum $\mathrm {Al_{0.15}Ga_{0.85}As}$ heterostructure as introduced by Amanti et al. [28]. The epitaxial layer has a thickness of 13 µm and consists of 340 repetitions of the active region as well as a 100 nm highly doped contact layer on the bottom. The device is fabricated as double-metal waveguide structure with a disk diameter of 500 µm. Etched holes with diameters of 20 µm act as random scatterers. They are distributed such that the filling fraction is 18%. The QCRL is operated in pulsed mode with a repetition rate of 10 kHz and a pulse width of 4 µs. The bias current density at a voltage of 11.8 V is 350 A cm⁻².

The modulation is achieved by expanding a 810 nm NIR laser beam with a continuous wave power of 950 mW to a diameter of 1 cm. The beam is directed onto a spatial light modulator (SLM), which spatially modulates the wave front of the beam in such a way that the desired image is created at the focal plane, where the QCRL is located. The diameter of the pattern in the focal plane is slightly larger than the diameter of the QCRL (500 µm), which is compensated by the pattern generation. For the projection phase-masks are calculated with the Gerchberg-Saxton algorithm [29]. This operation is performed on a computer, which also controls the experiment. The THz beam, emitted by the QCRL is directed into a Bruker Vertex 80 Fourier Transform Infrared (FTIR) spectrometer, which records the emission spectrum of the QCRL. The resolution of the FTIR spectrometer is set to 3 GHz. An illustration of the setup is depicted in Fig. 1.

Fig. 1. Illustration of the measurement setup used for data generation and validation. A 950 mW near infrared laser beam with a 810 nm wavelength is expanded and directed onto a spatial light modulator (SLM), which controls the phase front of the beam. The light reflected from the SLM is projected on the THz Quantum Cascade Random Laser (QCRL) using a lens. As a result, the image, imposed by the SLM on the light beam, is projected on the surface of the QCRL. Simultaneously, the emission spectrum of the QCRL is measured by using a Fourier-Transform Infrared (FTIR) spectrometer. The experiment is controlled by a computer, which continuously generates patterns and stores the corresponding spectra.

Download Full Size | PDF

During the data generation phase the computer continuously generates random patterns and displays them with the SLM. The pattern is projected onto the QCRL. As a consequence the interaction of the NIR, which is coupled through the holes in the device, with the lasing material significantly changes the local electrical permittivity of the laser device and thus the scattering conditions responsible for the mode formation, which leads to strong changes in the occurring modes [26,30]. The resulting spectrum is recorded by the FTIR spectrometer and stored on the computer. This procedure is repeated $N$ times, where $N$ is the number of samples and the data set consisting of $N$ random patterns and their corresponding QCRL emission spectra is stored.

Two different approaches are compared. The first approach is an iterative approach, which in combination with an optimizer is capable of intrinsically capturing the limits of the system but is limited to a one-dimensional column-wise modulation due to the complexity of the system. The complexity of this one-dimensional approach is given by the 16 utilized columns and the gray-scale-depth of 256 values, which results in more than $10^{38}$ possible patterns. Increasing the number of degrees of freedom by e.g. moving to a two-dimensional scheme leads to a poor performance of the optimization algorithm [31]. The second approach is a direct predictive approach, which is capable of handling the complexity of two-dimensional modulation, but does not provide limits, which is why we complement it with an auto-encoder based anomaly detection [32,33]. The complexity of the two-dimensional approach is given by $16\times 16=256$ pixels with a gray-scale-depth of 256 values, resulting in an astoundingly high number of more than $10^{616}$ possible patterns. As in this approach no optimization is employed, the amount of possible degrees of freedom can be significantly increased. Obviously for neither approach the whole set can be measured, which fortunately is not required, as for both approaches a sub-sample, which significantly describes the whole space is sufficient.

2.2 Iterative approach

In the training phase of the iterative approach an artificial neural network (ANN) is presented with the column weight vectors, which are a compressed representation of the NIR illumination patterns, and with the corresponding emission spectra. The column weight vector contains 16 values, each of which is a 8-bit weighting factor for a corresponding column of the $16\times 16$ Hadamard matrix. The artificial neural network learns to predict the QCRL behavior for different applied patterns by running through hundreds of training steps, which are commonly called epochs. During training the weight and bias values of the ANN are optimized by using the back propagation algorithm [34], such that the network improves its predictions over the course of the training cycle. Thereby, an Adam optimizer [35] with a learning rate of $10^{-3}$ and first and second momentum decay rates of $0.9$ and $0.999$ respectively is employed. After training the ANN is capable of predicting the emission spectrum for a given column weight vector. An optimizer is used to iteratively approximate the desired spectrum. This approach requires hundreds of neural network predictions. The main advantage of this approach is that it intrinsically captures the limits of the real device.

The ANN consists of three layers (see Fig. 2(b)). The first layer, which acts as the input layer has 16 neurons, one neuron for each column weight vector value. The hidden layer consists of 128 neurons with leaky rectified linear activation functions [36], which help to reduce the vanishing gradient problem and introduce non-linearities into the ANN. The output layer has 60 neurons with linear activation functions. Each of the output layer neurons corresponds to a unique frequency in the predicted spectrum.

Fig. 2. Iterative approach: (a) Schematic illustration of the pattern generation algorithm. The desired spectrum is translated into a cost function and fed into the optimizer, which iteratively creates column weight vectors, presents them to the ANN, and returns the predicted spectra. After convergence the resulting vector is returned by the optimizer, and is converted into an illumination pattern. This pattern is projected onto the THz Quantum Cascade Random Laser (QCRL), which changes its emission spectrum accordingly. (b) Sketch of the ANN. The input layer, which accepts the column weight vector has 16 neurons, one for each column weight vector entry. The hidden layer consists of 128 neurons with leaky rectified linear activation functions. The output layer has 60 neurons with linear activation functions. Each output neuron corresponds to a unique frequency in the predicted spectrum.

Download Full Size | PDF

A data set consisting of 1325 pairs of illumination patterns, generated from column weight vectors and the corresponding spectra is recorded. For each sample the column weight vector is converted to an illumination pattern, which is subsequently converted to a phase-mask by using the Gerchberg-Saxton algorithm. The phase-mask is displayed by the SLM, which modulates the phase-front of the NIR accordingly. As the pattern is projected on the QCRL, the emission THz spectrum is modulated, and measured by the FTIR spectrometer. Thereby the FTIR measurement is the slowest step taking between 20 and 30 seconds per measurement. 10% of the data set, equaling 133 pairs, is split off for testing of the model. Another 10% of the remaining 1192 samples, resulting in 119 samples are used for evaluation during the training of the ANN. The remaining 1073 samples are used for training. The ANN is trained for 200 epochs. The input column weight vectors do not have to be normalized as they already only consist of values between 0 and 1. The spectra on the other hand are Z-score normalized for each frequency. The loss function used for training is mean squared error. The ANN training converges already after 50 epochs. Overfitting is successfully prevented by dropout layers and L2 kernel regularization [37,38]. As a result, the validation loss and training loss are very similar. After training the ANN is evaluated on the 10% test set.

The trained ANN can be used to quickly generate the desired output spectra. A scheme of the spectrum generation process is depicted in Fig. 2(a). To generate a desired spectrum, a cost function is derived from it (e.g. maximize the intensity at a certain frequency) and fed into a Nelder-Mead downhill simplex optimizer [39]. The optimizer gets feedback from the ANN, which predicts the QCRL behavior and subsequently adapts the modulation pattern. After convergence the pattern and the predicted spectrum are returned.

The behavior of the ANN is defined by its weight and bias values. These values are fit during the training process and are finite. Our ANN is bounded-input, bounded-output stable, which means that for a bounded input the output is also bounded. The output bounds are defined by the weight and bias values and are implicitly learned by the neural network. This is the reason that the neural network is able to model the limitations of the real system. For the iterative approach, where the ANN is a direct model of the QCRL behavior under NIR modulation, no additional steps have to be taken to ensure realistic outputs.

2.3 Direct predictive approach

For an even faster generation of lasing spectra, that is also capable of leveraging the full two-dimensional surface of the QCRL the neural network is trained to return the NIR illumination pattern directly. This approach scales much better than the iterative approach at the cost of not capturing the limitations of the device. The neural network returns a NIR illumination pattern for any desired spectrum and is validated by using an auto-encoder based anomaly detection.

For this approach no optimizer is required. Instead, the NIR illumination pattern is directly generated by the ANN. This is especially beneficial when dealing with larger parameter spaces such as in two-dimensional patterns. Here a Convolutional Neural Network (CNN) [27] is used, which directly generates modulation patterns for any given spectrum (see Fig. 3(a)). CNNs are commonly used for object recognition tasks, as they are very efficient in learning patterns with a spatial structure, such as our two-dimensional NIR illumination patterns. To further enhance the effect of the random modulation patterns Perlin noise [40], a pseudo-randomly generated noise is used. Perlin noise is the ideal candidate for this task, as it resembles the spatial distribution of the modes in the QCRL cavity, leading to a higher impact of the NIR illumination on the output spectra. Additionally, Perlin noise has a lower information entropy than white noise, which enables the CNN to learn its structure, effectively reducing the parameter space, while preserving the two-dimensional nature of the data. Perlin noise leads to a $2.7$ times higher variance in the spectral data when doing random modulation compared to uniform noise due to the higher spatial coherence of Perlin noise.

Fig. 3. Direct predictive approach: (a) Schematic of the pattern generation algorithm. After checking the validity of the desired spectrum, the spectrum is fed into the convolutional neural network (CNN), which directly returns the corresponding NIR illumination pattern. The pattern is projected onto the QCRL, which emits the desired spectrum. (b) Illustration of the convolutional neural network used for pattern generation. The numbers above the layers indicate their shape. The CNN accepts a desired spectrum and predicts the corresponding NIR illumination pattern. The kernel size of the hidden convolutional layer is $2\times 2$, whereas the kernel size of the output layer is $4\times 4$.

Download Full Size | PDF

The CNN consists of 60 input nodes, one for each frequency value in the spectrum. The second layer is a 64 node dense layer with rectified linear activation functions. The third layer is an $8\times 8$ 2D convolutional layer with a kernel size of $2\times 2$ pixels, $16$ filters and likewise rectified linear activation functions. Symmetric zero-padding is used in order to retain the image size. After the third layer an up-sampling layer is added to scale the image to the desired format of $16 \times 16$ pixels. The output layer is another 2D convolutional layer with a $4\times 4$ pixel kernel, a total size of $16\times 16$ pixels, symmetric zero padding and only one filter (gray-scale). The stride size for all employed convolutional layers is $1\times 1$. The topology of the CNN is depicted in Fig. 3(b).

For training, an Adam optimizer [35] with a learning rate of $5\cdot 10^{-4}$ and first and second momentum decay rates of $0.9$ and $0.999$ respectively is employed. The batch-size for the training is 32.

A data set containing 696 pairs of Perlin noise patterns and corresponding QCRL spectra is recorded. Again 10% are split-off for after training evaluation and another 10% are used for during training validation of the model. The CNN is trained for 200 epochs, but also already shows convergence after 50 epochs. No critical overfitting is observed, eliminating the need for dropout layers and kernel regularization.

In contrast to the iterative approach, the direct prediction with a CNN only requires one evaluation. The desired spectrum (see dashed line in Fig. 5) is fed into the ANN. The network is able to return a NIR pattern that modulates the QCL accordingly. The main advantage of this method is its speed. Predictions are performed within one evaluation only, which takes less than 10 ms on state of the art computer systems. By using this approach a desired spectrum can be realized within a second, which includes the calculation of the phase-mask and displaying it on the SLM. Note that this approach does not intrinsically capture the limits of the QCRL. The CNN will always provide a NIR illumination pattern, even if the desired spectrum is not achievable. This can be negated by adding an auto-encoder based anomaly detection to the system. Therefore an auto-encoder neural network [41] with one hidden layer (10 neurons) is trained to perform a lossy compression of pre-processed, measured spectra and to recreate the spectra as closely as possible. The recreation error is small for spectra that are similar to the real spectra and larger for spectra, which are very different from the measured spectra. A T-Test performed on a set of generated spectra, based on real spectra and similar looking, but randomly generated spectra, yields a statistical significance ($p \ll 0.01$) of the results produced by this approach.

3. Results

For the iterative approach the calculated mean absolute error for the 133 tested pairs of patterns and spectra is $6.3$ intensity points per frequency. Considering that a typical mode peak in the spectrum has a height between 100 and 300 intensity points this is fairly low. This error is mostly accumulated at the edges of the spectra and small portions of the spectrum that are not properly predicted. The major peaks and their behaviors are very well predicted by the ANN, which is confirmed by a mean Pearson correlation between the real and predicted spectra of 99.1%.

In addition to comparing the performance of the ANN to previously recorded data it is even more interesting to perform a full optimization cycle. Therefore optimization routine is used to enhance a specific mode of the original QCRL spectrum by choosing a cost function $cost_{max}(k_0) = 1/(\mathrm {max}\ S(k_0))$, where $S(k)$ is the predicted spectrum and $k_0$ is the selected mode. Starting from an initial condition, the selected mode at 2.302 THz is subsequently enhanced in its intensity for 300 iterations. Figure 4(a) shows the predicted column weight vector for a maximization of the mode at 2.302 THz. Figure 4(b) shows the corresponding predicted spectrum (dashed line). To confirm the predicted output spectra under illumination with the simulated NIR illumination pattern, the pattern is projected onto the QCRL and the spectrum is measured. The experimental data (solid line in Fig. 4(b)) is in excellent agreement with the predicted spectrum. The neural network is capable of predicting the absolute intensities and position of the occurring modes in the emission spectrum. Thus, one can conclude that the ANN correctly learns the mode dynamics as well as the out-coupling efficiency of the respective modes. Note that the ANN is evaluated 300 times and each significant error would lead to an accumulation as each optimization step depends on the previous one. It can therefore be concluded that the presented method is very accurate and stable.

Fig. 4. Column weight vectors generated by using the iterative approach and spectra after 500 optimizer iterations. (a) Column wise predicted normalized NIR illumination of the QCRL to maximize the mode at 2.302 THz. (b) The dotted line shows the predicted spectrum, maximizing the mode at 2.302 THz as indicated by the red arrow. The solid green line shows the spectrum, which is measured when projecting the predicted pattern on the real device. (c) Column wise predicted normalized NIR illumination of the QCRL to minimize the mode at 2.302 THz. (d) The dotted line shows the predicted spectrum, minimizing the mode at 2.302 THz, indicated by the red arrow. The solid green line shows the spectrum, which is measured when projecting the predicted pattern on the real device.

Download Full Size | PDF

The optimization routine is not limited to enhancing modal lines in the spectrum. The suppression of lasing lines can be advantageous for self-referenced spectroscopy. The minimization is realized by choosing an appropriate cost function $cost_{min}(k_0) = \mathrm {max}\ S(k_0)$.

The results of the mode minimization and maximization are depicted in Fig. 4. The optimization routine returns the column weight vector (c), which leads to a modulated spectrum as depicted in (dö). The optimization routine works so well that the mode completely vanishes. The comparison of the predicted and measured spectra demonstrates that the ANN is capable of predicting both intensities and frequencies of the occurring modes. While the cost function favors a real single mode spectrum, the ANN predicts that the actual emission spectrum consists of at least two modes, which cannot be suppressed independently. The complex nonlinear interactions between the lasing modes hinders the single mode operation at 2.302 THz. Gain competition and spatial hole burning lead to the fact that only certain configurations are possible under the prevailing conditions. To achieve a single mode spectrum in this case a (free) parameter must be changed: either the current is reduced, so that all existing modes are weaker, or the magnitude of the NIR modulation is increased.

Evaluating the direct predictive approach is more difficult as it does not return spectra that can be directly graded by a metric but merely illumination patterns. During the training and by testing on the separate testing data set a significant reduction in mean-absolute error between the predicted and original NIR illumination patterns is observed. The mean-absolute error is reduced from 64 to 16. From the presented data the ANN learns which regions impact which modes and therefore only illuminates the necessary regions, which is in strong contrast to the random patterns. As a result the predicted illumination patterns look different than the corresponding ground-truth in many cases. The Pearson correlation between the pattern pairs is only about 0.5. Nonetheless the illumination patterns returned by the CNN yield the desired results as can be seen by two examples shown in Fig. 5. Thereby the dashed lines represent the desired spectra and the solid lines the actually measured ones for the returned illumination patterns, which are shown as an inset.

Fig. 5. Desired spectra (dashed lines) and their measured counterparts (solid lines) for two-dimensional NIR illumination. The insets show the NIR illumination patterns, generated by using the direct predictive approach. The QCRL outline is indicated by the black circle.

Download Full Size | PDF

4. Conclusion

We find that deep learning models are well suited to predict the emission spectra of QCRLs. A spatially modulated NIR laser with a continuous wave output power of 950 mW is used to modify a THz QCRL with an emission spectrum around 2.3 THz. Pairs of randomly generated NIR patterns and THz spectra are recorded and subsequently deep learning models are trained on the stored data. An iterative approach, which intrinsically captures the limits of the experiment, allows the generation of modulation patterns within less than 10 seconds. A direct predictive approach, which generates predicted NIR patterns for a desired spectrum within less than 100 ms is trained on 696 pairs and is complemented by an auto-encoder based anomaly detection for spectrum verification. We show that by using these methods the complete elimination or maximization of a mode can be achieved. The mean correlation between predicted and the corresponding real spectra is above 99%. As these methods are both fast and highly flexible, they may be useful for applications in spectroscopy and sensing.

Funding

Horizon 2020 Framework Programme (840745); Austrian Science Fund (P30709, W1210, W1243); Österreichische Forschungsförderungsgesellschaft (FFG 849614); Air Force Office of Scientific Research (FA9550-17-1-0340).

Acknowledgments

The authors would like to thank the developers of the Keras [42] and Tensorflow [43] libraries, as they were used for implementing the ANNs.

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. D. Teixidor, M. Grzenda, A. Bustillo, and J. Ciurana, “Modeling pulsed laser micromachining of micro geometries using machine-learning techniques,” J. Intell. Manuf. 26(4), 801–814 (2015). [CrossRef]

2. A. Kokhanovskiy, A. Ivanenko, S. Kobtsev, S. Smirnov, and S. Turitsyn, “Machine Learning Methods for Control of Fibre Lasers with Double Gain Nonlinear Loop Mirror,” Sci. Rep. 9(1), 2916 (2019). [CrossRef]

3. S. Jiao, J. Feng, Y. Gao, T. Lei, Z. Xie, and X. Yuan, “Optical machine learning with incoherent light and a single-pixel detector,” Opt. Lett. 44(21), 5186 (2019). [CrossRef]

4. B. Limbacher, S. Schoenhuber, M. Wenclawiak, M. A. Kainz, A. M. Andrews, G. Strasser, J. Darmo, and K. Unterrainer, “Terahertz optical machine learning for object recognition,” APL Photonics 5(12), 126103 (2020). [CrossRef]

5. J. K. George, A. Mehrabian, R. Amin, J. Meng, T. F. de Lima, A. N. Tait, B. J. Shastri, T. El-Ghazawi, P. R. Prucnal, and V. J. Sorger, “Neuromorphic photonics with electro-absorption modulators,” Opt. Express 27(4), 5181 (2019). [CrossRef]

6. L. Mennel, J. Symonowicz, S. Wachter, D. K. Polyushkin, A. J. Molina-Mendoza, and T. Mueller, “Ultrafast machine vision with 2D material neural network image sensors,” Nature 579(7797), 62–66 (2020). [CrossRef]

7. W. Ma, Z. Liu, Z. A. Kudyshev, A. Boltasseva, W. Cai, and Y. Liu, “Deep learning for the design of photonic structures,” Nat. Photonics 15(2), 77–90 (2021). [CrossRef]

8. P. Tang, X. Chi, B. Chen, and C. Wu, “Predictions of resonant mode characteristics for terahertz quantum cascade lasers with distributed feedback utilizing machine learning,” Opt. Express 29(10), 15309–15326 (2021). [CrossRef]

9. G. Genty, L. Salmela, J. M. Dudley, D. Brunner, A. Kokhanovskiy, S. Kobtsev, and S. K. Turitsyn, “Machine learning and applications in ultrafast photonics,” Nat. Photonics 15(2), 91–101 (2021). [CrossRef]

10. B. S. Williams, “Terahertz quantum-cascade lasers,” Nat. Photonics 1(9), 517–525 (2007). [CrossRef]

11. M. A. Kainz, M. P. Semtsiv, G. Tsianos, S. Kurlov, W. T. Masselink, S. Schönhuber, H. Detz, W. Schrenk, K. Unterrainer, G. Strasser, and A. M. Andrews, “Thermoelectric-cooled terahertz quantum cascade lasers,” Opt. Express 27(15), 20688–20693 (2019). [CrossRef]

12. A. Khalatpour, A. K. Paulsen, C. Deimert, Z. R. Wasilewski, and Q. Hu, “High-power portable terahertz laser systems,” Nat. Photonics 15(1), 16–20 (2021). [CrossRef]

13. R. Köhler, A. Tredicucci, F. Beltram, H. E. Beere, E. H. Linfield, A. G. Davies, D. A. Ritchie, R. C. Iotti, and F. Rossi, “Terahertz semiconductor-heterostructure laser,” Nature 417(6885), 156–159 (2002). [CrossRef]

14. S. Biasco, L. Li, E. H. Linfield, A. G. Davies, and M. S. Vitiello, “Multimode, aperiodic terahertz surface-emitting laser resonators,” in Photonics, vol. 3 (Multidisciplinary Digital Publishing Institute, 2016), p. 32.

15. Y. Zeng, G. Liang, B. Qiang, K. Wu, J. Tao, X. Hu, L. Li, A. G. Davies, E. H. Linfield, H. K. Liang, Y. Zhang, Y. Chong, and Q. J. Wang, “Two-dimensional multimode terahertz random lasing with metal pillars,” ACS Photonics 5(7), 2928–2935 (2018). [CrossRef]

16. S. Schoenhuber, M. Brandstetter, T. Hisch, C. Deutsch, M. Krall, H. Detz, A. M. Andrews, G. Strasser, S. Rotter, and K. Unterrainer, “Random lasers for broadband directional emission,” Optica 3(10), 1035–1038 (2016). [CrossRef]

17. I. Gordon, L. Rothman, C. Hill, R. Kochanov, Y. Tan, P. Bernath, M. Birk, V. Boudon, A. Campargue, K. Chance, B. Drouin, J.-M. Flaud, R. Gamache, J. Hodges, D. Jacquemart, V. Perevalov, A. Perrin, K. Shine, M.-A. Smith, J. Tennyson, G. Toon, H. Tran, V. Tyuterev, A. Barbe, A. Császár, V. Devi, T. Furtenbacher, J. Harrison, J.-M. Hartmann, A. Jolly, T. Johnson, T. Karman, I. Kleiner, A. Kyuberis, J. Loos, O. Lyulin, S. Massie, S. Mikhailenko, N. Moazzen-Ahmadi, H. Müller, O. Naumenko, A. Nikitin, O. Polyansky, M. Rey, M. Rotger, S. Sharpe, K. Sung, E. Starikova, S. Tashkun, J. V. Auwera, G. Wagner, J. Wilzewski, P. Wcisło, S. Yu, and E. Zak, “The hitran2016 molecular spectroscopic database,” J. Quant. Spectrosc. Radiat. Transfer 203, 3–69 (2017). [CrossRef]

18. D. S. Wiersma, “The physics and applications of random lasers,” Nat. Phys. 4(5), 359–367 (2008). [CrossRef]

19. S. Schoenhuber, M. Wenclawiak, M. Kainz, B. Limbacher, A. Andrews, H. Detz, G. Strasser, J. Darmo, and K. Unterrainer, “Scattering strength dependence of terahertz random lasers,” J. Appl. Phys. 125(15), 151611 (2019). [CrossRef]

20. A. W. M. Lee, B. S. Williams, S. Kumar, Q. Hu, and J. L. Reno, “Tunable terahertz quantum cascade lasers with external gratings,” Opt. Lett. 35(7), 910 (2010). [CrossRef]

21. C. A. Curwen, J. L. Reno, and B. S. Williams, “Terahertz quantum cascade VECSEL with watt-level output power,” Appl. Phys. Lett. 113(1), 011104 (2018). [CrossRef]

22. C. A. Curwen, J. L. Reno, and B. S. Williams, “Broadband continuous single-mode tuning of a short-cavity quantum-cascade VECSEL,” Nat. Photonics 13(12), 855–859 (2019). [CrossRef]

23. S. Biasco, H. E. Beere, D. A. Ritchie, L. Li, A. G. Davies, E. H. Linfield, and M. S. Vitiello, “Frequency-tunable continuous-wave random lasers at terahertz frequencies,” Light: Sci. Appl. 8(1), 43 (2019). [CrossRef]

24. N. Han, A. de Geofroy, D. P. Burghoff, C. W. I. Chan, A. W. M. Lee, J. L. Reno, and Q. Hu, “Broadband all-electronically tunable MEMS terahertz quantum cascade lasers,” Opt. Lett. 39(12), 3480 (2014). [CrossRef]

25. T. Alam, M. Wienold, X. Lü, K. Biermann, L. Schrottke, H. T. Grahn, and H.-W. Hübers, “Wideband, high-resolution terahertz spectroscopy by light-induced frequency tuning of quantum-cascade lasers,” Opt. Express 27(4), 5420–5432 (2019). [CrossRef]

26. S. Schönhuber, N. Bachelard, B. Limbacher, M. A. Kainz, A. M. Andrews, H. Detz, G. Strasser, J. Darmo, S. Rotter, and K. Unterrainer, “All-optical adaptive control of quantum cascade random lasers,” Nat. Commun. 11(1), 5530 (2020). [CrossRef]

27. K. Fukushima, “Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position,” Biol. Cybernetics 36(4), 193–202 (1980). [CrossRef]

28. M. I. Amanti, G. Scalari, R. Terazzi, M. Fischer, M. Beck, J. Faist, A. Rudra, P. Gallo, and E. Kapon, “Bound-to-continuum terahertz quantum cascade laser with a single-quantum-well phonon extraction/injection stage,” New J. Phys. 11(12), 125022 (2009). [CrossRef]

29. R. W. Gerchberg and W. O. Saxton, “A Practical Algorithm for the Determination of Phase from Image and Diffraction Plane Pictures,” Optik 35, 237–246 (1972).

30. N. Bachelard, J. Andreasen, S. Gigan, and P. Sebbah, “Taming random lasers through active spatial control of the pump,” Phys. Rev. Lett. 109(3), 033903 (2012). [CrossRef]

31. L. Han and M. Neumann, “Effect of dimensionality on the Nelder–Mead simplex method,” Optim. Methods Softw. 21(1), 1–16 (2006). [CrossRef]

32. M. A. Kramer, “Nonlinear principal component analysis using autoassociative neural networks,” AIChE J. 37(2), 233–243 (1991). [CrossRef]

33. S. Hawkins, H. He, G. Williams, and R. Baxter, “Outlier Detection Using Replicator Neural Networks,” in International Conference on Data Warehousing and Knowledge Discovery, (Springer, Berlin Heidelberg, 2002), pp. 170–180.

34. D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning representations by back-propagating errors,” Nature 323(6088), 533–536 (1986). [CrossRef]

35. D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” arXiv:1412.6980 [cs] (2017).

36. A. L. Maas, A. Y. Hannun, and A. Y. Ng, “Rectifier Nonlinearities Improve Neural Network Acoustic Models,” Proceedings of Machine Learning Research p. 6 (2013).

37. G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. R. Salakhutdinov, “Improving neural networks by preventing co-adaptation of feature detectors,” arXiv:1207.0580 [cs] (2012). ArXiv: 1207.0580.

38. C. Cortes, M. Mohri, and A. Rostamizadeh, “L2 Regularization for Learning Kernels,” arXiv:1205.2653 [cs, stat] (2012). ArXiv: 1205.2653.

39. J. A. Nelder and R. Mead, “A Simplex Method for Function Minimization,” The Comput. J. 7(4), 308–313 (1965). [CrossRef]

40. K. Perlin, “An image synthesizer,” ACM SIGGRAPH Comput. Graph. 19(3), 287–296 (1985). [CrossRef]

41. D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning internal representations by error propagation,” Tech. rep., California Univ San Diego La Jolla Inst for Cognitive Science (1985).

42. F. Chollet, “Keras,” https://keras.io (2015).

43. M. Abadi, A. Agarwal, P. Barham, E. Brevdo, Z. Chen, C. Citro, G. S. Corrado, A. Davis, J. Dean, M. Devin, S. Ghemawat, I. Goodfellow, A. Harp, G. Irving, M. Isard, Y. Jia, R. Jozefowicz, L. Kaiser, M. Kudlur, J. Levenberg, D. Mané, R. Monga, S. Moore, D. Murray, C. Olah, M. Schuster, J. Shlens, B. Steiner, I. Sutskever, K. Talwar, P. Tucker, V. Vanhoucke, V. Vasudevan, F. Viégas, O. Vinyals, P. Warden, M. Wattenberg, M. Wicke, Y. Yu, and X. Zheng, “TensorFlow: Large-scale machine learning on heterogeneous systems,” https://www.tensorflow.org (2015).

Deep learning control of THz QCLs

Abstract

1. Introduction

2. Methods

2.1 General

2.2 Iterative approach

2.3 Direct predictive approach

3. Results

4. Conclusion

Funding

Acknowledgments

Disclosures

Data availability

References

Data availability

Cited By

Figures (5)

Optics Express

Benedikt Limbacher	https://orcid.org/0000-0003-0885-541X
Martin Alexander Kainz	https://orcid.org/0000-0002-6504-5862
Aaron Maxwell Andrews	https://orcid.org/0000-0002-5790-2588
Hermann Detz	https://orcid.org/0000-0002-4167-3653
Gottfried Strasser	https://orcid.org/0000-0003-0147-0883