Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Turbulence aberration correction for vector vortex beams using deep neural networks on experimental data

Open Access Open Access

Abstract

The vector vortex beams (VVB) possessing non-separable states of light, in which polarization and orbital angular momentum (OAM) are coupled, have attracted more and more attentions in science and technology, due to the unique nature of the light field. However, atmospheric transmission distortion is a recurring challenge hampering the practical application, such as communication and imaging. In this work, we built a deep learning based adaptive optics system to compensate the turbulence aberrations of the vector vortex mode in terms of phase distribution and mode purity. A turbulence aberration correction convolutional neural network (TACCNN) model, which can learn the mapping relationship of intensity profile of the distorted vector vortex modes and the turbulence phase generated by first 20 Zernike modes, is well designed. After supervised learning plentiful experimental samples, the TACCNN model compensates turbulence aberration for VVB quickly and accurately. For the first time, experimental results show that through correction, the mode purity of the distorted VVB improves from 19% to 70% under the turbulence strength of D/r0 = 5.28 with correction time 100 ms. Furthermore, both spatial modes and the light intensity distribution can be well compensated in different atmospheric turbulence.

© 2020 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

The scalar vortex modes with a helical term of exp(ilφ) carrying orbital angular momentum (OAM), have drawn increasingly attention since pioneered by Allen and associates [1], where l is referred to as the topological charge and describes the number of the 2π helical phase winding in one wavelength. The integer topological charge l of OAM provides a degree of freedom for a large research fields with a myriad of applications in optical communication [2], quantum optics and information [3], optical tweezers and micromechanics [4], remote sensing [5], and so on (see [68] and reference therein). Furthermore, a similar but distinguishing mode, the vector vortex beams (VVB) [9] with non-separable states of light in which polarization and OAM are coupled, have also been studied in various disciplines. Similar to the scalar vortex beams with a phase singularity and a helical wave-front, the polarization singularity exists in the axis of the VVBs with an inhomogeneous polarization distribution. The vector beams especially cylindrical vector vortex (CVV) beams can be applied to sharp focus for high-resolution imaging [10], increased channel capacity for optical communication [1114], rotating body remote sensing detection [15] and so on (see [16] and reference therein). Furthermore, it has also been inferred numerically that VVB analogous to partially coherent beams, are more resilient in turbulence [17].

In practice, nonhomogeneous media such as atmospheric turbulence, oceanic turbulence, biological tissue, strongly effects the optical transmission. Non-uniformity of medium refractive index will distort the helical wave-front and decrease the energy of received light field, resulting in the crosstalk between different modes. Amounts of studies have been done in assessing the atmospheric turbulence effects on vortex beams [1822] and vector beams [2328]. The irradiance pattern, scintillation index, degree of polarization, and OAM spectrum of the beams in an atmospheric turbulence with different turbulence strength have been studied numerically and experimentally. In order to improve the robustness of the vortex beams against the non-uniform media, many methods have been employed. The adaptive optics (AO) system is an effective approach, and common algorithms are Gerchberg-Saxton (GS) [29], stochastic-parallel-gradient-descent (SPGD) algorithm [30]. However, existing algorithms generally belong to search algorithms taking lots of iterations to acquire global optimal results. Hence, reducing the latency and improving the correction ability in strong turbulence are still challenges. As well as, to our best knowledge, the aberration correction for vector beams on experimental data have not been investigated.

As the powerful interdisciplinary science combined with the mathematics, computer and biology science, machine learning, including deep learning, has become an important field. Deep Neural Network (DNN) [31] uses multi-layered artificial neural networks implemented with a computer to perform advanced tasks, comparable to or even superior than the performance of human experts. Recently deep learning has made major advances in medical image analysis [32], speech recognition [33], image classification [34], and inverse imaging problems [3541]. In addition, deep learning methods have potential applications in phase retrieval in image [42] and communication [4344].

In this scenario, we propose and investigate a turbulence aberration correction convolutional neural network (TACCNN) model for compensation of distorted VVB in terms of intensity profile and spatial modes purity, which respectively corresponding to the polarization distribution and polarization topological charge [4546]. To compensate the distortion for the vector beams, the distorted intensity captured by the CCD camera and the first 20 Zernike coefficients generating turbulence phase, are regarded as the studying samples. After supervised training, the TACCNN model can be trained to generate compensation phase screen according to the intensity distribution. After compensation, the phase distortion is decreased significantly in different atmosphere turbulences and the mode purity of the distorted vector beams can be improved. And then the intensity profiles get close to the original distribution. Meanwhile, experimental results shown that the TACCNN model can well experimentally predict the compensation phase and have advantages compared with previous work.

2. The vector vortex beams passing through nonhomogeneous media

Light beams can carry total angular momentum (TAM) consist of intrinsic spin angular momentum (SAM) associated with polarization, and OAM associated with the spatial profile of the phase. The vector vortex beams are a linear combination of orthogonal circular polarization scalar vortex optical beams of opposite topological charge [44], where the constituent components are eigenstates of TAM. In the paraxial approximation, a monochromatic generalized VBB has a unique spatial polarization, which can be represented as:

$$|{{\psi_l}} \rangle = \psi _R^l|{{R_l}} \rangle + \psi _L^l|{{L_l}} \rangle ,$$
with respect to the orthonormal circular polarization basis {Rl, Ll} such that
$$|{{R_l}} \rangle = \exp ( - il\varphi )(x + iy)/\sqrt 2 ,$$
$$|{{L_l}} \rangle = \exp ( + il\varphi )(x - iy)/\sqrt 2.$$
Eqs. (2a)–(2b) represent a right circular polarized (RCP) and left circular polarized (LCP) optical vortex of topological charge -l and + l, respectively. When the magnitude of $\psi _R^l$ is equal to that of $\psi _l^l$, Eq. (1) can be transformed as follows:
$$|{{\psi_{p,{\varphi_0}}}} \rangle = \vec{E}({r,\varphi } )= A(r )\cdot \left[ {\begin{array}{{c}} {\cos (p\varphi + {\varphi_0})}\\ {\sin (p\varphi + {\varphi_0})} \end{array}} \right]$$
with A(r) the amplitude distribution, φ the azimuthal angle, and φ0 the initial orientation. p is the index of polarization topological charge (PTC) equal to (l1-l2)/2, where l1 and l2 are the OAM values of the two basic modes respectively. The space-dependent polarization distribution associated with the SAM and OAM, can be characterized by PTC. The PTC is defined as the repetition number of polarization state change along the azimuthal axis, while its sign stands for the rotating direction of the polarization. Due to the vector beam modes with different PTCs are orthogonal to each other, they have also potential to improve the capacity of optical communication.

The atmosphere is largely non-birefringent [46]. As such, the polarization is unaffected during propagation. Let us consider vector modes behind the turbulence described by the φTurb, which state can be expressed as follows:

$$|{{\psi_{p,{\varphi_0}}}} \rangle_{Turb} = \vec{E}({r,\varphi } )\cdot \exp (i{\varphi _{Turb}}) = A(r )\cdot \left[ {\begin{array}{c} {\cos (p\varphi + {\varphi_0})}\\ {\sin (p\varphi + {\varphi_0})} \end{array}} \right] \cdot \exp (i{\varphi _{Turb}})$$
The Fourier expansion form of phase modulation function exp(iφTurb) is equal to $\sum {{B_n}} (\theta )\exp (in\varphi )$, and then Eq. (4) can be written as:
$$\vec{E}({r,{\varphi_{Turb}}} )= A(r )\cdot \sum {{B_n}(\theta )} \left[ {\begin{array}{c} {\exp (i((p + n)\varphi + {\varphi_0})) + \exp ( - i((p - n)\varphi + {\varphi_0}))}\\ {i(\exp (i((p + n)\varphi + {\varphi_0})) - i\exp ( - i((p - n)\varphi + {\varphi_0}))} \end{array}} \right]$$
From Eq. (5), one can see that the distorted VBB are composed of many modes with different orders n, which corresponding to different PTCs. Thus the PTC is dispersed, reducing mode purity. Then, we can acquire the distortion intensity pattern in the far field
$$I{(z)_{Turb}} = {f_z}(\vec{E}{({r,\varphi } )_{turb}}),$$
where f is the Fresnel propagation function for distance z. As shown in Fig. 1, the intensity patterns in vertical polarization and horizontal polarization directions behind turbulent atmosphere are distorted. Meanwhile the polarization distribution is also distorted. Both of the turbulence effect on the intensity and polarization distribution lead to poor performance in optical communication, super resolution imaging and so on. Thus, turbulence aberration correction is essential in practice.

 figure: Fig. 1.

Fig. 1. (a) The intensity profile of CVV beams at the waist in different polarized direction. The polarization topological charges equal to 1. (b) Atmospheric turbulence phase screen using Zernike polynomials with D / r0 value respectively in 1.0, 2.3, 3.74, 5.28, 6.9. (c) The intensity patterns in different polarized directions respectively after turbulent atmosphere.

Download Full Size | PDF

Then the distorted phases caused by the atmospheric turbulence can be described as follows:

$${\varphi _{Turb}} = F({I{{(z)}_{Turb}}} ),$$
where F is the inverse transform of the Fresnel propagation function f. According to Eqs. (6)–(7), provided that the mapping F and the distorted intensity ITurb, the compensate screen can be predicted and parameters of the CVV beams can be corrected. But F is hardly solvable, which belongs to an inverse problem of laser transmission without analytic solution in mathematics. Here, we have trained a TACCNN model to match the relationship F. Through learning plentiful experimental samples, TACCNN model optimizes its parameters to approximate the objective function and finally predicts the distorted phase once distorted intensity ITurb is input.

A turbulence phase obeys the Kolmogorov turbulence theory. The atmosphere phase screen here can be generated by the Zernike polynomials following Kolmogorov energy distribution [43], which can be written as:

$${\varphi _{Turb}} = \sum {{a_j}} {Z_j}$$
where coefficient aj and the j-th Zernike polynomial Zj. Zj defined is expressed as [47]:
$${Z_{evenj}} = \sqrt {2(n + 1)} R_n^m(r )\cos (m\theta ),m \ne 0,$$
$${Z_{oddj}} = \sqrt {2(n + 1)} R_n^m(r )\sin (m\theta ),m \ne 0,$$
$${Z_j} = \sqrt {n + 1} R_n^0(r ),m = 0.$$
The analytical definition of radial function $R_n^m(r )$ is given by:
$$R_n^m(r )= \sum\limits_{s = 0}^{(n - m)/2} {\frac{{{{( - 1)}^s}(n - s)!}}{{s![{(n + m)/2 - s} ]![{(n - m)/2 - s} ]!}}} {r^{n - 2s}}$$
where polar coordinates (r, θ) in the pupil plane, the radial degrees n and azimuthal frequency m. The value of n and m are integral and satisfy the following condition: m ≤ n, n - |m| = even. The index j represents both n and m, the relationship described in Table 1.

Tables Icon

Table 1. The relationship of the index j, m and n.

The Zernike coefficients aj characterizing the atmospheric turbulence are not irrelevant. The covariance between two Zernike polynomials zj and zj, with amplitudes aj and aj are satisfying [48]:

$$E({a_j},{a_{j^{\prime}}}) = \frac{{Kzz^{\prime}{\delta _z}\Gamma [{({n + n^{\prime} - 5/3} )/2} ]{{({D/{r_0}} )}^{5/3}}}}{{\Gamma [{({n - n^{\prime} + 17/3} )/2} ]\Gamma [{({n - n^{\prime} + 17/3} )/2} ]\Gamma [{({n - n^{\prime} + 23/3} )/2} ]}}$$
where δz is a logical Kronecker symbol depending upon azimuthal frequencies (m, m’) and Kzz is a factor depending upon the frequency characteristics (n, n’, m) of zj and zj’. D is the system aperture diameter and r0 is the Fired parameter. Turbulence effect is proportional to the value of D/r0. In the process, a diagonal matrix S and a unitary matrix U are acquired by the SVD (singular value decomposition) of the covariance E(aj,aj’). Gaussian random variables B with zero mean and variance given by S is generated, as the coefficients of the Karhunen-Loeve functions. Then the Zernike coefficients can be obtained by UT multiplying B. Figure 1(b) shows the simulated atmospheric phase screen with different D/r0 ratio. Here, only first 1-20 order Zernike coefficient are used, because higher-order Zernike polynomials over 20 take up relatively small components in turbulence atmosphere.

3. Aberration correction using TACCNN

The TACCNN model is constructed based on convolutional neural network (CNN) using the TensorFlow framework. As shown in Fig. 2, the model is composed of 8 convolution layers and a global average pooling layer. The convolution filters have 32, 64, 128, 256 kernels of size 5×5 respectively. The value of filter in each convolution layer is randomly initialized and the convolution layers (from 2th to 6th) are contained with Maxpooling. Meanwhile, each of the first 8th layers combined with an activation function named ReLu. The mean square error (MSE) between the estimated value and actual value of the first 20 Zernike polynomial coefficients is selected as the loss function as follow:

$$L = {\sum\limits_{i = 0}^{20} {({a_{turb}^i - a_{est}^i} )} ^2}/20$$
where aiturb is the actual Zernike coefficients for the i-th sample, aiest is the estimated Zernike coefficients for the i-th sample. Then the weights of the neural network w can be optimized by
$$w = \arg \mathop {\min }\limits_w L = \arg \mathop {\min }\limits_w {\sum\limits_{i = 0}^{20} {({a_{turb}^i - a_{est}^i} )} ^2}/20$$
Then, w are trained to bring lower value of the loss function, until reach the most appropriate values.

 figure: Fig. 2.

Fig. 2. Specific TACCNN framework based on Alexnet. The input is a series of pictures with 224×224×3 pixels and the output is 1×12 Zernike coefficients vector. ReLu, Rectified Linear Unit, a kind of activation function. Maxpooling, a kind of down-sampling method.

Download Full Size | PDF

In TACCNN model, the input is the distorted intensity profile captured by the CCD camera. The output is the Zernike coefficient vector of size 1×20, which are used to characterize the atmospheric turbulence. 42000 pairs of distorted images and their corresponding Zernike coefficients vectors are generated in experiment, which are used as the training set. The training set consists of five kinds of turbulence strengths (D/r0 equal to 1, 1.52, 3.74 5.28, 6.90), each kind has 8000 experimental samples. During the training progress, 1000 pairs of data are used as validation data, which is employed to check the training effect. If the turbulence phase predicted by CNN is different from the ideal turbulence phase, we stop the training and re-adjust the parameters or the model’s structure. Besides, the validation data are used to test the generalization ability of the model and whether overfitting existence. As well as the filters learn how to extract features to match the relationship between the distorted intensity profile and the phase distortion. After training, additional 1,000 pairs of data are employed to assess the performance of the network.

Figure 3(a) presents the crucial loss function in the training. With the increase of the number of iterations, the loss value becomes progressively smaller and then tends to the minimum value. The values of five points A, B, C, D, E on the loss curve reflects trends intuitively. When the iterations increase to 25000, the loss value is already close to 0, which indicates that the predicted value gets close to actual value. This curve implies that TACCNN model can well extract turbulence effect information via learning samples.

 figure: Fig. 3.

Fig. 3. (a) Loss function curve along with the number of training progress. (b) The MSE between the Zernike coefficients with and without the compensation in different turbulence strength on test data. The dotted line represents the MSE before compensation, and the solid line represents the MSE after compensation. Different colors denote different turbulence strengths, and different turbulence realization means different random complex matrixes when computing the turbulence mask [29]. (c) The phase distributions in three kinds of turbulence strengths (D/r0 equal to 3.74, 5.28, 6.9) before and after the turbulence aberration correction.

Download Full Size | PDF

Figure 3(b) shows the compensation results with the test sets in three kinds of turbulence strength (D/r0 equal to 3.74, 5.28, 6.9). The MSE of the Zernike coefficients are zero without the atmospheric turbulence. However, it can be seen that the MSE becomes larger due to the nonuniform of the atmospheric turbulence in practice before compensation. The larger value of D/r0, the larger value of MSE, which means more serious distorted phase. But, after compensation, the MSE is lower, which is even close to the original zero, and the phase distortion is decreased significantly in different atmosphere turbulences, which means that the proposed method is effective in changing atmospheric turbulences.

Figure 3(c) presents the phase distribution before and after turbulence aberration correction. We can see that the phase distribution is disorderly with a large fluctuation. The bigger the turbulence strength, the greater the phase fluctuation. However, the phase distribution after compensation is almost the same as the real phase screen. Especially, it can remove the larger phase fluctuations and return back a relatively smooth phase distribution. This larger phase fluctuation is difficult for traditional algorithms to recover, due to the limitations of iterative algorithms, which may fall into local optimality. This means that the TACCNN model has a better correction ability than the traditional algorithm in term of the larger phase fluctuations. These results indicate that the TACCNN model we proposed has high correction ability in term of large phase fluctuation.

With the help of this model, only one iteration is used to predict the turbulence aberration. For each prediction, the time consumption is ∼100ms when using a workstation computer(Intel(R) Xeon(R) CPU @ 2.30GHz, Kingston 64GB DDR4, NVIDIA Quadro P4000). We believe that when a higher-performance computer and more learning samples or iterations are implemented, the speed can be further improved.

4. Experimental configuration and results analysis

Figure 4 shows that the experimental setup used for TACCNN for VBB. It consisted of three main stages: generation of CVV modes, turbulence using a Spatial Light Modulator (SLM) and finally, correction and detection. In the first stage, Gaussian modes whose wavelength is 1.55µm are generated by a laser diode and collimated into free space with a diameter of 3mm. The CVV mode is generated using a Q-plate in conjunction with the polarized beam splitter (PBS1). Secondly, a SLM (Holoeye, PLUTO-TELCO-013-C) is used to simulate the atmospheric turbulence. Since the SLM only modulates horizontally polarized beams, a polarization in variant arrangement was implemented, as used in [24]. Finally, after propagation, the perturbed CCV modes in a horizontal direction were collected using a lens L1 and CCD1 (Xenics, Bobcat-320-star) is used to capture intensity distributions, which are used as the input of TACCNN model to predict phase distortion. The CCD2 are used to record the perturbed intensity distribution in different polarization direction (with PBS), or the total light field (without PBS). Then the mode purity can be measured by Q-plate and CCD3.

 figure: Fig. 4.

Fig. 4. Experimental configuration. PBS1& PBS2& PBS3& PBS4, polarized beams splitter. Q1, Q-plate. NPBS1& NPBS2& NPBS3, Non-polarization beams splitter. R1, reflector. HWP, half wave plate. SLM, liquid-crystal spatial light modulator. L1&L2, lenses with focal length f = 100 mm. L3, lens with focal length 200 mm. CCD, infrared CCD camera. TACCNN, the well-trained turbulence aberration correction CNN model.

Download Full Size | PDF

Nothing is added on SLM firstly to record the initial intensity distribution. Then, a turbulence phase screen obeying the Kolmogorov turbulence theory is uploaded on SLM. A turbulence phase screen characterized by the Fried coherence length r0 and the structure constant of the refractive index Cn2. The turbulence strength is imposed on the beams, which can be described by D/r0, with the beam diameter at the turbulence D. By this time, the pattern of distorted CVV beams recorded by CCD1 as the input of well-trained TACCNN model, to predict 20 Zernike coefficients corresponding to the turbulence phase screen. Then the compensation screen is added on SLM. We can measure corrective results in terms of the distorted, corrected light pattern and mode purity, through the CCD2 and CCD3.

The far-field intensity distribution without and with turbulence, as well as after compensation, are shown in Fig. 5. The intensities in different polarization directions of VBB with polarization topological charge +1 and +2 are shown respectively. The intensity distributions are distorted after propagating through the turbulence. The tip-tilt aberration arising from atmospheric turbulence and mechanical movement is one of the largest concerns. This form of atmospheric aberration results in a change of the beam propagation direction, which can be seen that the intensity distributions go outside the center white ring. Once the compensation phase is loaded, the turbulence effects will decrease a lot. The intensity distribution after compensation are close to the original intensity. Especially it can degrade position drift of the centroid induced by the tip and tilt mode of Zernike coefficients. The intensity distributions in different polarization directions nearly go inside the white circles. Thus, the well trained TACCNN model can correct the turbulence aberration in terms of the intensity distribution in different polarization directions of the vector beams.

 figure: Fig. 5.

Fig. 5. The distorted intensities of vector vortex beam without and with turbulence, as well as after compensation, in different polarization directions in (Fried parameters r0 = 1mm, and D/ r0= 5.28) turbulence. (a) - (c) with polarization topological charge +1, (d) - (f) with polarization topological charge +2. The white ring is the central ring which is used to compare the offset of the spot.

Download Full Size | PDF

Mode purity is an important parameter describing the cross talk between different spatial mode. The inverse process (exploiting the reciprocity of light) with the Q plate adding the light path before CCD3. Consequently, the CVV beams without atmospheric turbulence are back converted into a Gaussian-like beam (a bright spot in the beam center) and then recorded by CCD3 as shown in the first line of the Fig. 6(a). The normalized intensities, which can be regarded as the mode purity, without compensation, for different turbulence strengths, are sketched in the second line. It can be seen that the bright spot is weak and drift with shape distortion due to the turbulence. And then the third line shows the bright spot after the turbulence correction. The bright spot becomes more bright and center with a round shape.

 figure: Fig. 6.

Fig. 6. (a) The intensities of mode purity without and with turbulence, as well as after compensation. (b) Detected mode purity before and after compensation in various turbulence atmospheres (D/r0 within the scope of a value in 0 - 6.29). The light blue bar represents the mode purity before compensation, and the prink bar represents the incremental mode purity after compensation.

Download Full Size | PDF

For quantitative analysis of the correction effect in terms of mode purity, we compare the power of the bright spot before and after phase compensation, at the original position of the light spot without the turbulence. As shown in Fig. 6(b), the blue bar represents the distorted mode purity while the pink bar of each group stands for the value after the correction. Similar to the MSE change of the Zernike coefficient in terms of the phase distribution, mode purity is lower because of the turbulence. The stronger turbulence, the lower mode purity. After correction, the mode purity becomes higher in different turbulence strength. When D/r0 equals to 3.74, the mode purity increases from 0.12 to 0.82. When D/r0 equals to 5.28, the mode purity increases from 0.19 to 0.70. When D/r0 equals to 6.9, the mode purity increases from 0.05 to 0.35. The results show that the mode purity all increase significantly, which means the TACCNN model has great generalization ability to resist the larger turbulence strength.

5. Discussion

In fact, the TACCNN model containing multiple convolutional layers learns the mapping relationship of intensity profile of the distorted vector vortex modes and the turbulence phase. After supervised learning, the model can acquire the first 20 Zernike coefficients without iterative operation, compared with the traditional algorithm. In order to make clear the response latency of two kinds of phase retrieval algorithms, we compared the computing times of the CNN model and parallel gradient descent (SPGD) algorithm. As shown in Fig. 7, SPGD takes much longer time to complete hundreds of iterations while TACCNN model only requires one iteration. The latency is calculated on a workstation computer. Furthermore, the latency at different turbulence strengths is also the same (100ms). However, the SPGD algorithm approximately costs about 4s to complete 100 iterations. When using the same time about 100ms, SPGD completes so a few iterations that a well performance is hard to acquire. Moreover, the stronger the turbulence, the more iteration times are needed. Thus the TACCNN model has advantage in calibration time in spite of a large amount of training time. In another aspect, not only the TACCNN model has an advantage in terms of computing time, but also we find that it has strong aberration correction capability, especially tilt and offset issues of the intensity distribution, where traditional algorithms perform poorly in these situations due to interference of local optimality. Thus, deep learning can greatly promotes the update of adaptive optics in terms of calibration time and calibration capability.

 figure: Fig. 7.

Fig. 7. The latency of TACCNN model compared with the SPGD under different turbulence realization. The front row is the latency for CNN and the back row is the latency for SPGD algorithm.

Download Full Size | PDF

Despite all above, we think we just do the preliminary work about the performance of deep learning in adaptive optics system for turbulence aberration correction. As well as we believe the TACCNN model can be further optimized by some ways. Firstly, a lot work should be paid to study the performance of other CNNs model (such as VGG-Net, GoogleNet) in adaptive optics, and the performances of different CNN models in terms of calculation time remain to be discussed. Secondly hyperparameters in the network model (including learning rate, batch size and so on) should be further optimized for a lower MSE and a shorter calculation time. Finally, the 224 × 224 × 3 RGB images as intensity measurements can be replaced by the 224 × 224 × 1 grayscale images as TACCNN inputs for smaller data volumes. By these approaches, we believe the computation time will be greatly decreased.

6. Conclusion

We have demonstrated a deep learning based adaptive optics system to recover the turbulence aberrations of the VVB in terms of phase distribution, mode purity as well as intensity distribution. The turbulence aberration correction convolutional neural network (TACCNN) model, containing multiple convolutional layers, have learned the relationship between the intensity profile and the turbulence phase through lots of experimental examples. The turbulence compensation screen can be quickly and accurately acquired once the distorted intensity profile is input. The experimental configuration for TACCNN-based turbulence aberration correction is built up. The experimental results show that the corrected optical mode profile at the receiver are found to be nearly identical to the desired profiles, and the mode purity increase significantly (i.e. from 0.19 to 0.70, when D/r0=5.28). Furthermore, the method can deal with different turbulence strength, and the calculated time can be reduced to 100ms without iterations compared with the traditional algorithms. Thus the TACCNN model has advantage in calibration time and calibration capability. In conclusion, this scheme shows that the present results combining the fields of deep learning and adaptive optics will be great helpful for the structure beams in communication and imaging.

Funding

National Natural Science Foundation of China (11834001, 61905012); CETC Joint Research Foundation (6141B08231125); National Postdoctoral Program for Innovative Talents (BX20190036); China Postdoctoral Science Foundation (2019M650015).

Acknowledgments

The authors acknowledge Dr. Bing Dong, Chun Liu and Boyang, Xing for the helpful discussions.

Disclosures

The authors declare no conflicts of interest.

References

1. L. Allen, M. W. Beijersbergen, R. J. Spreeuw, and J. P. Woerdman, “Orbital angular momentum of light and transformation of Laguerre Gaussian Laser modes,” Phys. Rev. A 45(11), 8185–8189 (1992). [CrossRef]  

2. J. Wang, J. Y. Yang, I. M. Fazal, N. Ahmed, Y. Yan, H. Huang, Y. X. Ren, Y. Yue, S. Dolinar, M. Tur, and A. E. Willner, “Terabit free-space data transmission employing orbital angular momentum multiplexing,” Nat. Photonics 6(7), 488–496 (2012). [CrossRef]  

3. A. Mair, A. Vaziri, G. Weihs, and A. Zeilinger, “Entanglement of the orbital angular momentum states of photons,” Nature 412(6844), 313–316 (2001). [CrossRef]  

4. D. G. Grier, “A revolution in optical manipulation,” Nature 424(6950), 810–816 (2003). [CrossRef]  

5. M. P. J. Lavery, F. C. Speirits, S. M. Barnett, and M. J. Padgett, “Detection of a spinning object using lights orbital angular momentum,” Science 341(6145), 537–540 (2013). [CrossRef]  

6. M. J. Padgett, “Orbital angular momentum 25 years on,” Opt. Express 25(10), 11265–11274 (2017). [CrossRef]  

7. Y. Shen, X. Wang, Z. Xie, C. Min, X. Fu, Q. Liu, M. Gong, and X. Yuan, “Optical vortices 30 years on: OAM manipulation from topological charge to multiple singularities,” Light: Sci. Appl. 8(1), 90 (2019). [CrossRef]  

8. Y. Yang, Q. Zhao, L. Liu, Y. Liu, C. Rosales-Guzmán, and C. Qiu, “Manipulation of Orbital-Angular-Momentum Spectrum Using Pinhole Plates,” Phys. Rev. Appl. 12(6), 064007 (2019). [CrossRef]  

9. Q. Zhan, “Cylindrical vector beams: from mathematical concepts to applications,” Adv. Opt. Photonics 1(1), 1–57 (2009). [CrossRef]  

10. K. S. Youngworth and T. G. Brown, “Focusing of high numerical aperture cylindrical vector beams,” Opt. Express 7(2), 77–87 (2000). [CrossRef]  

11. G. Milione, M. P. J. Lavery, H. Huang, Y. Ren, G. Xie, T. Nguyen, E. Karimi, L. Marrucci, D. A. Nolan, R. Alfano, and A. E. Willner, “4 × 20 Gbit/s mode division multiplexing over free space using vector modes and a q-plate mode (de)multiplexer,” Opt. Lett. 40(9), 1980–1983 (2015). [CrossRef]  

12. G. Milione, T. A. Nguyen, J. Leach, D. A. Nolan, and R. R. Alfano, “Using the nonseparability of vector beams to encode information for optical communication,” Opt. Lett. 40(21), 4887–4890 (2015). [CrossRef]  

13. P. Li, B. Wang, and X. Zhang, “High-dimensional encoding based on classical nonseparability,” Opt. Express 24(13), 15143 (2016). [CrossRef]  

14. Y. Zhao and J. Wang, “High-base vector beam encoding/decoding for visible-light communications,” Opt. Lett. 40(21), 4843–4846 (2015). [CrossRef]  

15. X. Hu, B. Zhao, Z. Zhu, W. Gao, and C. Rosales-Guzmán, “In situ detection of a cooperative target’s longitudinal and angular speed using structured light,” Opt. Lett. 44(12), 3070–3073 (2019). [CrossRef]  

16. C. Rosales-Guzmán, B. Ndagano, and A. Forbes, “A review of complex vector light fields and their applications,” J. Opt. 20(12), 123001 (2018). [CrossRef]  

17. W. Cheng, J. W. Haus, and Q. W. Zhan, “Propagation of vector vortex beams through a turbulence atmosphere,” Opt. Express 17(20), 17829–17834 (2009). [CrossRef]  

18. Y. X. Ren, H. Huang, G. D. Xie, N. Ahmed, Y. Yan, B. I. Erkmen, N. Chandrasekaran, M. P. J. Lavery, N. K. Steinhoff, M. Tur, S. Dolinar, M. Neifeld, M. J. Padgett, R. W. Boyd, J. H. Shapiro, and A. E. Willner, “Atmospheric turbulence effects on the performance of a free space optical link employing orbital angular momentum multiplexing,” Opt. Lett. 38(20), 4062–4065 (2013). [CrossRef]  

19. S. Fu and C. Gao, “Influences of atmospheric turbulence effects on the orbital angular momentum spectra of vortex beams,” Photonics Res. 4(5), B1 (2016). [CrossRef]  

20. M. P. J. Lavery, C. Peuntinger, K. Günthner, P. Banzer, D. Elser, R. W. Boyd, M. J. Padgett, C. Marquardt, and G. Leuchs, “Free-space propagation of high-dimensional structured optical fields in an urban environment,” Sci. Adv. 3(10), e1700552 (2017). [CrossRef]  

21. M. J. Padgett, F. M. Miatto, M. P. J. Lavery, A. Zeilinger, and R. W. Boyd, “Divergence of an orbital-angular-momentum-carrying beam upon propagation,” New J. Phys. 17(2), 023011 (2015). [CrossRef]  

22. M. P. J. Lavery, “Vortex instability in turbulent free-space propagation,” New J. Phys. 20(4), 043023 (2018). [CrossRef]  

23. Y. J. Cai, Q. Lin, H. T. Eyyuboğlu, and Y. Baykal, “Average irradiance and polarization properties of a radially or azimuthally polarized beam in a turbulent atmosphere,” Opt. Express 16(11), 7665–7673 (2008). [CrossRef]  

24. M. A. Cox, C. Rosales-Guzmán, M. P. J. Lavery, D. J. Versfeld, and A. Forbes, “On the resilience of scalar and vector vortex modes in turbulence,” Opt. Express 24(16), 18105 (2016). [CrossRef]  

25. M. A. Cox, C. Rosales-Guzmán, L. Cheng, and A. Forbes, “Modal Diversity for Robust Free-Space Optical Communications,” Phys. Rev. Appl. 10(2), 024020 (2018). [CrossRef]  

26. Y. Gu, O. Korotkova, and G. Gbur, “Scintillation of nonuniformly polarized beams in atmospheric turbulence,” Opt. Lett. 34(15), 2261–2263 (2009). [CrossRef]  

27. C. Wei, D. Wu, C. Liang, F. Wang, and Y. J. Cai, “Experimental verification of significant reduction of turbulence-induced scintillation in a full Poincaré beam,” Opt. Express 23(19), 24331 (2015). [CrossRef]  

28. Y. Gu and G. Gbur, “Reduction of turbulence-induced scintillation by nonuniformly polarized beam arrays,” Opt. Lett. 37(9), 1553 (2012). [CrossRef]  

29. S. Fu, T. Wang, S. Zhang, Z. Zhang, Y. Zhai, and C. Gao, “Non-probe compensation of optical vortices carrying orbital angular momentum,” Photonics Res. 5(3), 251 (2017). [CrossRef]  

30. G. Xie, Y. Ren, H. Huang, M. P. J. Lavery, N. Ahmed, Y. Yan, C. Bao, L. Li, Z. Zhao, Y. Cao, M. Willner, M. Tur, S. J. Dolinar, R. W. Boyd, J. H. Shapiro, and A. E. Willner, “Phase correction for a distorted orbital angular momentum beam using a Zernike polynomials-based stochastic-parallel-gradient-descent algorithm,” Opt. Lett. 40(7), 1197–1200 (2015). [CrossRef]  

31. Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” Nature 521(7553), 436–444 (2015). [CrossRef]  

32. G. Litjens, T. Kooi, B. E. Bejnordi, A. A. A. Setio, F. Ciompi, M. Ghafoorian, J. A. W. M. van der Laak, B. van Ginneken, and C. I. Sánchez, “A survey on deep learning in medical image analysis,” Med. Image Anal. 42, 60–88 (2017). [CrossRef]  

33. A. Graves, A. Mohamed, and G. Hinton, IEEE Conference on Acoustics, Speech and Signal Processing, 6645–6649 (2013).

34. A. Krizhevsky, I. Sutskever, and G. E. Hinton, in Advances in Neural Information Processing Systems (Morgan Kaufmann Publishers, 2012, pp. 1097–1105).

35. U. S. Kamilov, I. N. Papadopoulos, M. H. Shoreh, A. Goy, C. Vonesch, M. Unser, and D. Psaltis, “Learning approach to optical tomography,” Optica 2(6), 517 (2015). [CrossRef]  

36. Y. Rivenson, Z. Göröcs, H. Günaydin, Y. Zhang, H. Wang, and A. Ozcan, “Deep learning microscopy,” Optica 4(11), 1437 (2017). [CrossRef]  

37. K. H. Jin, M. T. McCann, E. Froustey, and M. Unser, “Deep convolutional neural network for inverse problems in imaging,” IEEE Trans. on Image Process. 26(9), 4509–4522 (2017). [CrossRef]  

38. Y. Rivenson, Y. Zhang, H. Gunaydin, D. Teng, and A. Ozcan, “Phase recovery and holographic image reconstruction using deep learning in neural networks,” Light: Sci. Appl. 7(2), 17141 (2018). [CrossRef]  

39. A. Sinha, J. Lee, S. Li, and G. Barbastathis, “Lensless computational imaging through deep learning,” Optica 4(9), 1117 (2017). [CrossRef]  

40. K. Hammernik, T. Klatzer, E. Kobler, M. P. Recht, D. K. Sodickson, T. Pock, and F. Knoll, “Learning a variational network for reconstruction of accelerated MRI data,” Magn. Reson. Med. 79(6), 3055–3071 (2018). [CrossRef]  

41. Y. Rivenson, H. Ceylan Koydemir, H. Wang, Z. Wei, Z. Ren, H. Günaydın, Y. Zhang, Z. Göröcs, K. Liang, D. Tseng, and A. Ozcan, “Deep learning enhanced mobile-phone microscopy,” ACS Photonics 5(6), 2354–2364 (2018). [CrossRef]  

42. Y. Zhang, C. Wu, Y. Song, K. Si, Y. Zheng, L. Hu, J. Chen, L. Tang, and W. Gong, “Machine learning based adaptive optics for doughnut-shaped beam,” Opt. Express 27(12), 16871–16881 (2019). [CrossRef]  

43. Q. Tian, C. Lu, B. Liu, L. Zhu, X. Pan, Q. Zhang, L. Yang, F. Tian, and X. Xin, “DNN-based aberration correction in a wavefront sensorless adaptive optics system,” Opt. Express 27(8), 10765–10776 (2019). [CrossRef]  

44. J. Liu, P. Wang, X. Zhang, Y. He, X. Zhou, H. Ye, Y. Li, S. Xu, S. Chen, and D. Fan, “Deep learning based atmospheric turbulence compensation for orbital angular momentum beam distortion and communication,” Opt. Express 27(12), 16671–16688 (2019). [CrossRef]  

45. G. Milione, H. I. Sztul, D. A. Nolan, and R. R. Alfano, “Higher-order Poincare sphere, stokes parameters, and the angular momentum of light,” Phys. Rev. Lett. 107(5), 053601 (2011). [CrossRef]  

46. B. Ndagano, I. Nape, M. A. Cox, C. Rosales-Guzman, and A. Forbes, “Creation and Detection of Vector Vortex Modes for Classical and Quantum Communication,” J. Lightwave Technol. 36(2), 292–301 (2018). [CrossRef]  

47. R. J. Noll, “Zernike polynomials and atmospheric turbulence,” J. Opt. Soc. Am. 66(3), 207–211 (1976). [CrossRef]  

48. N. Roddier, “Atmospheric wavefront simulation and Zernike polynomials,” Proc. SPIE 1237, 668–679 (1990). [CrossRef]  

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (7)

Fig. 1.
Fig. 1. (a) The intensity profile of CVV beams at the waist in different polarized direction. The polarization topological charges equal to 1. (b) Atmospheric turbulence phase screen using Zernike polynomials with D / r0 value respectively in 1.0, 2.3, 3.74, 5.28, 6.9. (c) The intensity patterns in different polarized directions respectively after turbulent atmosphere.
Fig. 2.
Fig. 2. Specific TACCNN framework based on Alexnet. The input is a series of pictures with 224×224×3 pixels and the output is 1×12 Zernike coefficients vector. ReLu, Rectified Linear Unit, a kind of activation function. Maxpooling, a kind of down-sampling method.
Fig. 3.
Fig. 3. (a) Loss function curve along with the number of training progress. (b) The MSE between the Zernike coefficients with and without the compensation in different turbulence strength on test data. The dotted line represents the MSE before compensation, and the solid line represents the MSE after compensation. Different colors denote different turbulence strengths, and different turbulence realization means different random complex matrixes when computing the turbulence mask [29]. (c) The phase distributions in three kinds of turbulence strengths (D/r0 equal to 3.74, 5.28, 6.9) before and after the turbulence aberration correction.
Fig. 4.
Fig. 4. Experimental configuration. PBS1& PBS2& PBS3& PBS4, polarized beams splitter. Q1, Q-plate. NPBS1& NPBS2& NPBS3, Non-polarization beams splitter. R1, reflector. HWP, half wave plate. SLM, liquid-crystal spatial light modulator. L1&L2, lenses with focal length f = 100 mm. L3, lens with focal length 200 mm. CCD, infrared CCD camera. TACCNN, the well-trained turbulence aberration correction CNN model.
Fig. 5.
Fig. 5. The distorted intensities of vector vortex beam without and with turbulence, as well as after compensation, in different polarization directions in (Fried parameters r0 = 1mm, and D/ r0= 5.28) turbulence. (a) - (c) with polarization topological charge +1, (d) - (f) with polarization topological charge +2. The white ring is the central ring which is used to compare the offset of the spot.
Fig. 6.
Fig. 6. (a) The intensities of mode purity without and with turbulence, as well as after compensation. (b) Detected mode purity before and after compensation in various turbulence atmospheres (D/r0 within the scope of a value in 0 - 6.29). The light blue bar represents the mode purity before compensation, and the prink bar represents the incremental mode purity after compensation.
Fig. 7.
Fig. 7. The latency of TACCNN model compared with the SPGD under different turbulence realization. The front row is the latency for CNN and the back row is the latency for SPGD algorithm.

Tables (1)

Tables Icon

Table 1. The relationship of the index j, m and n.

Equations (16)

Equations on this page are rendered with MathJax. Learn more.

| ψ l = ψ R l | R l + ψ L l | L l ,
| R l = exp ( i l φ ) ( x + i y ) / 2 ,
| L l = exp ( + i l φ ) ( x i y ) / 2 .
| ψ p , φ 0 = E ( r , φ ) = A ( r ) [ cos ( p φ + φ 0 ) sin ( p φ + φ 0 ) ]
| ψ p , φ 0 T u r b = E ( r , φ ) exp ( i φ T u r b ) = A ( r ) [ cos ( p φ + φ 0 ) sin ( p φ + φ 0 ) ] exp ( i φ T u r b )
E ( r , φ T u r b ) = A ( r ) B n ( θ ) [ exp ( i ( ( p + n ) φ + φ 0 ) ) + exp ( i ( ( p n ) φ + φ 0 ) ) i ( exp ( i ( ( p + n ) φ + φ 0 ) ) i exp ( i ( ( p n ) φ + φ 0 ) ) ]
I ( z ) T u r b = f z ( E ( r , φ ) t u r b ) ,
φ T u r b = F ( I ( z ) T u r b ) ,
φ T u r b = a j Z j
Z e v e n j = 2 ( n + 1 ) R n m ( r ) cos ( m θ ) , m 0 ,
Z o d d j = 2 ( n + 1 ) R n m ( r ) sin ( m θ ) , m 0 ,
Z j = n + 1 R n 0 ( r ) , m = 0.
R n m ( r ) = s = 0 ( n m ) / 2 ( 1 ) s ( n s ) ! s ! [ ( n + m ) / 2 s ] ! [ ( n m ) / 2 s ] ! r n 2 s
E ( a j , a j ) = K z z δ z Γ [ ( n + n 5 / 3 ) / 2 ] ( D / r 0 ) 5 / 3 Γ [ ( n n + 17 / 3 ) / 2 ] Γ [ ( n n + 17 / 3 ) / 2 ] Γ [ ( n n + 23 / 3 ) / 2 ]
L = i = 0 20 ( a t u r b i a e s t i ) 2 / 20
w = arg min w L = arg min w i = 0 20 ( a t u r b i a e s t i ) 2 / 20
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.