Closed loop, DM diversity-based, wavefront correction algorithm for high contrast imaging systems

Amir Give’on; Ruslan Belikov; Stuart Shaklan; Jeremy Kasdin

doi:10.1364/OE.15.012338

1. Introduction

The problem of direct detection of exoplanets requires high-contrast imaging of a dim point source (the planet) appearing adjacent to a much brighter source (the star). Many techniques have arisen, using various forms of coronagraphy, to achieve the very high contrast needed (up to 10^-10), to image extrasolar planets from space [1]. In this note we describe an algorithm using a single DM for estimating and correcting static wavefront error in a coronagraphic imaging system. Our approach works across the full spectrum of high-performance coronagraphs.

The performance of conventional adaptive optics (AO) typically has been limited by the accuracy of the wavefront phase estimation (known as phase reconstruction) and the ability of the deformable mirror (DM) to achieve the arbitrary shapes required [2]. As improvements have been made in these areas, the contrasts achieved by conventional phase conjugation have still been limited [3]. In [4], we demonstrated that this is due to intermodulation products of the high frequency content of the DM, call frequency folding. Frequency folding, not to be confused with aliasing due to wavefront sampling, occurs even with perfect estimation of the wavefront and a DM limited only by the highest spatial frequency it can achieve.

The accepted solution is to perform measurements on the final science camera in conjunction with, for example, the speckle nulling algorithm [5, 6]. While proven effective, classical speckle nulling requires many iterations, potentially imposing unreasonable stability requirements on the system. The algorithm we propose here is substantially more efficient. The correction part of the complete algorithm is based on a method developed by Bordè et al. [7] for correcting the field in a classical Lyot coronagraph which in turn has its origin in the linear solution of the ‘dark hole algorithm’ developed by Malbet et al. [8]. We have generalized these approaches by incorporating the full, nonlinear expression for the aberrations, including scattered light and the nominal diffraction of the system, and by formulating the correction in a manner applicable to a wide variety of high-contrast imaging systems.

2. Energy Minimization: a closed-loop correction algorithm for high contrast imaging systems

In this section we briefly describe the details of the correction algorithm which we call Energy Minimization. These are divided into two pieces: a correction stage where the DM actuators are set based on the measured field and an estimation, or reconstruction, stage where the real and imaginary parts of the complex electric field at the DM is computed from multiple measurements in the image plane. It is important to note that while we present a complete closed-loop algorithm, these two steps are functionally independent and could be implemented with other companion approaches.

2.1. The correction stage

Let the response of an optical system, operating in monochromatic light, be modeled by the linear transformationC between the electric field at the deformable mirror (DM) plane, E ₀, and the electric field at the science camera plane, E_f, where the measurements will take place,

E_{f} = C {E_{0}}

We are particularly interested in using shaped pupil coronagraphs for high-contrast [9]. There, C{E ₀}=𝓕{SE ₀}, where 𝓕 represents the Fourier transform and S represents the shaped pupil function (the DM is assumed to be at a conjugate plane to the shaped pupil). Similarly, for the band-limited Lyot coronagraph [10], C{E ₀}=(𝓕{SE ₀}M)⊗𝓕{L}, where S is the entrance pupil of the coronagraph, M is the image plane mask, L is the Lyot pupil and ⊗ represents the convolution (the DM is assumed to be at the same plane as the entrance pupil). Since all real optical systems have aberrations induced by errors in the optics, the input field in Eq. 1 can be modeled,

E_{0} = A e^{α + i β} e^{i ψ},

where A is the un-aberrated, ideal, electric field, α and β represent the amplitude and phase aberrations, respectively, and ψ represents the phase difference caused by the deformation of the DM surface.

Using Eqs. 1 and 2, the electric field in the science camera field for a system with phase and amplitude aberrations and a DM is thus given by,

E_{f} = C {A e^{α + i β} e^{i ψ}} \approx C {A (1 + Φ) (1 + i ψ)}

where we used the first order Taylor series for the effect of the DM, Φ=e^α ^+iβ-1 was defined for expansion purposes and the cross terms between ψ and Φ is assumed to be negligible.

The total energy in the dark zone, 𝓔, is given by the inner product of the electric field, 𝓔=〈E_f,E_f〉, or,

𝓔 = 〈 C {A e^{α + i β}} + i C {A ψ}, C {A e^{α + i β}} + i C {A ψ} 〉

where 〈f,g〉=∫∬f*gdξdη and the asterisks represents a complex conjugate.

Any optimal correction algorithm requires some model for the DM surface height, ψ as a function of the actuator commands. Here, we assume an influence function model,

ψ = \sum_{k = 1}^{N_{dm}} \sum_{l = 1}^{N_{dm}} a_{k, l} f_{k, l},

where the DM consists of an array of N_dm×N_dm actuators, a_k _,l is the kl ^th coefficient (actuator command), and f_k _,l is the DM’s influence function, centered at the location of the kl th actuator. The influence function is defined as the surface profile of the DM when one actuator is commanded (note that this model assumes no coupling between actuators and that superposition holds).

We choose as our correction criteria to minimize the energy in a region of the image plane. We then check to verify that this minimization created adequate contrast. The condition for minimum energy is that for all k and l, $\frac{\partial 𝓔}{\partial a_{k, l}} = 0$ , or,

\sum_{m = 1}^{N_{dm}} \sum_{n = 1}^{N_{dm}} a_{m, n} ℜ {〈 C {A f_{k, l}}, C {A f_{m, n}} 〉} = - ℑ {〈 C {A f_{k, l}}, C {A e^{α + i β}} 〉},

where ℜ and ℑ represent the real and imaginary parts, respectively, and we used the relations $C {A ψ} = \sum_{k = 1}^{N_{dm}} \sum_{l = 1}^{N_{dm}} a_{k, l} C {A f_{k, l}}$ , $\frac{\partial C {A ψ}}{\partial a_{k, l}} = C {A f_{k, l}}$ and the linearity of C. The criteria for minimum energy in Eq. 6 can be written in matrix form as

[\begin{matrix} G_{1, 1} & \dots & G_{1, N_{dm}^{2}} \\ ⋮ & ⋱ & ⋮ \\ G_{N_{dm}^{2}, 1} & \dots & G_{N_{dm}^{2}, N_{dm}^{2}} \end{matrix}] [\begin{matrix} a_{1} \\ ⋮ \\ a_{N_{dm}^{2}} \end{matrix}] = [\begin{matrix} H_{1} \\ ⋮ \\ H_{N_{dm}^{2}} \end{matrix}],

where G_r,q=ℜ{〈C{Af_r}, C{Af_q}〉} and H_r=-ℑ{〈C{Af_r},C{Ae ^α+iβ}〉} and r,q=1,2, …,N ² _dm. Therefore, in order to find the coefficients for the DM configuration that minimizes the energy in the dark zone, we need to estimate C{Ae ^α+iβ} which is essentially the complex valued electric field in the science camera plane of an uncorrected, aberrated system.

2.2. The reconstruction stage

For each DM configuration ψ_k, following similar steps as the ones leading to equation 3, the electric field in the final science camera plane is given by

E_{k} \approx C {A e^{α + i β}} + C {A Ψ_{k}}

where $Ψ_{k} = e^{i ψ_{k}} - 1$ and we assume the cross term C{AΦΨ_k} is negligible.

The intensity of light in the final science camera plane for any DM configuration, ψ_k, is therefore approximately given by,

I_{k} = {∣ C {A e^{α + i β}} + C {A Ψ_{k}} ∣}^{2} .

Note that this is an image-plane measurement, in contrast to conventional adaptive optics, where the electric field at the pupil is measured directly using a wavefront sensor such as a Shack-Hartmann (SH). While the SH sensor allows direct field estimation, it requires the use of additional optics not in the science path. This introduces uncorrectable non-common path error larger than the desired contrast. Our image-plane measurements with the science camera eliminate this problem. However, because we are measuring the intensity of the field, one image plane measurement is inadequate for estimating the complex electric field. We therefore take multiple images at different DM settings to estimate the field. Let the first such image, I ₀, be taken with ψ=0. The intensity of light in the image plane is then

I_{0} = {∣ C {A e^{α + i β}} ∣}^{2} .

The remaining images are taken with different DM configurations,ψ_k. Combining equations 9 and 10, for each image taken gives,

I_{k} - I_{0} - {∣ C {A Ψ_{k}} ∣}^{2} = {(C {A e^{α + i β}})}^{*} C {A Ψ_{k}} + C {A e^{α + i β}} {(C {A Ψ_{k}})}^{*} .

Using multiple DM configurations allows us to solve for C{Ae ^α+iβ} via the following matrix equation,

[\begin{matrix} I_{1} - I_{0} - {∣ C {A Ψ_{1}} ∣}^{2} \\ ⋮ \\ I_{k} - I_{0} - {∣ C {A Ψ_{k}} ∣}^{2} \end{matrix}] = 2 [\begin{matrix} ℜ {C {A Ψ_{1}}} & ℑ {C {A Ψ_{1}}} \\ ⋮ & ⋮ \\ ℜ {C {A Ψ_{k}}} & ℑ {C {A Ψ_{k}}} \end{matrix}] [\begin{matrix} ℜ {C {A e^{α + i β}}} \\ ℑ {C {A e^{α + i β}}} \end{matrix}]

implying that

[\begin{matrix} ℜ {C {A e^{α + i β}}} \\ ℑ {C {A e^{α + i β}}} \end{matrix}] = \frac{1}{2} {[\begin{matrix} ℜ {C {A Ψ_{1}}} & ℑ {C {A Ψ_{1}}} \\ ⋮ & ⋮ \\ ℜ {C {A Ψ_{k}}} & ℑ {C {A Ψ_{k}}} \end{matrix}]}^{†} [\begin{matrix} I_{1} - I_{0} - {∣ C {A Ψ_{1}} ∣}^{2} \\ ⋮ \\ I_{k} - I_{0} - {∣ C {A Ψ_{k}} ∣}^{2} \end{matrix}]

where † represents the pseudo inverse (typically solved by a least squares-based method such as the Singular Value Decomposition). The condition for the existence of a solution at a given point is that there exists m and n, such that

ℜ {C {A Ψ_{m}}} ℑ {C {A Ψ_{n}}} - ℜ {C {A Ψ_{m}}} ℑ {C {A Ψ_{n}}} \neq 0

The physical meaning of this equation is that there must be at least 2 DM settings that create 2 different electrical fields for every CCD pixel in the image plane that mix with the original field. Only then will there be enough diversity to reconstruct the field at each CCD pixel.

3. Experimental results

In this section, we briefly demonstrate the method described in the previous section on a real shaped pupil coronagraph at the Princeton Terrestrial Planet Finder Laboratory [11], and compare the performance to classical speckle nulling.

Our lab is designed to take an image of an artificial star with a mockup of a coronagraphic telescope and measure the resulting contrast. We used a 632nm He-Ne laser as our artificial star, and a 32×32 Boston Micromachines deformable mirror for correction. Our optics had λ/20 surface figure and were 6” in diameter (only a small portion of which was used).

We iteratively applied 2 types of correction: Classical Speckle Nulling and Energy Minimization. (See [6] for a brief description of our implementation of Classical Speckle Nulling.) For the case of Energy Minimization, we used two DM configurations (k=1, 2), so that the total number of images per iteration is 3. Each DM configuration was a sum of cosine ripples, designed to create a grid of overlapping PSFs in the image plane that completely covers the dark zone. The two DM configurations were the same except they were in quadrature phase (in which case they satisfy Eq. 14). Figure 1 shows the experimental results.

Fig. 1. CCD images. Left: Before correction. Average contrast across (5-14λ/D) is about 10^-5. Center: After Classical Speckle Nulling correction bottoms out. Contrast across (6^-14 λ/D) is about 10^-6. Right: After Energy Minimization bottoms out. Contrast across (6-14 λ/D) is about 6×10^-7.

Download Full Size | PDF

A dark hole was generated in the region from 5 to 14 λ/D for the first 17 iterations of energy minimization and the first 50 iteration of speckle nulling, and then switched to 6 to 14 λ/D in the horizontal direction and -2 to 2 λ/D in the vertical. (We use sky angle as image plane coordinates, in units of 1λ/D, which is the angular diffraction limit of a telescope of aperture D and wavelength λ). Figure 1 shows the images taken before correction, after Classical Speckle Nulling correction bottomed out, and after Energy Minimization bottomed out. The optical axis in all images is close to the right edge of the image and the images show the left dark zone out to about 25λ/D, together with a portion of the image plane stop which blocks the core of the star. The colormap is logarithmic, with contrast levels shown in the colorbar to the right. It is evident that in both cases of correction, a dark hole appears with improved contrast.

Figure 2 shows the average contrast in the dark hole as a function of the total number of images taken for the two algorithms. The black and red curves correspond, respectively, to classical speckle nulling (which used 7 images per iteration) and energy minimization (which required 3 images per iteration). It is evident that, at least in this case, the energy minimization algorithm (a) improved contrast faster than classical speckle nulling (computational times were not significant) and (b) achieved a modestly deeper null. (Though it is not shown on the graph, speckle nulling asymptotes around 10^-6 on the Princeton testbed.) It should be noted, however, that classical speckle nulling was observed to be much less sensitive to noise and errors in the model than energy minimization. The limiting factor in our classical speckle nulling experiment is, based on simulations, believed to be the inefficiency of Classical Speckle Nulling to correct for spatially small speckles, which arise at around the 10^-6 level. Energy minimization does not suffer from this limitation and therefore achieves a deeper null. However, it also hits a limit at roughly 6×10^-7 in our lab. The limiting factor is the haze around 4λ/D that is apparent on the rightmost image in Figure 1. This residual halo behaves as incoherent light, i.e. adding in intensity rather than amplitude (it is not known yet what it actually is) and thus is not eliminated by theDMcorrection. One of the appealing features of the subtraction based estimation formula in Eq. 11 is that any incoherent light common to each measurement is eliminated, resulting in an estimate of the desired coherent portion only. The contrast of this residual coherent light estimate is roughly 10^-7, averaged across the control region. At this level, quantization of the DM voltage signal is believed to be the limiting factor. Simulations further show that with no quantization (and no other non-fundamental limiting factors such as incoherent light) the contrast reaches 10^-10 with energy minimization.

Fig. 2. A log-log plot of average contrast in the control region vs. number of images taken by the estimation and control algorithms.

Download Full Size | PDF

4. Summary and conclusion

We have presented and demonstrated in monochromatic light a fast wavefront estimation and correction algorithm. In only a few iterations, the contrast improves by a factor of 10, and the contrast of the coherent light estimate in the image improves by a factor of 100. Limiting factors are being explored and are believed to be in hardware rather than the algorithm. Even though we have only considered monochromatic light, it is possible to generalize this algorithm for multi-spectral operation, though this is beyond the scope of the present paper.

Acknowledgments

This work was carried out in part at the Jet Propulsion Laboratory, California Institute of Technology, under contract with the National Aeronautics and Space Administration.

References and links

1. W. A. Traub, ed. Proceedings of Coronagraph Workshop2006. JPL Publication 07-02.

2. R.K. TysonIntroduction to Adaptive Optics. SPIE Press, 2000. [CrossRef]

3. L.A. Poyneer and B. Macintosh “Spatially filtered wave-front sensor for high-order adaptive optics,” J. Opt. Soc. Am. A 21(5), 810–819 (2004). [CrossRef]

4. A. Give’on, N. J. Kasdin, R. J. Vanderbei, and Y. Avitzour “On representing and correcting wavefront errors in high-contrast imaging systems,” J. Opt. Soc. Am. A 23 (2006).

5. J. T. Trauger, C. Burrows, and B. Gordon et al, “Coronagraph contrast demonstration with the high-contrast imagaing testbed,” Proc. SPIE 5487, 1330–1336 (2004). [CrossRef]

6. R. Belikov, A. Give’on, J.T. Trauger, M. Carr, N.J. Kasdin, R.J. Vanderbei, F. Shi, K. Balasubramanian, and A. Kuhnert, “Toward 10¹⁰ contrast for terrestrial exoplanet detection: demonstration of wavefront correction in a shaped-pupil coronagraph,” Proc. SPIE 6265, pp. 626–518 (2006).

7. P. J. Borde and W. A. Traub, “High-contrast imaging from space: Speckle nulling in a low aberration regime,” Astrophys. J. 638 (2006).

8. F. Malbet, J.W. Yu, and M Shao, “High dynamic range imaging using a deformable mirror for space coronography”, PASP , 107, pp. 386 (1995). [CrossRef]

9. N. J. Kasdin, R. J. Vanderbei, D. N. Spergel, and M. G. Littman “Extrasolar Planet Finding via Optimal Apodized-Pupil and Shaped-Pupil Coronagraphs” Astrophys. J. 582(2), 1147–1161, (2003). [CrossRef]

10. M. J. Kuchner, J. Crepp, J. Ge, and Astrophys.“Eighth-Order Image Masks for Terrestrial Planet Finding,” J. 628(1), 466–473 (2005).

11. N.J. Kasdin, R. Belikov, J. Beall, R.J. Vanderbei, M.G. Littman, M. Carr, and A. Give’on, “Shaped pupil coronagraphs for planet finding: optimization, manufacturing, and experimental results,” Proc. SPIE 5905, 128–136 (2005).

Closed loop, DM diversity-based, wavefront correction algorithm for high contrast imaging systems

Abstract

1. Introduction

2. Energy Minimization: a closed-loop correction algorithm for high contrast imaging systems

2.1. The correction stage

2.2. The reconstruction stage

3. Experimental results

4. Summary and conclusion

Acknowledgments

References and links

Cited By

Figures (2)

Equations (16)

Optics Express