Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Off-axis aberrations improve the resolution limits of incoherent imaging

Open Access Open Access

Abstract

The presence of off-axis tilt and Petzval curvature, two of the lowest-order off-axis Seidel aberrations, are shown to improve the Fisher information of two-point separation estimation in an incoherent imaging system compared to an aberration-free system. Our results show that the practical localization advantages of modal imaging techniques within the field of quantum-inspired superresolution can be achieved with direct imaging measurement schemes alone.

© 2023 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

1. Introduction

In well-corrected imaging systems, the propagation from the object to image planes is often modeled with shift invariance [1]. This indicates that the profile of the imaging system’s point spread function (PSF) is independent of object impulse’s location. Recent works have recasted the associated Rayleigh’s criterion, which states that the separation of two nearby point sources cannot be precisely estimated, for such systems in terms of the classical Fisher information (CFI) [2]. The CFI informs on the precision one expects in extracting an unknown parameter with a given measurement scheme. In this context, Rayleigh’s criterion states that the CFI for the separation of two point sources, with direct intensity (DI) measurements, vanishes as the separation between the point sources approaches zero.

A shift in the well-established understanding of the Rayleigh’s criterion was provided by Tsang and Nair [3]. They showed that the quantum Fisher information (QFI), an upperbound for the CFIs over all possible measurement schemes, for the separation of two equally bright incoherent point sources remains non-zero in the sub-Rayleigh regime. This result, along with the introduction of modal imaging techniques like spatial mode demultiplexing (SPADE), a measurement scheme that saturates the QFI, is responsible for the wide net of recent research efforts that sought to extend the promise of this so-called quantum-inspired superresolution [410]. Such extensions include theoretical work in which strides have been made regarding the generalization of the object’s spatial and coherence details [7,1116]. Regarding experimental considerations, the advantages of SPADE-type measurements have been tested and confirmed for a variety of objects [8,17].

In this work, calculations are provided for the CFI and QFI for the separation between two equally bright incoherent point sources in imaging systems which possess off-axis/field-dependent aberrations. The presence of these aberrations causes imaging systems to be shift-variant; the analysis of such systems have been thus far neglected in the context of quantum-inspired superresolution. Remarkably, we show that the inclusion of simple low-order off-axis Seidel aberrations can give rise to more precise estimation of source separation when compared to an aberration-free system by improving on the CFI and QFI. In particular, we provide the analysis for off-axis tilt (OAT) and Petzval curvature. Both are shown to improve the CFI for both DI and modal imaging measure schemes, with the former providing a global improvement and the latter giving rise to improvements in the sub-Rayleigh regime. To derive these results, a general framework for shift-variant imaging systems is introduced in Section 2, from which a specialized treatment for two equally bright incoherent point sources is considered. Our results, given in Section 3, provide context to the nascent field of quantum-inspired superresolution and show that much improvement can be made through the intentional usage of off-axis aberrations.

2. Theoretical background

2.1 Shift-variant imaging systems

In a shift-invariant imaging system, the optical field at the image plane may be obtained from the optical field at the object plane via a convolution with the system’s PSF [1]. In the presence of off-axis aberrations, however, the relation between object field, $U_o$, and image field, $U_i$, is given by a more general integral transformation. For simplicity, we assume one transverse spatial coordinate for which the aforementioned transformation is given by

$$U_i(x) = \int_{-\infty}^\infty U_o(\xi) \psi(x,\xi)\, \textrm{d} \xi,$$
where $\xi$ and $x$ are the object and image plane spatial coordinates, respectively. The departure from a shift-invariant imaging system is captured by the form of the PSF $\psi (x,\xi )$, which indicate a dependence on both $\xi$ and $x$ that is not confined to their difference $x-\xi$. The form for $\psi (x,\xi )$ may be obtained by considering the propagation of a quasi-monochromatic point source with wavenumber $k = 2\pi / \lambda$, modeled as a Dirac delta impulse, located at $\xi$, through the imaging system. This impulse is mapped onto a linear phase factor at the pupil plane. There, the linear phase factor is modulated by a Gaussian pupil function of characteristic width $\sigma _p$ and a phase aberration function, $\Delta W$. The choice of a Gaussian pupil function, commonly made in related works, is used for mathematical convenience [3,11,12]. An inverse Fourier transformation is then performed on the pupil-plane field to give
$$\psi(x,\xi) = U_0 \int_{-\infty}^\infty \exp\left( - \textrm{i}{k} \frac{\xi u}{ f} \right) \exp\left( - \frac{u^2}{4 \sigma_p^2} \right) \exp \left[ -\textrm{i}{k} \Delta W(u,\xi) \right] \exp \left(\textrm{i}{k} \frac{ux}{ f} \right)\, \textrm{d}{u},$$
where $U_0$ is a constant with dimensions of optical field that ensures the intensity-normalization of $\psi$. Furthermore, $u$ is the pupil-plane spatial coordinate, and $f$ is the focal length of the lenses in a $4f$ imaging system (although the results presented in this work are valid for arbitrary other imaging configurations); the Fourier optical details regarding Eq. (2) is described further in Supplement 1. A schematic of the imaging system is shown in Fig. 1. It will be useful to define $\sigma = f\lambda /(4\pi \sigma _p)$ as the characteristic width of the diffraction-limited [$\Delta W(u,\xi ) = 0$] PSF. Notice, as is true for the case of a shift-invariant imaging system, the system PSF would explicitly be a function of the difference in coordinates $x - \xi$ if $\Delta W$ were independent of the object location $\xi$. Such on-axis/field-independent aberrations include well-known Zernike terms such as defocus and spherical aberration. Our work focuses instead on examples of $\Delta W$ which depend on $\xi$.

 figure: Fig. 1.

Fig. 1. A schematic of a $4f$ imaging system with $\xi,u,$ and $x$ as the spatial coordinates for the object, pupil, and image planes, respectively. An illustration of two point sources separated by $s$ is shown along with their image distributions in the presence of OAT and Petzval curvature of strengths $T$ and $P$, respectively, which are defined in Eq. (4). The individual PSFs have a half-width of $g$ (which depends on $P$), defined in Eq. (6), and their separation is magnified by a factor of $1 + 2\pi T \sigma$. The pupil is modeled as a Gaussian with half-width $\sigma _p$.

Download Full Size | PDF

Before proceeding to the analysis of off-axis aberrations, we note that the spatial profile of $\psi$ is affected by the strength of the aberration $\Delta W$. For the purposes of quantifying this strength in later sections, it is useful to use $2\sigma _p$ and $2 \sigma$ as standard lengths for the pupil coordinate $u$ and the object location $\xi$, respectively. The reason for this choice is due to (1) the Gaussian pupil function strongly attenuating any signal at the pupil plane located at $|u| > 2\sigma _p$ and (2) the context of quantum-inspired resolution is most relevant for objects located at $|\xi | < 2 \sigma$; that is, for objects whose features are below twice the diffraction-limited PSF. This regime is henceforth referred to as the sub-Rayleigh regime.

2.2 Off-axis aberrations

From Eq. (1), it is clear that the PSF of a shift variant system is determined by the aberration function $\Delta W(u,\xi )$. Although there is an infinitude of options; we restrict our analysis to the case where

$$\Delta W(u,\xi) = W_{111} \left( \frac{\xi}{2 \sigma} \right) \left( \frac{u}{2\sigma_p} \right) + W_{220} \left( \frac{\xi}{2 \sigma} \right)^2 \left( \frac{u}{2\sigma_p} \right)^2,$$
which is recognized as the superposition of the two low-order off-axis Seidel aberrations commonly called OAT and Petzval curvature [18]. Notice that Eq. (3) is written in terms of ratios for $\xi$ and $u$ so that $W_{111}$ and $W_{220}$ can be interpreted as the maximal [when $\xi = 2\sigma$ and $u = 2\sigma _p$, as discussed in the paragraph following Eq. (2)] optical path difference caused by OAT and Petzval curvature, respectively. By further defining $T$ and $P$ as
$$T \triangleq \frac{W_{111}}{\lambda \cdot (2\sigma)} \quad \text{and} \quad P \triangleq \frac{W_{220}}{\lambda \cdot (2\sigma)^2},$$
to be OAT and Petzval curvature strength parameters, respectively (they measure rates, for $u = 2\sigma _p$, at which the phase difference caused by OAT and Petzval curvature, respectively, increases as a function of $\xi$), one finds that the intensity-normalized shift-variant PSF is given through Eq. (2) by
$$\psi(x,\xi;P,T) = \frac{1}{[2\pi g^2(\xi;P)]^{1/4}} \exp \left\{- \frac{[x - \xi(1 + 2\pi T \sigma)]^2}{4 g^2(\xi,P)} + \textrm{i} \Phi(x,\xi;P,T)\right\} ,$$
where
$$g(\xi;P) \triangleq \sigma \sqrt{1 + 4\pi^2 P^2 \xi^4},$$
is the characteristic width of $\psi$ and
$$\Phi(x,\xi;P,T) \triangleq -\frac{1}{2} \left\{ \tan^{{-}1}(2\pi P\xi^2) + \frac{\pi P \xi^2 [x - \xi(1+2\pi T \sigma)]^2}{ g^2 (\xi;P)} \right\},$$
is the phase of $\psi$. Equation (5) shows why the choice of Eq. (3) was used: OAT and Petzval curvature cause intuitive changes in the PSF’s mean location and width, respectively. Therefore, much insight can be obtained from studying Eq. (3) since, while other off-axis aberrations may induce higher order effects on $\psi$, the effects of OAT and Petzval is expected to provide a useful leading-order description.

OAT and Petzval curvature, among other Seidel aberrations, are common in typical imaging systems and have well-known interpretations particularly in the language of geometrical (ray) optics. For a system with OAT, the plane wave emerging from the pupil associated with each object location $\xi$ is proportionally tilted (posses an additional linear phase). This often arises from unintentional tilts of the optics within the system and causes an incorrect magnification in the image. This magnification factor, according to Eq. (5) and Fig. 1, is $(1+2\pi T \sigma )$; in the case of an object scene with two point sources, this magnification translates to an amplification of the separation in the image field. Petzval curvature, on the other hand, describes an imaging system in which the image is perfect when the intensity is measured along a curved surface. If the intensity is instead measured, as it usually is, along a flat surface, off-axis object points $\xi$ are mapped to a shift-variant PSF whose width increases with $|\xi |$. According to Eqs. (5) and (6), this width is $g(\xi ;P)$.

For the aberration-free case ($P = 0$ and $T = 0$), Eq. (5) reduces to the shift-invariant Gaussian PSF with width $\sigma$. Furthermore, it should be noted that the proceeding comparisons of Fisher information (FI) values between systems where the PSF is given by $\psi (x,\xi ;P,T)$ and the aberration-free case $\psi (x,\xi ;0,0)$ are fair, since $g(\xi ;P) \ge \sigma$. In other words, the presence of OAT and Petzval curvature does not reduce the diffraction-limited spot size and therefore the improved FI is not attributed to the reduction of $\sigma$.

2.3 Measurement schemes

Although this work will primarily be concerned with incoherent imaging, a general framework is provided here for completeness and comprehension. To this end: the density matrix, which represents the cross-spectral density of the optical field at the image plane, for the case of two partially coherent point sources with intensity ratio $A$, a known centroid taken to be at the origin of the coordinate system, separated by a distance $s$ is given by

$$\rho(s;P,T) = \frac{|\psi_+ \rangle \langle \psi_+| + A|\psi_- \rangle \langle \psi_-| + \Gamma |\psi_+\rangle \langle \psi_-| + \Gamma^* |\psi_-\rangle \langle \psi_+|}{1 + A + 2 \text{Re}[\Gamma d(s;P,T)]},$$
which is expressed over the non-orthogonal, intensity-normalized basis $\{|\psi _+ \rangle, |\psi _-\rangle \}$. These basis kets are defined over position-space through Eq. (1) as
$$|\psi_\pm\rangle \triangleq \int_{-\infty}^\infty \psi_\pm(x;P,T) |x \rangle \, \textrm{d}{x} \triangleq \int_{-\infty}^\infty \psi\left(x, \pm \frac{s}{2} ;P,T\right) |x \rangle \, \textrm{d}{x}.$$

Furthermore,

$$d(s;P,T) \triangleq \langle \psi_- | \psi_+ \rangle = \int_{-\infty}^\infty \psi \left( x, \frac{s}{2} ; P, T\right) \psi^*\left(x, -\frac{s}{2}; P, T \right) \, \textrm{d}{x}$$
is the field-overlap integral between the two imaged PSFs and $\Gamma$ is a complex-valued coherence parameter whose values are restricted by the condition $|\Gamma | \le A$. The normalization factor in Eq. (8) indicates that the density matrix $\rho (s)$ uses the image-plane normalization scheme, in which the unit-trace condition on $\rho$ is ensured by the total number of received photons at the image plane. This choice is in contrast to the object-plane normalization scheme; a detailed discussion of the two options is provided elsewhere [14].

The act of performing a measurement on the received photons is captured mathematically through computing the modulus-square coefficients over a corresponding set of projections of $\rho$. These coefficients are taken to be the probabilities of finding an image-plane photon in a certain mode (which may be continuous or discrete). In this work, we focus on comparing the measurement schemes of DI and SPADE. For DI, the set of projection modes is the continuous position basis $\{|x\rangle \}$; the probability density of finding a photon in the $x$ position of the image plane (conditioned on a detection event) is given by

$$\begin{aligned} p_\text{I}(x,s;P,T) &= \langle x| \rho(s;P,T) | x \rangle\\ &= \frac{|\psi_+(x;P,T)|^2 + |\psi_-(x;P,T)|^2 + 2 \text{Re}\left[\Gamma \psi_+(x;P,T) \psi_-(x;P,T) \right]}{1 + A + 2 \text{Re}[\Gamma d(s;P,T)]}. \end{aligned}$$

In Section 3, we focus on the case where $\Gamma = 0$ and $A = 1$ (equally bright incoherent point sources). Figure 2 is provided for the visualization of $p_\text {I}(x,s;P,T)$, which is a normalized version of the image-plane intensity distribution, for various levels of OAT and Petzval curvature. The aberration-free case shown in Fig. 2(a) shows the usual intensity pattern of two Gaussian PSFs converging as the separation $s$ vanishes. The effects of non-zero values of $T$ and $P$ are illustrated in Figs. 2(b) - (d): OAT causes the intensity distributions from the two point sources to converge at a different rate due to the magnification factor $(1+2\pi T \sigma )$ and Petzval curvature gives the intensity distributions a $s$-dependent width. Regarding OAT, it is clear from the comparison of Fig. 2(a) and (b) that non-zero values of $T$ allows for the two point sources’ PSFs to be more easily distinguished for smaller values of $s/\sigma$. This effect, which will be detailed later, is responsible for larger values of CFI when using DI for imaging systems with OAT. Petzval curvature, on the other hand, does not have a similarly clear benefit in $s$-estimation whe compared to OAT. The resolution advantages offered by non-zero values of $P$ instead comes from the sensitivity of the width $p_\text {I}$ as $s$ approaches zero.

 figure: Fig. 2.

Fig. 2. Density plots of $p_\text {I}(x,s;P,T)$ over $x/\sigma$ and $s/\sigma$ for various values of $T\sigma$ and $P\sigma ^2$ in (a) - (d). In each, the solid red lines show the location of the two point sources. The dashed blue lines in (b) and (d) show the centers of intensity distribution from each point source for non-zero OAT. The dashed yellow lines in (c) and (d) show the outer envelope of $p_\text {I}$ for non-zero Petzval curvature.

Download Full Size | PDF

For SPADE, the projection is done over any set of discrete orthonormal modes. A natural choice is the $\sigma$-matched Hermite-Gauss (HG) basis $\{\phi _q(x)\}_{q=0}^\infty$, where the $q$-th mode is defined as

$$\phi_q(x) = \frac{1}{(2\pi \sigma^2)^{1/4}}\frac{1}{\sqrt{2^q q!}} H_q\left( \frac{x}{\sqrt{2} \sigma} \right) \exp \left( - \frac{x^2}{4\sigma^2} \right),$$
and $H_q$ is the $q$-th physicist’s Hermite polynomial. The probability of finding a photon (conditioned on a detection event) in the $q$-th mode is therefore the projection
$$\begin{aligned} p_\text{II}(q,s;P,T) &= \langle \phi_q| \rho(s;P,T)|\phi_q \rangle\\ &= \frac{|\langle \phi_q | \psi_+\rangle|^2 + |\langle \phi_q | \psi_-\rangle|^2 + 2 \text{Re}\left(\Gamma \langle \phi_q | \psi_+\rangle \langle \psi_- | \phi_q \rangle \right)}{1 + A + 2 \text{Re}[\Gamma d(s;P,T)]}. \end{aligned}$$

The probability densities given by Eqs. (11) and (13) for DI and SPADE, respectively, are needed in the calculation of the CFI of either measurement scheme. For the case where only the separation $s$ is an unknown parameter, the corresponding CFIs for DI and SPADE are given by

$$F_\text{I}(s;P,T) = \int_{-\infty}^\infty\frac{1}{p_\text{I}(x,s;P,T)} \left[ \frac{\partial}{\partial s} p_\text{I}(x,s;P,T) \right]^2 \, \textrm{d}{x} ,$$
and
$$F_\text{II}(s;P,T)= \sum_{q = 0}^\infty \frac{1}{p_\text{II}(q,s;P,T)} \left[ \frac{\partial}{\partial s} p_\text{II}(q,s;P,T) \right]^2,$$
respectively. Although a full background on SPADE has been presented thus far, it is sufficient to consider a simplified version of SPADE known as binary SPADE (BSPADE) when one is interested in the sub-Rayleigh regime. In BSPADE, a photon arriving from the object plane is measured either in the $q = 0$ mode or in the combined $q>0$ mode. Intuition for this simplification comes from the fact that, as the object shrinks, most of the photons arriving at the image plane will project into the lowest order modes and the higher order modes contains progressively less information. In this case, the summation in Eq. (15) simplifies and can can be explicitly expressed as
$$F_{\text{II}}(s;P,T) \approx \frac{1}{p_\text{II}(0,s;P,T) - p_\text{II}^2(0,s;P,T)} \left[ \frac{\partial}{\partial s} p_\text{II}(0,s;P,T) \right]^2.$$

The quantities $F_\text {I}$ and $F_\text {II}$ are valuable in that, via estimation theory, their reciprocals provide lower bounds for the variance in the estimation of $s$ when the corresponding measurement scheme is used. In other words, if $\check {s}_\text {I}$ and $\check {s}_{\text {II}}$ are estimators for $s$ constructed from data acquired via DI or BSPADE measurements, respectively, then

$$\text{Var}(\hat{s}_\text{I}) \ge F_\text{I}^{{-}1} \quad \text{and} \quad \text{Var}(\hat{s}_\text{II}) \ge F_\text{II}^{{-}1}.$$

The CFI for DI and BSPADE measurements are compared for various values of $T$ and $P$ in Section 3. There, it is shown that non-zero values of these aberration strength parameters can lead to larger values of $F_\text {I}$ and $F_\text {II}$.

2.4 Quantum FI

In addition to the CFI, which depends on the choice of measurement scheme, it is informative to consider the QFI, $Q_s$, which provides an upperbound to all possible CFI once the transformation between object and image planes is stipulated. In other words,

$$Q_s \ge F_\text{I} \quad \text{and} \quad Q_s \ge F_\text{II}.$$

As is standard in deriving the QFI, one first finds the symmetric logarithm derivative (SLD), $L_s$, associated with the separation parameter $s$. The SLD is defined implicitly through

$$\frac{\partial \rho(s;P,T)}{\partial s} = \frac{\rho(s;P,T) L_s + L_s \rho(s;P,T)}{2},$$
where $\rho$ is the image-plane density matrix given by Eq. (8); one should note that this prescription of $\rho$ corresponds to the image-plane normalization scheme detailed in [14]. With $L_s$ in hand, the QFI for $s$ is immediately obtained via
$$Q_s(s;P,T) = \text{Tr} \left[ \frac{\partial \rho(s;P,T)}{\partial s} L_s \right].$$

Details pertaining to the derivation of the QFI for imaging systems with off-axis aberrations are found in Supplement 1. It should be emphasized that such calculations for the separation QFI for imaging systems in which the PSF is shift-variant require care and themselves constitute a novel result from which interesting discussions may arise. However, to keep the focus on well-known measurement schemes like DI and BSPADE, the QFI will primarily serve as a confirmation of CFI results through Eq. (18).

3. Results and discussion

3.1 Results

Although the framework developed in Section 2 is general, we now specialize to the case of two equally bright incoherent point sources. This choice corresponds to the selection of $A = 1$ and $\Gamma = 0$ in Eqs. (8), (11), and (13). We are interested in calculating the CFI for DI and BSPADE for various values of $P$ and $T$ and comparing them to the case where the imaging system is aberration-free (and therefore shift-invariant), i.e., for $P = 0$ and $T = 0$. The values of $P$ and $T$ used in the subsequent analysis were not chosen for any particular reason beside to provide a sufficient illustration of the effects of OAT and Petzval curvature on the CFI and QFI. However, we note that such values correspond to aberrated wavefronts at the pupil plane whose maximum deviations from the nominal wavefront are fractions of a wavelength.

In the following, the CFI for DI, given by Eq. (14), is calculated numerically. For BSPADE, it can be shown using Eqs. (5), (12), and (13) that

$$p_\text{II}(0,s;P,T) = \frac{1}{\sqrt{1 + P^2 \pi^2 (s/2)^4}}\exp \left\{ - \frac{(s/2)^2 (1 + 2\pi T \sigma)^2}{4 \sigma^2 [1 + P^2 \pi^2 (s/2)^4]} \right\},$$
which may be inserted into Eq. (16) to obtain $F_\text {II}$. Comparisons of $F_\text {I}$ and $F_\text {II}$ for various values of $T$ and $P$ are displayed in Fig. 3. When considering the $P = 0$ case (the imaging system only has OAT and no Petzval curvature), Fig. 3(a) shows that increasing values of $T$ leads to greater $F_\text {I}$ for all values of $s/\sigma$: this global improvement in $F_\text {I}$ for non-zero values of $T$ is one of the main results of this analysis. For BSPADE, Fig. 3(b) shows that a similar improvement in $F_\text {II}$ occurs when $T \neq 0$ in the sub-Rayleigh regime. Additionally, notice that $F_\text {II}$ saturates the QFI at exactly $s = 0$ for all values of $T$, which indicates that BSPADE continues to be an optimal measurement for $s$ even for shift-variant imagine systems. However, we re-emphasize that the main purpose of this analysis to to show that introducing off-axis aberrations in a system can vastly improve CFI even if one only considers a DI measurement scheme. The introduction of OAT with strength $T = 0.4 \sigma ^{-1}$ leads to high information (in comparison with the QFI value of $0.25$ in the aberration-free case) well into the sub-Rayleigh regime ($s < 2\sigma$). That is, even though $F_\text {I} = 0$ at exactly $s = 0$ always, one in principle can obtain any $F_\text {I}$ for non-zero $s/\sigma$ by increasing the strength of OAT even if $s/\sigma$ is small.

 figure: Fig. 3.

Fig. 3. CFI (solid curves) associated with the separation $s$ for (a,c) DI and (b,d) BSPADE are shown for the case of (a,b) $P = 0$, varying $T$ and (c,d) $T = 0$, varying $P$. The QFI (dashed curves), $Q_s$, is also plotted for the corresponding values of $P$ and $T$. The black curves in each plot correspond to the aberration-free (shift-invariant) imaging system.

Download Full Size | PDF

Figures 3(c) and (d) consider the case where the imaging system contains Petzval curvature, but no OAT. It can be seen, in a behavior similar to that seen in Figs. 3(a) and (b) that larger values of $P$ lead to larger values of $F_\text {I}$ and $F_\text {II}$ in the sub-Rayleigh regime, with the latter saturating the QFI near $s = 0$. Therefore, like OAT, Petzval curvature can improve the performance of an imaging system regarding two-point separation estimation. However, there are some notable differences between the behaviors of the CFI when the system contains OAT and when it contains Petzval curvature. First, increasing $P$ does not affect the value of the $F_\text {II}$ at exactly $s = 0$ ($F_\text {I} = 0$ regardless of $P$ and $T$). In other words, all the curves seen in Fig. 3(d) coincide in the limit of vanishing separation. Second, there is an interval of $s$ within the sub-Rayleigh regime in which DI outperforms BSPADE. For example, comparing the $P = 0.4 \sigma ^{-2}$ case for $F_\text {I}$ and $F_\text {II}$ in Figs. 3(c) and (d), repsectively, it can be seen that the former reaches a peak of roughly $1.5$ compared to the latter’s $0.75$. Finally, we note that Petzval curvature may lead to the formation of local minima in $F_\text {I}$ and $F_\text {II}$, which indicate that the corresponding measurement schemes perform optimally neither at exactly $s = 0$ nor at $s \rightarrow \infty$, but rather at some intermediate value that, according to Figs. 3(c) and (d), occur in the sub-Rayleigh regime.

For completeness, CFI and QFI curves for the case where both OAT and Petzval curvature are present are shown in Fig. 4 for a fixed value of $T = 0.2 \sigma ^{-1}$ and varying values of $P$. As was observed from Fig. 3, larger values of $T$ raise the maximal attainable information while various values of $P$ lead to CFI curves that peak locally in the sub-Rayleigh regime [although such peaks for the BSPADE CFIs in Fig. 4(b) are not apparent for smaller values of $P$ since the nonzero value of $T$ raises the CFIs at $s = 0$].

 figure: Fig. 4.

Fig. 4. CFI (solid curves) associated with the separation $s$ for (a) DI and (b) BSPADE are shown for the case of $T = 0.2 \sigma ^{-1}$, varying $P$. The QFI (dashed curves), $Q_s$, is also plotted for the corresponding values of $P$ and $T$.

Download Full Size | PDF

It is prudent to point out the behavior of the QFI curves in Figs. 3 and 4 as well. When only OAT is present, the QFI remains constant over $s$, with its value given by

$$Q_s(s;0,T) = F_\text{II}(0;0,T) = \frac{(1 + 2\pi T \sigma)^2}{4\sigma^2},$$
which indicates that $Q_s(s;0,T)$ grows quadratically as a function of $T$. On the other hand, when Petzval curvature is present, the QFI increases from the value given by Eq. (22) at $s = 0$ to larger values as $s$ increases. This peculiar behavior, which is had not been seen for incoherent quantum-inspired superresolution studies of shift-invariant systems, is a result of the presence of the second term in the phase, $\Phi$, of the PSF given by Eq. (7). This quadratic (in $x$) phase contribution is due to the presence of Petzval curvature in $\Delta W$, as stipulated in Eq. (3); Supplement 1 provides more insight regarding the QFI’s behavior.

Maximum likelihood estimation (MLE) simulations for the separation $s$ were performed for DI to support the findings from Eq. (14) and Fig. 3(a) and (c). For the $i$-th (out of $M$) iteration of the simulation, the number of photons arriving at the image plane, $N_i$, was chosen using Poisson statistics around an average photon number of $N$. That is, the probability mass function for $N_i$ is given by

$$p(N_i) = \frac{N^{N_i} \exp({-}N)}{N_i!}.$$

Once $N_i$ is chosen, an equivalent number of photon positions in the image plane is chosen using Eq. (11) as the probability density function for $N_i$ independent events. Given these $N_i$ photon locations, the unbiased MLE estimator, denoted as $\hat {s}$, is used to find an estimate for the separation $s$. The variance of $\hat {s}$, whose average value is $s$, is then calculated over the ensemble of $M$ iterations. In order to compare the simulation results to $F_\text {I}$, one must divide the reciprocal of the aforementioned variance by $N$ to compute the per-photon CFI. This process is repeated for different values of $s, T,$ and $P$ and the results of these simulations are shown in Fig. 5 over the sub-Rayleigh regime $(s<2\sigma )$ for $N = 2000$ photons and $M = 500$ iterations. As is done in Figs. 3(a) and (c), the cases of $P = 0$ and $T = 0$ were analyzed, respectively. It is evident that there is good agreement between the simulation results and $H_\text {I}$.

 figure: Fig. 5.

Fig. 5. Comparisons of $F_\text {I}$ (solid curves) and $\text {Var}^{-1}(\hat {s}-s)/N$ from MLE simulations (discrete points) for the cases of (a) $P = 0$, varying $T$ and (b) $T = 0$, varying $P$. A mean photon number of $N = 2000$ was used for each iteration (each value of $s$) and the variance was calculated over $M = 500$ iterations.

Download Full Size | PDF

3.2 Discussion

The primary result of this work is that it is possible to improve upon the two-point resolution of an aberration-free imaging system by intentionally introducing known off-axis aberrations. Particularly, this is true even for DI measurement schemes as seen in Figs. 3 and 4. In fact, whether the improvement is global (over all $s$, as it is when OAT is present) or only over the sub-Rayleigh regime (when only Petzval curvature is present), the improvement can exceed the improvements offer by modal imaging techniques like BSPADE for aberration-free systems. In other words, if a sufficient amount of off-axis aberration can be provided, DI imaging schemes can lead to large $F_\text {I}$ even in the sub-Rayleigh regime. Since practical applications of two-point imaging deal with non-zero values of $s/\sigma$, increasing $T$ can effectively allow DI measurement schemes to attain the same information allowed by BSPADE in shift-invariant systems. However, this is not to say that BSPADE (and other modal imaging schemes) are fruitless; it is clear from Figs. 3 and 4 that BSPADE benefits in a similar fashion to DI when off-axis aberrations are introduced. Importantly, at least for OAT and Petzval curvature, BSPADE still (as it did for the aberration-free case) yield a $F_\text {II}$ that saturates the QFI in the sub-Rayleigh regime.

The findings in this work are somewhat counter-intuitive as many traditional imaging systems start with substantial aberrations and a significant amount of effort is often needed to minimize such errors towards the goal of an aberration-free (diffraction-limited) system. However, at least in the context of resolving two incoherent and equally bright point sources where the separation is the only unknown parameter, it turns out that the presence of off-axis aberrations can actually improve performance. It may be of interest then, given the number of well-corrected imaging systems in existence, to understand how one may intentionally introduce OAT and Petzval curvature back into these imaging systems in order to take advantage of the larger DI or BSPADE CFI. Of course, it is possible to purposely (re)introduce aberrations by misaligning or adding/subtracting optical elements in the system. However, we should mention that despite their simple geometrical optics interpretations, OAT and Petzval curvature cannot be properly introduced by simply tilting or curving the image plane, respectively. Although these geometrical manipulations of the image surface create effective PSFs that are SV, they do not have the necessary dependence on $\xi$, the object plane coordinate, in order to bring about the results from a genuine $\Delta W$ given by Eq. (3).

4. Concluding remarks

The analysis presented in this work inform on the CFI (for DI and BSPADE) and QFI regarding the separation estimation between two equally bright incoherent point sources when the imaging system includes off-axis aberrations. The resulting shift-variant nature of the system’s field PSF gives rise to CFI and QFI that are novel and, importantly, greater than their counterparts in the aberration-free case. Specifically, OAT and Petzval curvature, which constitute two of the lowest order off-axis Seidel aberrations, were shown to provide improvements to the CFI for both DI and BSPADE measurement schemes. A summary of their effects is provided as follows:

  • • OAT [$\Delta W(\xi,u)$ is linear in both object ($\xi$) and pupil ($u$) locations] induces a magnification on the image field. In other words, two object points with separation $s$ are mapped to two PSFs with (larger) separation $(1+2\pi T \sigma )s$. This magnified separation provides the intuition for globally a larger CFI and QFI compared to the aberration-free case.
  • • Petzval curvature [$\Delta W(\xi,u)$ is quadratic in both object ($\xi$) and pupil ($u$) locations] induces an image field where the width of the PSF varies with object location. As two object points with separation $s$ approach each other $(s\rightarrow 0)$, the width of the two PSFs approaches the aberration-free case ($g \rightarrow \sigma$). This object-dependent width increases the sensitivity of the image field, which in turn leads to a larger CFI and QFI despite the fact that $g > \sigma$ for nonzero separation $s$.
QFI results were also developed and compared with the CFI. Once again, the details of the QFI derivation are shown in Supplement 1.

To more fully appreciate the results of this work, it is valuable to contextualize them with the majority of the research done so far regarding quantum-inspired superresolution. The primary message in recent history is that novel modal imaging schemes, like BSPADE, can provide an advantage in resolution compared to traditional DI measurements. Such claims have been extended, and supported through theory and experiments, to more complicated object distributions (discrete or continuous). Furthermore, additional works have sought to optimize modal imaging. Examples include the development of practical adaptive imaging schemes that leverage advantages in both DI and BSPADE as well as a theoretical analysis on the effect of photon statistics [9,19]. However, our present work demonstrates that the presence of off-axis aberrations, whose study have been largely ignored in the context of quantum-inspired superresolution, provides an improvement to both CFI and QFI that is relatively intuitive and whose mathematical treatment is straightforward.

The findings here are reminiscent of computational imaging techniques in which an imaging system is intentionally altered from the traditional aberration-free DI scheme in order to benefit from the known adjustments. We show that the introduction of OAT and Petzval curvature into imaging systems can improve the CFI in two-point separation estimation and therefore provide a link between the recent field of quantum-inspired superresolution with aspects of computational imaging. However, although the derivation presented here and in Supplement 1 for the CFI and QFI encompasses more than the case where the object scene consists of two equally bright incoherent point sources, realistic objects are much more complicated and require more care in their treatment. In addition to this concern, which is relevant in all of quantum-inspired superresolution analyses, realistic imaging systems have constraints regarding the strength of off-axis aberrations like OAT and Petzval curvature due to manufacturing and tolerancing limitations. Further studies are required to determine the benefits of introducing off-axis imaging systems to resolve general, realistic object scenes. Additional related topics of further exploration include the effects of off-axis aberrations in axial resolution and subsequent experimental verification.

Acknowledgements

The author thanks S. A. Wadood, A. N. Vamivakas, and M. A. Alonso for useful discussions.

Disclosures

The author declares no conflicts of interest.

Data availability

No data were generated or analyzed in the presented research.

Supplemental document

See Supplement 1 for supporting content.

References

1. J. W. Goodman, Fourier Optics (Roberts and Company, 2005), pp. 128–158.

2. C. W. Helstrom, Quantum Detection and Estimation Theory (Academic, 1976), pp. 25–266.

3. M. Tsang, R. Nair, and X.-M. Lu, “Quantum theory of superresolution for two incoherent optical point sources,” Phys. Rev. X 6(3), 031033 (2016). [CrossRef]  

4. M. Tsang, “Subdiffraction incoherent optical imaging via spatial-mode demultiplexing: Semiclassical treatment,” Phys. Rev. A 97(2), 023830 (2018). [CrossRef]  

5. M. Tsang, “Quantum limit to subdiffraction incoherent optical imaging,” Phys. Rev. A 99(1), 012305 (2019). [CrossRef]  

6. M. Tsang and R. Nair, “Resurgence of rayleigh’s curse in the presence of partial coherence: comment,” Optica 6(4), 400–401 (2019). [CrossRef]  

7. S. Kurdzialek, “Back to sources – the role of losses and coherence in super-resolution imaging revisited,” Quantum 6, 697 (2022). [CrossRef]  

8. V. Ansari, B. Brecht, J. Gil-Lopez, J. M. Donohue, J. Řeháček, Z. C. V. Hradil, L. L. Sánchez-Soto, and C. Silberhorn, “Achieving the ultimate quantum timing resolution,” PRX Quantum 2(1), 010301 (2021). [CrossRef]  

9. M. Paúr, B. Stoklasa, Z. Hradil, L. L. Sánchez-Soto, and J. Rehacek, “Achieving the ultimate optical resolution,” Optica 3(10), 1144–1147 (2016). [CrossRef]  

10. S. Zhou and L. Jiang, “Modern description of rayleigh’s criterion,” Phys. Rev. A 99(1), 013808 (2019). [CrossRef]  

11. W. Larson and B. E. A. Saleh, “Resurgence of rayleigh’s curse in the presence of partial coherence,” Optica 5(11), 1382–1389 (2018). [CrossRef]  

12. K. Liang, S. A. Wadood, and A. N. Vamivakas, “Coherence effects on estimating two-point separation,” Optica 8(2), 243–248 (2021). [CrossRef]  

13. K. Liang, S. A. Wadood, and A. N. Vamivakas, “Coherence effects on estimating general sub-rayleigh object distribution moments,” Phys. Rev. A 104(2), 022220 (2021). [CrossRef]  

14. K. Liang, S. A. Wadood, and A. N. Vamivakas, “Quantum fisher information for estimating N partially coherent point sources,” Opt. Express 31(2), 2726–2743 (2023). [CrossRef]  

15. E. Bisketzi, D. Branford, and A. Datta, “Quantum limits of localisation microscopy,” New J. Phys. 21(12), 123032 (2019). [CrossRef]  

16. Z. Hradil, J. Řeháček, L. Sánchez-Soto, and B.-G. Englert, “Quantum fisher information with coherence,” Optica 6(11), 1437–1440 (2019). [CrossRef]  

17. S. A. Wadood, K. Liang, Y. Zhou, J. Yang, M. A. Alonso, X.-F. Qian, T. Malhotra, S. M. H. Rafsanjani, A. N. Jordan, R. W. Boyd, and A. N. Vamivakas, “Experimental demonstration of superresolution of partially coherent light sources using parity sorting,” Opt. Express 29(14), 22034–22043 (2021). [CrossRef]  

18. J. C. Wyant and K. Creath, Basic Wavefront Aberration Theory for Optical Metrology (Academic, 1992), p. 15.

19. M. R. Grace, Z. Dutton, A. Ashok, and S. Guha, “Approaching quantum-limited imaging resolution without prior knowledge of the object location,” J. Opt. Soc. Am. A 37(8), 1288–1299 (2020). [CrossRef]  

Supplementary Material (1)

NameDescription
Supplement 1       Supplement 1 provides additional derivation details for the framework used in the main text.

Data availability

No data were generated or analyzed in the presented research.

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (5)

Fig. 1.
Fig. 1. A schematic of a $4f$ imaging system with $\xi,u,$ and $x$ as the spatial coordinates for the object, pupil, and image planes, respectively. An illustration of two point sources separated by $s$ is shown along with their image distributions in the presence of OAT and Petzval curvature of strengths $T$ and $P$, respectively, which are defined in Eq. (4). The individual PSFs have a half-width of $g$ (which depends on $P$), defined in Eq. (6), and their separation is magnified by a factor of $1 + 2\pi T \sigma$. The pupil is modeled as a Gaussian with half-width $\sigma _p$.
Fig. 2.
Fig. 2. Density plots of $p_\text {I}(x,s;P,T)$ over $x/\sigma$ and $s/\sigma$ for various values of $T\sigma$ and $P\sigma ^2$ in (a) - (d). In each, the solid red lines show the location of the two point sources. The dashed blue lines in (b) and (d) show the centers of intensity distribution from each point source for non-zero OAT. The dashed yellow lines in (c) and (d) show the outer envelope of $p_\text {I}$ for non-zero Petzval curvature.
Fig. 3.
Fig. 3. CFI (solid curves) associated with the separation $s$ for (a,c) DI and (b,d) BSPADE are shown for the case of (a,b) $P = 0$, varying $T$ and (c,d) $T = 0$, varying $P$. The QFI (dashed curves), $Q_s$, is also plotted for the corresponding values of $P$ and $T$. The black curves in each plot correspond to the aberration-free (shift-invariant) imaging system.
Fig. 4.
Fig. 4. CFI (solid curves) associated with the separation $s$ for (a) DI and (b) BSPADE are shown for the case of $T = 0.2 \sigma ^{-1}$, varying $P$. The QFI (dashed curves), $Q_s$, is also plotted for the corresponding values of $P$ and $T$.
Fig. 5.
Fig. 5. Comparisons of $F_\text {I}$ (solid curves) and $\text {Var}^{-1}(\hat {s}-s)/N$ from MLE simulations (discrete points) for the cases of (a) $P = 0$, varying $T$ and (b) $T = 0$, varying $P$. A mean photon number of $N = 2000$ was used for each iteration (each value of $s$) and the variance was calculated over $M = 500$ iterations.

Equations (23)

Equations on this page are rendered with MathJax. Learn more.

U i ( x ) = U o ( ξ ) ψ ( x , ξ ) d ξ ,
ψ ( x , ξ ) = U 0 exp ( i k ξ u f ) exp ( u 2 4 σ p 2 ) exp [ i k Δ W ( u , ξ ) ] exp ( i k u x f ) d u ,
Δ W ( u , ξ ) = W 111 ( ξ 2 σ ) ( u 2 σ p ) + W 220 ( ξ 2 σ ) 2 ( u 2 σ p ) 2 ,
T W 111 λ ( 2 σ ) and P W 220 λ ( 2 σ ) 2 ,
ψ ( x , ξ ; P , T ) = 1 [ 2 π g 2 ( ξ ; P ) ] 1 / 4 exp { [ x ξ ( 1 + 2 π T σ ) ] 2 4 g 2 ( ξ , P ) + i Φ ( x , ξ ; P , T ) } ,
g ( ξ ; P ) σ 1 + 4 π 2 P 2 ξ 4 ,
Φ ( x , ξ ; P , T ) 1 2 { tan 1 ( 2 π P ξ 2 ) + π P ξ 2 [ x ξ ( 1 + 2 π T σ ) ] 2 g 2 ( ξ ; P ) } ,
ρ ( s ; P , T ) = | ψ + ψ + | + A | ψ ψ | + Γ | ψ + ψ | + Γ | ψ ψ + | 1 + A + 2 Re [ Γ d ( s ; P , T ) ] ,
| ψ ± ψ ± ( x ; P , T ) | x d x ψ ( x , ± s 2 ; P , T ) | x d x .
d ( s ; P , T ) ψ | ψ + = ψ ( x , s 2 ; P , T ) ψ ( x , s 2 ; P , T ) d x
p I ( x , s ; P , T ) = x | ρ ( s ; P , T ) | x = | ψ + ( x ; P , T ) | 2 + | ψ ( x ; P , T ) | 2 + 2 Re [ Γ ψ + ( x ; P , T ) ψ ( x ; P , T ) ] 1 + A + 2 Re [ Γ d ( s ; P , T ) ] .
ϕ q ( x ) = 1 ( 2 π σ 2 ) 1 / 4 1 2 q q ! H q ( x 2 σ ) exp ( x 2 4 σ 2 ) ,
p II ( q , s ; P , T ) = ϕ q | ρ ( s ; P , T ) | ϕ q = | ϕ q | ψ + | 2 + | ϕ q | ψ | 2 + 2 Re ( Γ ϕ q | ψ + ψ | ϕ q ) 1 + A + 2 Re [ Γ d ( s ; P , T ) ] .
F I ( s ; P , T ) = 1 p I ( x , s ; P , T ) [ s p I ( x , s ; P , T ) ] 2 d x ,
F II ( s ; P , T ) = q = 0 1 p II ( q , s ; P , T ) [ s p II ( q , s ; P , T ) ] 2 ,
F II ( s ; P , T ) 1 p II ( 0 , s ; P , T ) p II 2 ( 0 , s ; P , T ) [ s p II ( 0 , s ; P , T ) ] 2 .
Var ( s ^ I ) F I 1 and Var ( s ^ II ) F II 1 .
Q s F I and Q s F II .
ρ ( s ; P , T ) s = ρ ( s ; P , T ) L s + L s ρ ( s ; P , T ) 2 ,
Q s ( s ; P , T ) = Tr [ ρ ( s ; P , T ) s L s ] .
p II ( 0 , s ; P , T ) = 1 1 + P 2 π 2 ( s / 2 ) 4 exp { ( s / 2 ) 2 ( 1 + 2 π T σ ) 2 4 σ 2 [ 1 + P 2 π 2 ( s / 2 ) 4 ] } ,
Q s ( s ; 0 , T ) = F II ( 0 ; 0 , T ) = ( 1 + 2 π T σ ) 2 4 σ 2 ,
p ( N i ) = N N i exp ( N ) N i ! .
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.