Orthonormal vector polynomials in a unit circle, Part I: basis set derived from gradients of Zernike polynomials

Chunyu Zhao; James H. Burge

doi:10.1364/OE.15.018014

1. Introduction

Zernike polynomials [1–3] are commonly used in optical testing, engineering, and analysis. There are two reasons for this. First of all, Zernike polynomials are orthogonal in a unit circle, which is convenient since many optics are circular in shape. Secondly, the lower order members of Zernike polynomials represent typical optical wavefront aberrations such as power, astigmatism, coma and spherical aberration. Besides direct wavefront measurements, wavefront slopes are often measured as well, e.g. with shearing interferometry [4], Shack- Hartmann sensors [5], or a scanning pentaprism test [6]. Various techniques have been developed to convert measured slope data to a wavefront map expressed in terms of Zernike polynomials. Garvrelides [7] developed a set of vector polynomials that are orthogonal to the gradients of Zernike polynomials but not mutually orthogonal. The coefficient for a specific Zernike polynomial representing the wavefront can then be directly calculated from integration of the dot product of the slope and the corresponding vector polynomial. Acosta [8] et al, took a different approach but arrived at similar results. This approach skips the intermediate step of fitting the vector slope data and obtains the wavefront directly. Yet, it is desirable to fit measurement data in the measurement space. In this case, a set of vector polynomials is needed to fit the vector slope data.

Vector polynomials are also used for quantifying mapping distortion, which is important for accurate measurement of optical surfaces [9] and can be severe due to the use of null optics. Typically, polynomial mapping functions are defined and the coefficients are fit to data using least squares techniques. [10, 11]

Although the above problems can be solved using a least squares fit to vector functions that are not orthogonal over the domain, the results are not optimal. The fit to a non-orthogonal basis set can require many more terms than are necessary, and the coefficients themselves may not be meaningful, because the value for any particular coefficients will change as higher order terms are fit. When fitting to real data, the propagation of noise is increased with the use of non-orthogonal basis functions. If the functions are truly orthogonal, the least squares solution is not necessary, coefficients can be determined by a much simpler and computationally efficient inner product. Clearly, an orthonormal basis is desired.

In this paper, we present such a desired set of vector polynomials which are orthonormal in a unit circle. These polynomials are perfect for fitting slope data. Since they are gradients of linear combinations of Zernike polynomials, it is also straightforward to convert the fitted slope map to the wavefront map expressed in terms of Zernike polynomials.

In Section 2, we present the Zernike notations that we adopted from Noll’s landmark paper¹ and list the gradients of the Zernikes following the recursion relationships presented there. We then use the Zernike gradients as a basis to obtain an orthonormal set of vector polynomials using the Gram-Schmidt method and present the result in Section 3. The mapping from the orthonormal vector polynomials to gradients of scalar functions represented by standard Zernike polynomials is discussed in Section 4.

The vector set is made complete with the addition of a complementary set of vector polynomials with non-zero curl, as presented in a subsequent paper. [12] The addition of this second set of functions provides a complete basis, capable of representing any vector distribution in the circular domain. Applications of the vector polynomials for fitting the slope data taken from Shack-Hartmann sensors or other slope measurement devices, and in correcting mapping distortions for null tests of aspheric surfaces will be presented in subsequent papers as well.

2. Zernike polynomials and their gradients

There are different numbering schemes for Zernike polynomials. In this paper we adopt Noll’s notation and numbering scheme which defines the polynomials in polar coordinates as

\begin{matrix} Z_{even j} = \sqrt{n + 1} R_{n}^{m} (r) \sqrt{2} \cos (m θ) \\ Z_{odd j} = \sqrt{n + 1} R_{n}^{m} (r) \sqrt{2} \sin (m θ) \end{matrix}} m \neq 0

where

R_{n}^{m} (r) = \sum_{s = 0}^{(n - m) ⁄ 2} \frac{{(- 1)}^{s} (n - s)!}{s! [(n + m) ⁄ 2 - s]! [(n - m) ⁄ 2 - s]!} r^{n - 2 s}

j: the general index of Zernike polynomials

n: the power of the radial coordinate r

m: the multiplication factor of the angular coordinate θ

n and m have the following relations: m≤n and (n - m) is even

The general index j has no physical meaning, while the indices n and m do. For a given j, there is a unique corresponding pair of (n, m), and the parity of j determines the angle dependence of the polynomial. While for a given pair of (n, m), j is ambiguous when m≠0. In some relationships given in the subsequent text, n and m are usually known, but the corresponding j (therefore the sine or cosine angle dependence of the polynomial) depends on other factors. For this reason, we choose to use j(n, m) for the general index of a Zernike polynomial to show that n and m are known and the actual j will be determined by other conditions. The first 37 polynomials of this numbering scheme are listed in the Appendix, where the aforementioned relationship between j and (n, m) can be seen as well.

As the first step toward establishing an orthonormal basis of vector polynomials, we derive the gradients of the Zernike polynomials. We take the gradient of each Zernike polynomial and apply the recursion relationships from Noll to represent the gradients as linear combinations of lower order Zernike polynomials. The first 37 gradient terms are presented in Table 1. These functions provide a complete basis to represent gradients, but they require further manipulation to create an orthonormal set.

Table 1. Gradient of Zernike polynomials

View Table | View all tables in this article

3. An orthonormal set of vector polynomials

We use linear combinations of the above terms to create an orthogonal set. We define the inner product of two vector polynomials defined in a unit circle as

(\vec{A}, \vec{B}) = \frac{1}{π} \iint (\vec{A} \cdot \vec{B}) dx dy

where the integration is over a unit circle.

The inner product is taken of the above gradient functions, and some results are shown in Table 2 (the table is symmetric about the diagonal, but only non-zero elements under the diagonal are shown). These Zernike gradient polynomials are not orthogonal, as the matrix of inner products listed in Table 2 is not diagonal.

Table 2. List of the inner products of the first 13 Zernike gradients

View Table | View all tables in this article

3.1 Orthogonalization of gradient functions

Using the Gram-Schmidt orthogonalization method [13, 14] (general description for the method can be found in Reference 13, and an optical application can be found in Reference 14), we construct a new set of vector polynomials with Zernike gradient polynomials as basis. The gradient of Z₁ is zero, therefore it is not used in the construction of the new set. We choose to index the first polynomial of this new set as 2 to maintain its correspondence with Zernike polynomials. The first 36 such polynomials are listed in Table 3.

Table 3. List of first 36 orthonormal vector polynomials S⃗_i as functions of Zernike gradients.

View Table | View all tables in this article

In general, the S polynomials can be simply expressed in terms of Zernike gradient polynomials:

For all j with n=m,

{\vec{S}}_{j} = \frac{1}{\sqrt{2 n (n + 1)}} \nabla Z_{j} .

For all j with n≠m,

{\vec{S}}_{j} = \frac{1}{\sqrt{4 n (n + 1)}} (\nabla Z_{j} - \sqrt{\frac{n + 1}{n - 1}} \nabla Z_{j' (n' = n - 2, m' = m)})

where j-j′ is even when m≠0.

3.3 S as linear combinations of Zernike polynomials

Given that the vector polynomials S are functions of Zernike gradient polynomials and the Zernike gradient polynomials are functions of Zernike polynomials, we can express S in terms of Zernike polynomials as listed in Table 4.

Table 4. List of S polynomials expressed as linear combinations of Zernike polynomials.

View Table | View all tables in this article

For a given S _j with corresponding indices j(n, m), we define its x and y components as S _jx and S _jy, respectively, i.e.

{\vec{S}}_{j} = \hat{i} S_{jx} + \hat{j} S_{jy}

From observation of the first 37 S polynomials, we found that both S _jx and S _jy are linear combinations of at most two Zernike polynomials with corresponding indices j ^′(n-1,m±1) which may or may not exist binding by the rules n≥m≥0:

S_{jx} = C_{a} Z_{ja (n - 1, m - 1)} + C_{a'} Z_{ja' (n - 1, m + 1)},

For a given j(n, m), a set of rules can be used to determine all the parameters in Eq. (7) to express S _j as linear combinations of Zernike polynomials. These rules are summarized in Table 5. These rules are useful for obtaining analytical expression of any S polynomial by programming. They are complex since we have to deal with different cases of j, n and m combinations. The complexity mostly comes from the numbering scheme. In Noll’s numbering scheme, even j correspond to cosine angle terms and odd j to sine angle terms and these terms swap order each time after an m=0 term. The rules will be simpler if we just use the sine/cosine dependence of the terms. Basically, if an S polynomial has the same j index of a Zernike polynomial, its x-component is the linear combination of the Zernikes with same sine or cosine angle dependence, and the y-component has the opposite angle dependence. For example, for S ₃₂, the corresponding Z ₃₂ has cosine angle dependence, so the x-component of S ₃₂ has Z₂₄ and Z₂₆ terms which both have cosine angle dependence, while the y-component of S ₃₂ has Z₂₃ and Z₂₅ terms which both have sine angle dependence.

Table 5. The rules for writing S in terms of linear combinations of Zernikes.

View Table | View all tables in this article

3.4 Plots of vector polynomial functions

The plots of first 12 S vector polynomials are shown in Table 6.

Table 6. Plots of first 12 S polynomials in a unit circle.

View Table | View all tables in this article

4. Relating the vector polynomials to gradients of scalar functions

The set of S polynomials fully spans the space of vector distributions V⃗(x, y) over the unit circle where a scalar function Φ(x,y) exists such that V⃗(x,y)=∇Φ(x,y). It is useful to represent the vector data using the vector polynomials S and relate to a scalar functions ϕ that are defined as S⃗_i=∇ϕ _i.

Applying the rules listed in (4) and (5), the scalar functions can be calculated as

For all j with n=m,

ϕ_{j} = \frac{1}{\sqrt{2 n (n + 1)}} Z_{j} .

For all j with n≠m,

ϕ_{j} = \frac{1}{\sqrt{4 n (n + 1)}} (Z_{j} - \sqrt{\frac{n + 1}{n - 1}} Z_{j' (n' = n - 2, m' = m)}),

where j-j ^′ is even when m≠0.

These relationships match those demonstrated for the vector functions listed in Table 3. For example, ${\vec{S}}_{7} = \frac{1}{\sqrt{48}} (\nabla Z_{7} - \sqrt{2} \nabla Z_{3})$ , which leads $ϕ_{7} = \frac{1}{\sqrt{48}} (Z_{7} - \sqrt{2} Z_{3})$ .

Applying these relations, the vector data V⃗(x,y) is decomposed into a linear combination of the orthonormal S polynomials as

\vec{V} = \sum α_{i} {\vec{S}}_{i} .

Using the definitions of the scalar functions Φ and ϕ _i(V⃗=∇Φ, S⃗_i=∇ϕ _i), we have

Φ = \sum α_{i} ϕ_{i},

where the coefficients α _i were found from the vector decomposition in Eq. (10). Then the scalar function Φ can in turn be represented as linear combinations of standard Zernike polynomials:

Φ = \sum α_{i} ϕ_{i} = \sum γ_{i} Z_{i}

The coefficients of these standard Zernike polynomials can be found by

γ_{j} = \frac{α_{j (n, m)}}{\sqrt{2 n (n + 1)}} n = m

where j-j ^′ is even when m≠0.

This procedure is useful for applications such as processing data from a Shack Hartmann sensor. The centroid data, which is proportional to wavefront slopes, can be fit to the vector S polynomials to give a set of coefficients α _i. These are converted directly to a standard Zernike polynomial representation of the wavefront, with coefficients γ _i.

A reverse problem is: given a scalar function Φ and its Zernike decomposition coefficients γ _i, we can find α _i from Eq. (13). When Φ is a wavefront, the rms spot radius is $r = 2 f \sqrt{(\sum α_{i}^{2})}$ , where f is the system F number.

5. Summary

We derived an orthonormal set of vector polynomials in a unit circle. It has many potential applications, one of which is fitting slope data in optical testing. These polynomials are linear combinations of at most two Zernike polynomial’s gradients. They can be expressed as linear combinations of at most four scalar Zernike polynomials as well. After wavefront slope data, e.g. data taken with a Shack-Hartmann sensor, is fit with the vector polynomials, it is straightforward to convert the fitted slope map to the wavefront map expressed in terms of Zernike polynomials.

Appendix:

The first 37 Zernike polynomials according to Noll’s numbering:

View Table | View all tables in this article

References and links

1. R. J. Noll, “Zernike polynomials and atmospheric turbulence,” J. Opt. Soc. Am. 66, 207–211 (1976). [CrossRef]

2. M. Born and E. Wolf, Principles of Optics, (Pergamon Press, 1980) pg. 464–468.

3. See http://wyant.opt-sci.arizona.edu/zernikes/zernikes.htm.

4. G. Harbers, P. J. Kunst, and G. W. R. Leibbrandt, “Analysis of lateral shearing interferograms by use of Zernike polynomials,” Appl. Opt. 35, 6162–6172 (1996). [CrossRef] [PubMed]

5. R. G. Lane and M. Tallon, “Wave-front reconstruction using a Shack-Hartmann sensor,” Appl. Opt. 31, 6902–6907 (1992). [CrossRef] [PubMed]

6. P. C. V. Mallik, C. Zhao, and J. H. Burge, “Measurement of a 2-meter flat using a pentaprism scanning system,” Opt. Eng. 46, 023602 (2007). [CrossRef]

7. A. Gavrielides, “Vector polynomials orthogonal to the gradient of Zernike polynomials,” Opt. Lett , 7, 526–528 (1982). [CrossRef] [PubMed]

8. E. Acosta, S. Bara, M. A. Rama, and S. Rios, “Determination of phase mode components in terms of local wave-front slopes: an analytical approach,” Opt. Lett. 20, 1083–1085 (1995). [CrossRef] [PubMed]

9. P. E. Murphy, T. G. Brown, and D. T. Moore, “Interference imaging for aspheric surface testing,” Appl. Opt. 39, 2122–2129 (2000). [CrossRef]

10. J. H. Burge, Advanced Techniques for Measuring Primary Mirrors for Astronomical Telescopes, Ph. D. Dissertation, Optical Sciences, University of Arizona (1993).

11. Durango^TM Interferometry Software, Diffraction International, Minnetonka, MN.

12. C. Zhao and J. H. Burge, “Orthonormal vector polynomials in a unit circle, Part II : completing the basis set,” to be submitted to Optics Express (2007).

13. T. M. Apostol, Linear Algebra: A First Course, with Applications to Differential Equations (John Wiley & Sons, 1997), Page 111–114.

14. R. Upton and B. Ellerbroek, “Gram-Schmidt orthogonalization of the Zernike polynomials on apertures of arbitrary shape,” Opt. Lett. 29, 2840–2842 (2004). [CrossRef]

∇Z₁=0	∇Z₆=î√6Z₂-ĵ√6Z₃
∇Z₂=î2Z₁	∇Z₇=î2√3Z₅+ĵ2(√2Z₁+√6Z₄-√3Z₆)
∇Z₃=ĵ2Z₁	∇Z₈=î2(√2Z₁+√6Z₄+√3Z₆)+ĵ2√3Z₅
∇Z₄=î2√3Z₂+ĵ2√3Z₃	∇Z₉=î2√3Z₅+ĵ2√3Z₆
∇Z₅=î√6Z₃+ĵ√6Z₂	∇Z₁₀=î2√3Z₆-ĵ2√3Z₅
∇Z₁₁=î2√5(Z₂+√2Z₈)+ĵ2√5(Z₃+√2Z₇)
∇Z₁₂=î√10(Z₂+√2Z₈+√2Z₁₀)+ĵ√10(-Z₃-√2Z₇+√2Z₉)
∇Z₁₃=î√10(Z₃+√2Z₇+√2Z₉)+ĵ√10(Z₂+√2Z₈-√2Z₁₀)
∇Z₁₄=î2√5Z₁₀-ĵ2√5Z₉
∇Z₁₅=î2√5Z₉+ĵ2√5Z₁₀
∇Z₁₆=î(2√3Z₁+6Z₄+3√2Z₆+2√15Z₁₁+√30Z₁₂)+ĵ(3√2Z₅+√30Z₁₃)
∇Z₁₇=î(3√2Z₅+√30Z₁₃)+ĵ(2√3Z₁+6Z₄-3√2Z₆+2√15Z₁₁-√30Z₁₂)
∇Z₁₈=î(3√2Z₆+√Z₁₂+√30Z₁₄)+ĵ(-3√2Z₅-√30Z₁₃+√30Z₁₅)
∇Z₁₉=î(3√2Z₅+√30Z₁₃+√30Z₁₅)+ĵ(3Z₆+√30Z₁₂-√30Z₁₄)
∇Z₂₀=î√30Z₁₄-ĵ√30Z₁₅
∇Z₂₁=î√30Z₁₅+ĵ30Z₁₄
∇Z₂₂=î2√7(Z₂+√2Z₈+√3Z₁₆)+ĵ2√7(Z₃+√2Z₇+√3Z₁₇)
∇Z₂₃=î√14(Z₃+√2Z₇+√2Z₉+√3Z₁₇+√3Z₁₉)+ĵ√14(Z₂+√2Z₈-√2Z₁₀+√3Z₁₆-√3Z₁₈)
∇Z₂₄=î√14(Z₂+√2Z₈+√2Z₁₀+√3Z₁₆+√3Z₁₈)+ĵ√14(-Z1₃-√2Z₇+√2Z₉-√3Z₁₇+√3Z₁₉)
∇Z₂₅=î√7(2Z₉+√6Z₁₉+√6Z₂₁)+ĵ√7(2Z₁₀+√6Z₁₈-√6Z₂₀)
∇Z₂₆=î√7(2Z₁₀+√6Z₁₈+√6Z₂₀)+ĵ√7(-2Z₉-√6Z₁₉+√6Z₂₁)
∇Z₂₇=î√42Z₂₁+ĵ√42Z₂₀
∇Z₂₈=î√42Z₂₀-ĵ√42Z₂₁
∇Z₂₉=î√8(√3Z₅+√5Z₁₃+√7Z₂₃)+ĵ√8(√2Z₁+√6Z₄-√3Z₆+√10Z₁₁-√5Z₁₂+√14Z₂₂-√7Z₂₄)
∇Z₃₀=î√8(√2+√6Z₄+√3Z₆+√10Z₁₁+√5Z₁₂+√14Z₂₂+√7Z₂₄)+ĵ√8(√3Z₅+√5Z₁₃+√7Z₂₃)
∇Z₃₁=î√8(√3Z₅+√5Z₁₃+√5Z₁₅+√7Z₂₃+√7Z₂₅)+ĵ√8(√3Z₆+√5Z₁₂-√5Z₁₄+√7Z₂₄-√7Z₂₆)
∇Z₃₂=î√8(√3Z₆+√5Z₁₂+√5Z₁₄+√7Z₂₄+√7Z₂₆)+ĵ√8(-√3Z₅-√5Z₁₃+√5Z₁₅-√7Z₂₃+√7Z₂₅)
∇Z₃₃=î√8(√5Z₁₅+√7Z₂₅+√7Z₂₇)+ĵ√8(√5Z₁₄+√7Z₂₆-√7Z₂₈)
∇Z₃₄=î√8(√5Z₁₄+√7Z₂₆+j√7Z₂₈)+ĵ√8(-√5Z₁₅-√7Z₂₅+√7Z₂₇)
∇Z₃₅=î2√14Z₂₇+ĵ2√14Z₂₈
∇Z₃₆=î2√14Z₂₈-ĵ2√14Z₂₇
∇Z₃₇=î6(Z₂+√2Z₈+√3Z₁₆+2Z₃₀)+ĵ6(Z₃+√2Z₇+√3Z₁₇+2Z₂₉)

${\vec{S}}_{2} = \frac{1}{\sqrt{4}} \nabla Z_{2}$	${\vec{S}}_{14} = \frac{1}{\sqrt{40}} \nabla Z_{14}$	${\vec{S}}_{26} = \frac{1}{\sqrt{168}} (\nabla Z_{26} - \sqrt{\frac{7}{5}} \nabla Z_{14})$
${\vec{S}}_{3} = \frac{1}{\sqrt{4}} \nabla Z_{3}$	${\vec{S}}_{15} = \frac{1}{\sqrt{40}} \nabla Z_{15}$	${\vec{S}}_{27} = \frac{1}{\sqrt{84}} \nabla Z_{27}$
${\vec{S}}_{4} = \frac{1}{\sqrt{24}} \nabla Z_{4}$	${\vec{S}}_{16} = \frac{1}{\sqrt{120}} (\nabla Z_{16} - \sqrt{\frac{3}{2}} \nabla Z_{8})$	${\vec{S}}_{28} = \frac{1}{\sqrt{84}} \nabla Z_{28}$
${\vec{S}}_{5} = \frac{1}{\sqrt{12}} \nabla Z_{5}$	${\vec{S}}_{17} = \frac{1}{\sqrt{120}} (\nabla Z_{17} - \sqrt{\frac{3}{2}} \nabla Z_{7})$	${\vec{S}}_{29} = \frac{1}{\sqrt{224}} (\nabla Z_{29} - \sqrt{\frac{4}{3}} \nabla Z_{17})$
${\vec{S}}_{6} = \frac{1}{\sqrt{12}} \nabla Z_{6}$	${\vec{S}}_{18} = \frac{1}{\sqrt{120}} (\nabla Z_{18} - \sqrt{\frac{3}{2}} \nabla Z_{10})$	${\vec{S}}_{30} = \frac{1}{\sqrt{224}} (\nabla Z_{30} - \sqrt{\frac{4}{3}} \nabla Z_{16})$
${\vec{S}}_{7} = \frac{1}{\sqrt{48}} (\nabla Z_{7} - \sqrt{2} \nabla Z_{3})$	${\vec{S}}_{19} = \frac{1}{\sqrt{120}} (\nabla Z_{19} - \sqrt{\frac{3}{2}} \nabla Z_{9})$	${\vec{S}}_{31} = \frac{1}{\sqrt{224}} (\nabla Z_{31} - \sqrt{\frac{4}{3}} \nabla Z_{19})$
${\vec{S}}_{8} = \frac{1}{\sqrt{48}} (\nabla Z_{8} - \sqrt{2} \nabla Z_{2})$	${\vec{S}}_{20} = \frac{1}{\sqrt{60}} \nabla Z_{20}$	${\vec{S}}_{32} = \frac{1}{\sqrt{224}} (\nabla Z_{32} - \sqrt{\frac{4}{3}} \nabla Z_{18})$
${\vec{S}}_{9} = \frac{1}{\sqrt{24}} \nabla Z_{9}$	${\vec{S}}_{21} = \frac{1}{\sqrt{60}} \nabla Z_{21}$	${\vec{S}}_{33} = \frac{1}{\sqrt{224}} (\nabla Z_{33} - \sqrt{\frac{4}{3}} \nabla Z_{21})$
${\vec{S}}_{10} = \frac{1}{\sqrt{24}} \nabla Z_{10}$	${\vec{S}}_{22} = \frac{1}{\sqrt{168}} (\nabla Z_{22} - \sqrt{\frac{7}{5}} \nabla Z_{11})$	${\vec{S}}_{34} = \frac{1}{\sqrt{224}} (\nabla Z_{34} - \sqrt{\frac{4}{3}} \nabla Z_{20})$
${\vec{S}}_{11} = \frac{1}{\sqrt{80}} (\nabla Z_{11} - \sqrt{\frac{5}{3}} \nabla Z_{4})$	${\vec{S}}_{23} = \frac{1}{\sqrt{168}} (\nabla Z_{23} - \sqrt{\frac{7}{5}} \nabla Z_{13})$	${\vec{S}}_{35} = \frac{1}{\sqrt{112}} \nabla Z_{35}$
${\vec{S}}_{12} = \frac{1}{\sqrt{80}} (\nabla Z_{12} - \sqrt{\frac{5}{3}} \nabla Z_{6})$	${\vec{S}}_{24} = \frac{1}{\sqrt{168}} (\nabla Z_{24} - \sqrt{\frac{7}{5}} \nabla Z_{12})$	${\vec{S}}_{36} = \frac{1}{\sqrt{112}} \nabla Z_{36}$
${\vec{S}}_{13} = \frac{1}{\sqrt{80}} (\nabla Z_{13} - \sqrt{\frac{5}{3}} \nabla Z_{5})$	${\vec{S}}_{25} = \frac{1}{\sqrt{168}} (\nabla Z_{25} - \sqrt{\frac{7}{5}} \nabla Z_{15})$	${\vec{S}}_{37} = \frac{1}{\sqrt{288}} (\nabla Z_{37} - \sqrt{\frac{9}{7}} \nabla Z_{22})$

S⃗₂=îZ ₁	${\vec{S}}_{20} = \frac{1}{\sqrt{2}} (\hat{i} Z_{14} - \hat{j} Z_{15})$
S⃗₃=ĵZ ₁	${\vec{S}}_{21} = \frac{1}{\sqrt{2}} (\hat{i} Z_{15} + \hat{j} Z_{14})$
${\vec{S}}_{4} = \frac{1}{\sqrt{2}} (\hat{i} Z_{2} + \hat{j} Z_{3})$	${\vec{S}}_{22} = \frac{1}{\sqrt{2}} (\hat{i} Z_{16} + \hat{j} Z_{17})$
${\vec{S}}_{5} = \frac{1}{\sqrt{2}} (\hat{i} Z_{3} + \hat{j} Z_{2})$	${\vec{S}}_{23} = \frac{1}{2} (\hat{i} (Z_{17} + Z_{19}) + \hat{j} (Z_{16} - Z_{18}))$
${\vec{S}}_{6} = \frac{1}{\sqrt{2}} (\hat{i} Z_{2} - \hat{j} Z_{3})$	${\vec{S}}_{24} = \frac{1}{2} (\hat{i} (Z_{16} + Z_{18}) + \hat{j} (- Z_{17} + Z_{19}))$
${\vec{S}}_{7} = \frac{1}{2} (\hat{i} Z_{5} + \hat{j} (\sqrt{2} Z_{4} - Z_{6}))$	${\vec{S}}_{25} = \frac{1}{2} (\hat{i} (Z_{19} + Z_{21}) + \hat{j} (Z_{18} - Z_{20}))$
${\vec{S}}_{8} = \frac{1}{2} (\hat{i} (\sqrt{2} Z_{4} + Z_{6}) + \hat{j} Z_{5})$	${\vec{S}}_{26} = \frac{1}{2} (\hat{i} (Z_{18} + Z_{20}) + \hat{j} (- Z_{19} + Z_{21}))$
${\vec{S}}_{9} = \frac{1}{\sqrt{2}} (\hat{i} Z_{5} + \hat{j} Z_{6})$	${\vec{S}}_{27} = \frac{1}{\sqrt{2}} (\hat{i} Z_{21} + \hat{j} Z_{20})$
${\vec{S}}_{10} = \frac{1}{\sqrt{2}} (\hat{i} Z_{6} - \hat{j} Z_{5})$	${\vec{S}}_{28} = \frac{1}{\sqrt{2}} (\hat{i} Z_{20} - \hat{j} Z_{21})$
${\vec{S}}_{11} = \frac{1}{\sqrt{2}} (\hat{i} Z_{8} + \hat{j} Z_{7})$	${\vec{S}}_{29} = \frac{1}{2} (\hat{i} Z_{23} + \hat{j} (\sqrt{2} Z_{22} - Z_{24}))$
${\vec{S}}_{12} = \frac{1}{2} (\hat{i} (Z_{8} + Z_{10}) + \hat{j} (- Z_{7} + Z_{9}))$	${\vec{S}}_{30} = \frac{1}{2} (\hat{i} (\sqrt{2} Z_{22} + Z_{24}) + \hat{j} Z_{23})$
${\vec{S}}_{13} = \frac{1}{2} (\hat{i} (Z_{7} + Z_{9}) + \hat{j} (Z_{8} - Z_{10}))$	${\vec{S}}_{31} = \frac{1}{2} (\hat{i} (Z_{23} + Z_{25}) + \hat{j} (Z_{24} - Z_{26}))$
${\vec{S}}_{14} = \frac{1}{\sqrt{2}} (\hat{i} Z_{10} - \hat{j} Z_{9})$	${\vec{S}}_{32} = \frac{1}{2} (\hat{i} (Z_{24} + Z_{26}) + \hat{j} (- Z_{23} + Z_{25}))$
${\vec{S}}_{15} = \frac{1}{\sqrt{2}} (\hat{i} Z_{9} + \hat{j} Z_{10})$	${\vec{S}}_{33} = \frac{1}{2} (\hat{i} (Z_{25} + Z_{27}) + \hat{j} (Z_{26} - Z_{28}))$
${\vec{S}}_{16} = \frac{1}{2} (\hat{i} (\sqrt{2} Z_{11} + Z_{12}) + \hat{j} Z_{3})$	${\vec{S}}_{34} = \frac{1}{2} (\hat{i} (Z_{26} + Z_{28}) + \hat{j} (- Z_{25} + Z_{27}))$
${\vec{S}}_{17} = \frac{1}{2} (\hat{i} Z_{3} + \hat{j} (\sqrt{2} Z_{11} - Z_{12}))$	${\vec{S}}_{35} = \frac{1}{\sqrt{2}} (\hat{i} Z_{27} + \hat{j} Z_{28})$
${\vec{S}}_{18} = \frac{1}{2} (\hat{i} (Z_{12} + Z_{14}) + \hat{j} (- Z_{13} + Z_{15}))$	${\vec{S}}_{36} = \frac{1}{\sqrt{2}} (\hat{i} Z_{28} - \hat{j} Z_{27})$
${\vec{S}}_{19} = \frac{1}{2} (\hat{i} (Z_{13} + Z_{15}) + \hat{j} (Z_{12} - Z_{14}))$	${\vec{S}}_{37} = \frac{1}{\sqrt{2}} (\hat{i} Z_{30} + \hat{j} Z_{29})$

S _j (n,m)=îS _jx +ĵS _jy			Rules
S _j (n,m)=îS _jx +ĵS _jy						S _jx		S _jy
m	n	j	ja, ja^’	jb, jb^’	ja-jb	C_a (Z_ja(n-1,m-1))	C_a^’ (Z_{ja^’(n-1,m+1)})	C_b (Z_jb(n-1,m-1))	C_b’ (Z_{jb^’(n-1,m+1}))
1	1	2	ja=1, no ja^’	jb=1, no jb^’	0	1	NA	0	NA
1	1	3	ja=1, no ja^’	jb=1, no jb^’	0	0	NA	1	NA
0	even ≥2		no ja, ja^’ even	no jb, jb^’ odd		NA	$\sqrt{1 ⁄ 2}$	NA	$\sqrt{1 ⁄ 2}$
1	=2s+1, s odd	even	ja=jb, ja^’-ja=2	jb=ja, jb^’-jb=1	0	$\sqrt{1 ⁄ 2}$	1/2	0	1/2
	=2s+1, s odd	odd	ja=jb, ja^’-ja=1	jb=ja, jb^’-jb=2	0	0	1/2	$\sqrt{1 ⁄ 2}$	-1/2
	=2s+1, s even	even	ja=jb, ja^’-ja=1	jb=ja, jb^’-jb=2	0	$\sqrt{1 ⁄ 2}$	1/2	0	1/2
	=2s+1, s even	odd	ja=jb, ja^’-ja=2	jb=ja, jb^’-jb=1	0	0	1/2	$\sqrt{1 ⁄ 2}$	-1/2
m=n>1	=2s or 2s-1 s odd	even	no ja^’	no jb^’	-1	$\sqrt{1 ⁄ 2}$	NA	$- \sqrt{1 ⁄ 2}$	NA
	=2s or 2s-1 s odd	odd no ja^’	no jb^’	1	$\sqrt{1 ⁄ 2}$	NA	$\sqrt{1 ⁄ 2}$	NA
	=2s or 2s-1, s even	even	no ja^’	no jb^’	1	$\sqrt{1 ⁄ 2}$	NA	$- \sqrt{1 ⁄ 2}$	NA
	=2s or 2s-1, s even	odd	no ja^’	no jb^’	-1	$\sqrt{1 ⁄ 2}$	NA	$\sqrt{1 ⁄ 2}$	NA
m>1 m<n	=2s or 2s-1 s odd	even	ja^’-ja=2	jb^’-jb=2	-1	1/2	1/2	-1/2	1/2
	=2s or 2s-1 s odd	odd	ja^’-ja=2	jb^’-jb=2	1	1/2	1/2	1/2	-1/2
	=2s or 2s-1, s even	even	ja^’-ja=2	jb^’-jb=2	1	1/2	1/2	-1/2	1/2
	=2s or 2s-1, s even	odd	ja^’-ja=2	jb’-jb=2	-1	1/2	1/2	1/2	-1/2

j	n, m	Zernikes
1	n=0 m=0	Z₁=1
2	n=1 m=1	Z₂=2r cos θ
3	n=1 m=1	Z₃=2r sin θ
4	n=2 m=0	Z₄=√3(2r ² − 1)
5	n=2 m=2	Z₅=√6r ² sin 2θ
6	n=2 m=2	Z₆=√6r ² cos 2θ
7	n=3 m=1	Z₇=√8(3r ³ − 2r) sin θ
8	n=3 m=1	Z₈=√8(3r ³ − 2r) cos θ
9	n=3 m=3	Z₉=√8r ³ sin 3θ
10	n=3 m=3	Z₁₀=√8r ³ cos 3θ
11	n=4 m=0	Z₁₁=√5(6r ⁴ − 6r ²+1)
12	n=4 m=2	Z₁₂=√10(4r ⁴ − 3r ²) cos 2θ
13	n=4 m=2	Z₁₃=√10(4r ⁴ − 3r²) sin 2θ
14	n=4 m=4	Z₁₄=√10r⁴ cos 4θ
15	n=4 m=4	Z₁₅=√10r⁴ sin 4θ
16	n=5 m=1	Z₁₆=√12(10r ⁵ − 12r ³+3r) cos θ
17	n=5 m=1	Z₁₇=√12(10r ⁵ − 12r ³+3r) sin θ
18	n=5 m=3	Z₁₈=√12(5r ⁵ − 4r ³) cos 3θ
19	n=5 m=3	Z₁₉=√12(5r ⁵ − 4r ³) sin 3θ
20	n=5 m=5	Z₂₀=√12r ⁵ cos 5θ
21	n=5 m=5	Z₂₁=√12r ⁵ sin 5θ
22	n=6 m=0	Z₂₂=√7(20r ⁶ − 30r ⁴+12r ² − 1)
23	n=6 m=2	Z₂₃=√14(15r ⁶ − 20r ⁴+6r ²) sin 2θ
24	n=6 m=2	Z₂₄=√14(15r ⁶ − 20r ⁴+6r ²) cos 2θ
25	n=6 m=4	Z₂₅=√14(6r ⁶ − 5r ⁴) sin 4θ
26	n=6 m=4	Z₂₆=√14(6r ⁶ − 5r ⁴) cos 4θ
27	n=6 m=6	Z₂₇=√14r ⁶ sin 6θ
28	n=6 m=6	Z₂₈=√14r ⁶ cos 6θ
29	n=7 m=1	Z₂₉=√16(35r ⁷ − 60r ⁵+30r ³ − 4r) sin θ
30	n=7 m=1	Z₃₀=√16(35r ⁷ − 60r ⁵+30r ³ − 4r) cos θ
31	n=7 m=3	Z₃₁=√16(21r ⁷ − 30r ⁵+10r ³) sin 3θ
32	n=7 m=3	Z₃₂=√16(21r ⁷ − 30r ⁵+10r ³) cos 3θ
33	n=7 m=5	Z₃₃=√16(7r ⁷ − 6r ⁵) sin 5θ
34	n=7 m=5	Z₃₄=√16(7r ⁷ − 6r ⁵) cos 5θ
35	n=7 m=7	Z₃₅=√16r ⁷ sin 7θ
36	n=7 m=7	Z₃₆=√16r ⁷ cos 7^θ
37	n=8 m=0	Z₃₇=√9(70r ⁸ − 140r ⁶+90r ⁴ − 20r ²+1)

Orthonormal vector polynomials in a unit circle, Part I: basis set derived from gradients of Zernike polynomials

Abstract

1. Introduction

2. Zernike polynomials and their gradients

3. An orthonormal set of vector polynomials

3.1 Orthogonalization of gradient functions

3.3 S as linear combinations of Zernike polynomials

3.4 Plots of vector polynomial functions

4. Relating the vector polynomials to gradients of scalar functions

5. Summary

Appendix:

References and links

Cited By

Tables (7)

Equations (16)

Optics Express