Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Spinning metasurface stack for spectro-polarimetric thermal imaging

Open Access Open Access

Abstract

Spectro-polarimetric imaging in the long-wave infrared (LWIR) region plays a crucial role in applications from night vision and machine perception to trace gas sensing and thermography. However, the current generation of spectro-polarimetric LWIR imagers suffers from limitations in size, spectral resolution, and field of view (FOV). While meta-optics-based strategies for spectro-polarimetric imaging have been explored in the visible spectrum, their potential for thermal imaging remains largely unexplored. In this work, we introduce an approach for spectro-polarimetric decomposition by combining large-area stacked meta-optical devices with advanced computational imaging algorithms. The co-design of a stack of spinning dispersive metasurfaces along with compressive sensing and dictionary learning algorithms allows simultaneous spectral and polarimetric resolution without the need for bulky filter wheels or interferometers. Our spinning-metasurface-based spectro-polarimetric stack is compact (${\lt}\;{10} \times {10} \times {10}\;{\rm cm}$) and robust, and it offers a wide field of view (20.5°). We show that the spectral resolving power of our system substantially enhances performance in machine learning tasks such as material classification, a challenge for conventional panchromatic thermal cameras. Our approach represents a significant advance in the field of thermal imaging for a wide range of applications including heat-assisted detection and ranging (HADAR).

© 2024 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

1. INTRODUCTION

The demand for high-resolution, information-rich image data has been amplified by the widespread industry adoption of machine learning algorithms [16]. Meta-optics-enabled spectral and polarimetric imaging exhibits potential in meeting these rising data demands of learning algorithms within the visible spectrum [715]. Yet, the integration of meta-optics with infrared thermal imaging remains a relatively unexplored domain. Recent strides in heat-assisted detection and ranging (HADAR) have demonstrated the potential of thermal imaging [16] for machine perception tasks utilizing the infrared spectrum and the atmospheric transparency window. By combining spectral-resolved thermal imaging with artificial intelligence, HADAR offers a platform for machine perception through pitch darkness like broad daylight [16]. Here, our goal is to show that integrating meta-optics with infrared thermal imaging emerges as a crucial enabling factor for such next-generation machine vision algorithms. Our work offers a pathway for realizing compact spectro-polarimetric thermal imagers beyond the conventional technologies that use push-broom, filter-wheel, or interferometer modules. This new class of thermal imagers enhances the ability of machine learning algorithms to capture information such as temperature, material composition, and surface morphology.

Within the broad thermal infrared spectrum, the long-wave infrared (LWIR) region stands out as especially advantageous for heat-assisted detection and ranging, as most objects at room temperature radiate thermal energy at these wavelengths [Fig. 1(a)]. Moreover, the LWIR atmospheric transmission window facilitates the thermal radiation signal to propagate long distances [17]. However, the blurry nature of thermal images poses a significant challenge for machine vision algorithms. For example, perception tasks such as semantic segmentation and ranging rely on texture and contrast for learning features and show poor performance with panchromatic thermal imagery [18]. HADAR overcomes the blurry nature of thermal images by decomposing the LWIR thermal radiation into its spectral channels [16]. Unfortunately, the commonly used mosaic filter technique for spectral decomposition [Fig. 1(b)] [1924] falls short for LWIR thermal imaging due to the restricted pixel count in LWIR focal plane arrays. Current HADAR spectral imagers rely on cumbersome filter wheels [Fig. 1(c)] with restricted spectral resolution, or fragile and bulky interferometers [Fig. 1(d)] with a confined field of view (FOV) [16]. Such constraints considerably hinder the widespread applicability and adoption of HADAR.

 figure: Fig. 1.

Fig. 1. Long-wave infrared (LWIR) spectro-polarimetric thermal imaging. (a) Room-temperature blackbody radiation (shown in red) and atmospheric transmission spectrum (shown as a shaded area). The LWIR spectral region is crucial for thermal imaging due to its peaked room-temperature thermal radiation spectrum and the atmospheric transparency window. (b)–(d) Conventional methods for spectral imaging, such as using (b) a mosaic sensor, (c) a filter wheel, or (d) interferometry, either pose limitations or are infeasible for LWIR thermal imaging. (e) In this study, we propose a new approach for spectro-polarimetric thermal imaging, achieved by combining large-area spinning metasurfaces and compressive sensing reconstruction algorithms.

Download Full Size | PDF

To address these limitations, in this work, we harness the capabilities of large-area meta-optics in the infrared domain to create a platform for spectro-polarimetric thermal imaging. We achieve spectral decomposition by designing the dispersion and polarization through stacked metasurfaces, while the spectral reconstruction is accomplished using compressive sensing and dictionary learning algorithms. Our designs employ 2D structures with large feature sizes (${\gt}1\;{\unicode{x00B5}{\rm m}}$), which can be made using photo-lithography techniques, facilitating large-area fabrication and scalable manufacturing. Despite its compact form, our Spinning MetaCam delivers exceptional spectral resolution across diverse materials and can effectively unveil subtle polarimetric features, enabling applications such as material classification. By leveraging meta-optics and advanced computational spectroscopy methods, we demonstrate three times higher accuracy for machine vision tasks than conventional panchromatic thermal imaging. In the future, the angular speed of the spinning stage can be increased to enable real-time video leading to widespread adoption of HADAR technology similar to RADAR, LIDAR, and SONAR.

2. DESIGN OF THE SPINNING METASURFACE STACK

The architecture of our spinning-metasurface based spectro-polarimetric imaging system is depicted in Fig. 1(e). It comprises a broadband linear polarizer, three anisotropic and dispersive metasurfaces, and an LWIR imaging sensor. The polarizer is utilized to polarize the incoming thermal radiation signals, and the metasurfaces are utilized to realize spectral filtering. We design the metasurfaces with high anisotropy to produce distinct spectral responses for orthogonal polarizations. Additionally, the metasurfaces’ dispersion rotates different wavelengths of radiation to varying polarization orientations. By using the metasurfaces in tandem and axially spinning the polarizer and metasurfaces to different angles, we obtain tunable transmission spectra that sample the incident thermal radiation in its spectral and polarimetric channels. We then reconstruct unknown spectra of imaging targets using compressive sensing and dictionary learning algorithms. Dictionary learning generates a set of basis functions that represent the unknown spectra in a sparse format [25]. Compressive sensing enables accurate reconstruction of the sparse spectra from a limited number of measurements [26]. Combining these two techniques enables accurate and stable spectral reconstruction in the presence of noise and measurement errors [27]. The four-dimensional spectro-polarimetric data generated by our system offers a wealth of physical information about an imaging target, making it a valuable tool for physics-driven machine vision [5,28], facilitating various tasks such as object detection and semantic segmentation [29,30].

 figure: Fig. 2.

Fig. 2. Design and characterization of the spinning metasurfaces. (a)–(c) Schematics of the three different metasurface devices. (d) Optical microscope image of a fabricated metasurface. Inset, an optical image of a 25.4 mm-diameter device used for the imaging experiments, highlighting the large-area uniformity. (e) Measured polarized transmission spectra (${t_{\textit{ip}}}$ and ${t_{\textit{is}}}$) of the three metasurfaces (M1–M3), displaying strong anisotropy and distinctive dispersion. (f) Generated tunable transmission spectra by our spinning metasurface stack. The upper boundary of colored areas represents the transmission spectra corresponding to specific spinning angle combinations of the three metasurfaces. The colored areas highlight the differences between the generated transmission spectra. The low correlation between the generated transmission spectra is crucial for improving spectral reconstruction performance. Inset, an optical image of the integrated spinning-metasurface module based on motorized rotatory mounts. The overall size of the module is smaller than ${10}\;{\rm cm} \times {10}\;{\rm cm} \times {10}\;{\rm cm}$, making it a promising platform for next-generation high-contrast LWIR thermal imaging. (g) Normalized spatial transmittance of the module at two representative spinning angle combinations $\Theta = { (0^ \circ}{,0^ \circ}{,0^ \circ})$ shown at the top and $\Theta = { (90^ \circ}{,90^ \circ}{,90^ \circ})$ illustrated at the bottom. The red circles correspond to a transmittance value of 0.5. Note that the spatial transmittance remains relatively consistent across all different $\Theta$ combinations, considering that the field of view is primarily limited by the diameters of the metasurface devices. The estimated field of view of the imaging system across the LWIR range is approximately 20.5°.

Download Full Size | PDF

To quantitatively describe the mechanism of the spinning-angle-controlled transmission spectra, we represent the spectro-polarimetric response of the metasurfaces using Jones matrices. Assuming that the transmission axis of the input linear polarizer is at 0 deg relative to the $x$ axis, the Jones matrix ${J_i}$ of a metasurface $i$ with the principal axis ($p$) at a spinning angle ${\theta _i}$ relative to the $x$ axis can be expressed as

$$\begin{split}{{J_i}({\theta _i},\lambda)}& = {R(- {\theta _i}) \cdot {J_{{M_i}}}(\lambda) \cdot R({\theta _i})}\\ &= {\left[{\begin{array}{*{20}{l}}{\cos ({\theta _i})}&{- \sin ({\theta _i})}\\{\sin ({\theta _i})}&{\cos ({\theta _i})}\end{array}} \right]\left[{\begin{array}{*{20}{c}}{{t_{\textit{ip}}}(\lambda)}&0\\0&{{t_{\textit{is}}}(\lambda)}\end{array}} \right]\left[{\begin{array}{*{20}{l}}{\cos ({\theta _i})}&{\sin ({\theta _i})}\\{- \sin ({\theta _i})}&{\cos ({\theta _i})}\end{array}} \right]}\end{split},$$
where ${ R}$ is the rotation matrix, ${J_{\textit{Mi}}}$ contains the anisotropic transmission of the metasurface ${t_{\textit{ip}}}$ and ${t_{\textit{is}}}$ along the two principle axes $p$ and $s$.

The Jones matrix of the three-metasurface assembly is given by

$$J\!\left({\Theta ,\lambda} \right) = {J_1}\!\left({{\theta _1},\lambda} \right) \cdot {J_2}\!\left({{\theta _2},\lambda} \right) \cdot {J_3}\!\left({{\theta _3},\lambda} \right) .$$

Thus, the total transmission spectrum of the three spinning metasurfaces strongly depends on the spinning-angle combinations $\Theta = ({{\theta _1},\,{\theta _2},\,{\theta _3}})$ when the constituted metasurfaces are strongly anisotropic and dispersive, i.e., ${t_{\textit{ip}}}(\lambda) \ne {t_{\textit{is}}}(\lambda)$ (see Supplement 1 for the detailed analysis). We note that large differences between the spectral responses of the three metasurfaces (M1, M2, M3) are also introduced to minimize the correlations between the generated spectra, which can significantly improve the spectral reconstruction performance [31]. We note that the metasurfaces do not necessitate rotation at fixed angular frequencies during the imaging process. As depicted in Eq. (2), it is the static angular positioning of the metasurfaces that is pivotal for selectively filtering the incident thermal radiation in its spectral content. We emphasize that our design generates a large set of distinct transmission spectra with only three metasurfaces, while the total number of spectra in the traditional mosaic array is limited to the number of metasurfaces/filters used [7,8,1923].

 figure: Fig. 3.

Fig. 3. Schematic of the spectral reconstruction process. (a) The measured raw signal can be expressed by (b) the pre-calibrated spectral response function of the imaging system multiplied by (c) the spectrum of an imaging target. (c), (d) For the spectral reconstruction, the unknown spectrum ${P_\lambda}$ is projected onto a sparse representation basis ${D_{{k\lambda}}}$ using dictionary learning. (e) This sparse representation ${\phi _k}$ is then used for compressive sensing based reconstruction. The use of compressive sensing and dictionary learning in the reconstruction process significantly improves the reconstruction accuracy, making the spinning-metasurface-based spectro-polarimetric imaging more robust against noise and measurement errors.

Download Full Size | PDF

Accordingly, we design the metasurfaces and experimentally achieve three key characteristics for optimized spectro-polarimetric imaging performance: (1) strong anisotropy and dispersion for efficient wavelength decomposition; (2) high transmission and low self-emission for high signal to noise ratio (SNR); (3) small angular dependence for a large FOV. The unit cells of the three designed metasurfaces are shown in Figs. 2(a)–2(c). Strong dispersive anisotropy of the transmission spectra can be observed in Fig. 2(e). Additionally, we emphasize that large-area devices are generally required for imaging applications to ensure sufficient numerical aperture. All the metasurfaces designed here have feature sizes larger than 1 µm. Large-area devices (25.4 mm in diameter) with high structural quality and uniformity [Fig. 2(d)] can be rapidly fabricated by standard photo-lithography techniques, enabling scalable manufacturing for practical applications. This is in strong contrast to recent works on miniaturized spectrometers [3240], where the device footprint is on the micrometer scale and thus not suitable for imaging applications.

The tunable transmission spectra produced by our spinning metasurfaces are shown in Fig. 2(f). The distinct spectra are a result of the tuned spinning-angle combination $\Theta$. We integrate the three fabricated metasurfaces tandemly via compact rotatory mounts to independently control the rotation of each metasurface [inset of Fig. 2(f)]. We also optimize the spinning-angle combinations of the three spinning metasurfaces using genetic algorithms to generate largely uncorrelated transmission spectra for optimal spectral reconstruction performance (see Supplement 1 for details). Additionally, we note that increasing the number of metasurfaces can further improve the spectral resolution but simultaneously reduces the SNR as the peak transmissions of the LWIR devices are limited to around 0.6 (see Supplement 1, Fig. S5 for details). However, our method has the potential to scale up into the hyperspectral regime by adding more high-transmission LWIR metasurfaces.

We also evaluate the FOV of our imaging module by integrating it with an LWIR thermal camera and capturing images of a large-area uniform blackbody. To determine the spatial transmission efficiency, we normalize the signal counts of each pixel by the counts at the center of the images. We also define the angular range with transmittance above 0.5 as the effective FOV of a system. As seen in Fig. 2(g), our spinning metasurface module has an FOV of around 20.5 deg, which is difficult to achieve with interferometer-based spectral imagers.

3. ALGORITHM DESIGN FOR SPECTRAL RECONSTRUCTION

To extract the unknown spectro-polarimetric properties of various imaging targets, we use a combination of dictionary learning and compressive sensing algorithms in the reconstruction process. The tunable transmission spectra produced by our spinning metasurfaces [shown in Fig. 2(f)] are not narrowband, which means that the collected raw signals at different spinning-angle combinations $\Theta$ do not directly reflect the spectral radiance at different wavelengths. Instead, the collected signal $I(\Theta)$ at each pixel can be described as an integral of the spectral response function $R(\Theta ,\lambda)$ multiplied by the ground truth spectrum $P(\lambda)$ that we wish to obtain, i.e., $I(\Theta) = \int_{{\lambda _{{\min}}}}^{{\lambda _{{\max}}}} R(\Theta ,\lambda)P(\lambda){\rm d}\lambda$. To solve for this equation, we discretize the spectral range of interest and express it in a tensor form as shown in Eq. (3),

$${I_\Theta} = {R_{\Theta \lambda}}{P_\lambda}.$$

We emphasize that directly solving Eq. (3) does not produce accurate spectral reconstructions [33,41]. In theory, we can use measured signals ${I_\Theta}$ and the pre-calibrated response function ${R_{\theta \lambda}}$ to directly determine unknown spectra ${P_\lambda}$ at each pixel of a scene. However, in practice, two limitations impede the performance of spectral reconstruction: the problem becomes underdetermined when there are many discretized wavelength bands, and measurement noise affects both ${I_\Theta}$ and ${R_{\Theta \lambda}}$, making the direct reconstruction method unstable and the results inaccurate.

 figure: Fig. 4.

Fig. 4. Spectro-polarimetric thermal imaging results. (a) An optical image of the “PURDUE” imaging target that is constructed from titanium letters on a glass substrate (${75}\;{\rm mm} \times {50}\;{\rm mm}$). Inset, a zoomed-in optical microscope image of the micro-structures in the letters, which generate distinctive spectral and polarimetric signatures. (b)–(e) Reconstructed spectra of four representative pixels (corresponding to the letter “R”, “U”, “E”, and the glass substrate, respectively) compared with the ground truth spectra measured by a Fourier-transform infrared spectrometer. (f) Reconstructed spectral frames at 6 representative wavelengths. The contrast between different frames demonstrates that the system can effectively reveal the LWIR spectral properties of various materials and structures. (g), (h) Degree-of-linear-polarization and angle-of-linear-polarization frames. Distinctive polarimetric signatures can be observed for each letter in the images. (i) Simulated spectral reconstruction results. The ground truth spectra (solid lines) are Gaussian peaks with 0.6 µm FWHM centered at different wavelengths (8.5 µm, 9.3 µm, 10.2 µm, 11 µm, 11.8 µm, 12.7 µm, and 13.5 µm). The reconstructed spectra (dotted lines) show good agreements with the ground truth.

Download Full Size | PDF

To enhance both the precision and consistency of spectral reconstruction, we leverage the capabilities of compressive sensing and dictionary learning algorithms to solve Eq. (3) (Fig. 3). Our approach begins with dictionary learning, a process we employ to generate a dictionary comprising 32 basis functions, designated as $D$. The spectra utilized for the dictionary learning are derived from the infrared emissivity spectra drawn from the ECOSTRESS Spectral Library [42,43], as well as blackbody thermal radiation spectra across various temperatures. The resultant dictionary can provide sparse representations for any thermal radiation spectrum in the space of spectra we are studying. We point out that the efficiency of this sparse coding is fundamentally linked to the prevalent spectral similarities found within the LWIR thermal radiation spectra. We thus project the unknown spectrum ${P_\lambda}$ as a linear combination of the basis functions in the dictionary [Figs. 3(c) and 3(d)]. We have

$${P_\lambda} = {D_{\lambda k}}{\phi _k},$$
where ${\phi _k}$ is a sparse coding of the spectrum ${P_\lambda}$. With this sparse representation, the spectral reconstruction problem can be solved by first obtaining ${\phi _{{\rm recon}}}$,
$$\begin{split}&{{\phi _{{\rm recon}}} = \arg \mathop {\min}\limits_{{\phi _k}} \parallel\! {\phi _k}{\parallel _1}}\\&{{\rm s}{\rm .t}{\rm .}\,{{\left\| {{I_\Theta} - {R_{\Theta \lambda}}{P_\lambda}} \right\|}_2} = {{\left\| {{I_\Theta} - {A_{\Theta k}}{\phi _k}} \right\|}_2} \lt \epsilon},\end{split}$$
where ${A_{\Theta k}} = {R_{\Theta \lambda}}{D_{\lambda k}}$, and $\epsilon$ is the residual error. Finally, the spectra ${P_\lambda}$ at each pixel of a scene is reconstructed by
$${P_{{\rm recon}}} = {D_{_{\lambda k}}}{\phi _{{\rm recon}}}.$$

Our reconstruction method significantly improves the reconstruction accuracy, making spinning-metasurface-based spectro-polarimetric imaging more robust against noise and measurement errors.

 figure: Fig. 5.

Fig. 5. Material classification using the Spinning MetaCam. (a) Simulated spectral reconstruction results for materials commonly found in everyday life. (b) Calculated confusion matrix illustrating material classification without the use of spinning metasurfaces. The classification in this case is solely reliant on the integrated intensity of thermal radiation within the LWIR region. (c) Calculated confusion matrix presenting material classification with spinning metasurfaces incorporated. The inclusion of spectral information substantially augments the performance of material classification, improving the mean average precision from 0.293 to 0.834.

Download Full Size | PDF

4. SPECTRO-POLARIMETRIC IMAGING AND MACHINE VISION

To evaluate the performance of our prototype imaging system, we conduct experiments using a custom-designed “PURDUE” target made of letters constructed from titanium and a glass substrate [Fig. 4(a)]. Each letter has unique micro-grating structures [inset of Fig. 4(a)] that generate distinctive spectro-polarimetric signatures in the thermal radiation signal. The glass substrate also features a characteristic emission peak around 11 µm. Note that we heat the image target to 150°C to generate high signal intensity. The reconstructed spectra of four representative pixels are shown in Figs. 4(b)–4(e). We compare them with the ground truth spectra measured by a Fourier-transform infrared spectrometer, validating the effectiveness of our reconstruction approach. The reconstructed spectral frames [Fig. 4(f)] also exhibit high contrasts between different wavelengths, demonstrating that the system can effectively reveal the LWIR spectral properties of different targets. We note that the relatively low reconstruction accuracy at shorter wavelengths (8–10 µm) results from the low transmission (low SNR) and the high correlation (similarity) between the tuned spectra [Fig. 2(f)].

To characterize the spectral resolution of our system, we conduct a numerical simulation of the spectral reconstruction performance using Gaussian peaks centered at varying wavelengths. This testing method is a widely adopted way to determine the fundamental resolution constraints of reconstruction-based spectroscopy [21,22,24,33,35,41]. The simulation incorporates the tunable transmission spectrum set [as depicted in Fig. 2(f)] produced by the spinning-metasurface module and utilizes the same dictionary learning and compressive sensing algorithms previously discussed. Figure 4(i) reveals that Gaussian peaks with a 0.6 µm full width at half maximum (FWHM) centered above 10 µm can be precisely reconstructed with negligible mean square errors (MSEs). Additional simulations also show that our system is capable of detecting narrower spectral peaks with FWHM as low as 0.1 µm (see Supplement 1, Fig. S4 for details). We again observe that the performance of spectral reconstruction is dependent on wavelength. A higher level of accuracy is achieved at longer wavelengths due to the significant differences between the tuned transmission spectra in this region [Fig. 2(f)]. This implies that we could further enhance the spectral resolving power by refining the design of the metasurfaces and generate a broader array of mutually uncorrelated tunable transmission spectra.

We also obtain the polarimetric information including degree of the linear polarization (DOLP) and the angle of linear polarization (AoLP) using the designed system. For polarimetric imaging, we collectively rotate the spinning metasurfaces and the input polarizer, selecting four different polarizations (0°, 90°, 45°, and ${-}{45^ \circ}$) while maintaining the same spectral transmission. We use the first three Stokes parameters to quantify the polarimetric information associated with each pixel, i.e., ${S_0} = {I_0} + {I_{90}}$, ${S_1} = {I_0} - {I_{90}}$, and ${S_2} = {I_{45}} - {I_{- 45}}$, where ${I_0}$, ${I_{90}}$, ${I_{45}}$, and ${I_{- 45}}$ are the light intensity at polarization angles of 0°, 90°, 45°, and ${-}{45^ \circ}$, respectively. The DoLP and AoLP are then calculated at each wavelength through ${\rm DoLP} = \sqrt {S_1^2 + S_2^2} /{S_0}$ and ${\rm AoLP} = {\rm arctan}({S_2}/{S_1})$. As shown in Figs. 4(g) and 4(h), we can clearly distinguish between different letters based on their polarimetric signatures in the thermal radiation signal. The four-dimensional spatial-spectro-polarimetric data-tesseract provides significantly more insight associated with an object, making it a powerful tool for a wide range of imaging applications.

To prove the effectiveness and reveal the potential of the Spinning MetaCam, we conduct numerical tests demonstrating its applicability in material classification. Figure 5(a) shows that our spinning metasurfaces can accurately reconstruct the LWIR thermal radiation spectra of a diverse range of materials commonly found in everyday life (ground truth obtained from the ECOSTRESS Spectral Library [42,43]). These results also highlight the spectral fidelity and universality of our spectral reconstruction method. Employing these thermal radiation spectra, we execute material classification by identifying the material whose ground truth spectrum exhibits the least mean square error (MSE) when compared to the reconstructed spectrum. We repeat this spectral reconstruction and material detection procedure 100 times for each material and use the outcomes to generate a confusion matrix (see Supplement 1 for details). Figures 5(b) and 5(c) compare the resulting confusion matrices derived with and without the use of spinning metasurfaces, while maintaining a consistent noise level of 5%. Without the spinning metasurfaces, material classification relies solely on the integrated intensity of the thermal radiation over the LWIR region. This approach exhibits a low precision for most materials, being effective only for those materials with substantially disparate thermal radiation intensities, such as metallic paint. In stark contrast, material classification significantly improves in accuracy when the spectra collected by the Spinning MetaCam is utilized. We note that instances of lower accuracy only arise when there is a pronounced similarity in the LWIR spectra of materials, as observed between construction glass and felsic. For a more quantitative analysis, we perform the material classification test at varying noise levels ranging from 1% to 10%, and subsequently calculate the mean average precision (mAP) [44]. Representing the overall accuracy of classification for all classes of materials at all different noise levels, the mAP reaches an impressively high value of 0.834, providing a significant improvement from the low mAP of 0.293 obtained without the spinning metasurfaces.

5. CONCLUSION

Our results provide an innovative approach for spectro-polarimetric thermal imaging by combining meta-optics and computational spectral reconstruction. The low-SWaP (size, weight, and power) system opens the door for physics-driven machine vision. The high-dimensional thermal image data can significantly improve the performance of tasks such as depth estimation, object detection, and semantic segmentation when only radiative heat signal is available. Furthermore, we foresee that spectro-polarimetric thermal imaging can also be a powerful tool for scientific research, allowing for non-destructive characterization in the infrared region to investigate a wide range of novel physical phenomena, such as anisotropic thermal conduction [45] and directional or nonreciprocal radiative heat transfer [4648]. Overall, our work provides a key development in the rapidly growing field of thermal imaging, offering a pivotal technology for heat-assisted detection and ranging.

Funding

U.S. Department of Energy (DE-SC0017717); Defense Advanced Research Projects Agency (Nascent Light-Matter Interactions (NLM) program).

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Supplemental document

See Supplement 1 for supporting content.

REFERENCES

1. A. M. Valm, S. Cohen, W. R. Legant, et al., “Applying systems-level spectral imaging and analysis to reveal the organelle interactome,” Nature 546, 162–167 (2017). [CrossRef]  

2. M. C. Martin, C. Dabat-Blondeau, M. Unger, et al., “3D spectral imaging with synchrotron Fourier transform infrared spectro-microtomography,” Nat. Methods 10, 861–864 (2013). [CrossRef]  

3. T. Ideguchi, S. Holzner, B. Bernhardt, et al., “Coherent Raman spectro-imaging with laser frequency combs,” Nature 502, 355–358 (2013). [CrossRef]  

4. Y. Ozeki, W. Umemura, Y. Otsuka, et al., “High-speed molecular spectral imaging of tissue with stimulated Raman scattering,” Nat. Photonics 6, 845–851 (2012). [CrossRef]  

5. G. E. Karniadakis, I. G. Kevrekidis, L. Lu, et al., “Physics-informed machine learning,” Nat. Rev. Phys. 3, 422–440 (2021). [CrossRef]  

6. Y. Tang, S. Song, S. Gui, et al., “Active and low-cost hyperspectral imaging for the spectral analysis of a low-light environment,” Sensors 23, 1437 (2023). [CrossRef]  

7. A. McClung, S. Samudrala, M. Torfeh, et al., “Snapshot spectral imaging with parallel metasystems,” Sci. Adv. 6, eabc7646 (2020). [CrossRef]  

8. M. Makarenko, A. Burguete-Lopez, Q. Wang, et al., “Real-time hyperspectral imaging in hardware via trained metasurface encoders,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (2022), pp. 12692–12702.

9. N. A. Rubin, G. D’Aversa, P. Chevalier, et al., “Matrix Fourier optics enables a compact full-stokes polarization camera,” Science 365, eaax1839 (2019). [CrossRef]  

10. L. Huang, S. Colburn, A. Zhan, et al., “Full-color metaoptical imaging in visible light,” Adv. Photon. Res. 3, 2100265 (2022). [CrossRef]  

11. L. Li, H. Ruan, C. Liu, et al., “Machine-learning reprogrammable metasurface imager,” Nat. Commun. 10, 1082 (2019). [CrossRef]  

12. A. Ji, J.-H. Song, Q. Li, et al., “Quantitative phase contrast imaging with a nonlocal angle-selective metasurface,” Nat. Commun. 13, 7848 (2022). [CrossRef]  

13. P. Thureja, R. Sokhoyan, C. U. Hail, et al., “Toward a universal metasurface for optical imaging, communication, and computation,” Nanophotonics 11, 3745–3768 (2022). [CrossRef]  

14. L. V. Poulikakos, M. Lawrence, D. R. Barton, et al., “Guided-mode-resonant dielectric metasurfaces for colorimetric imaging of material anisotropy in fibrous biological tissue,” ACS Photon. 7, 3216–3227 (2020). [CrossRef]  

15. Z. Lin, R. Pestourie, C. Roques-Carmes, et al., “End-to-end metasurface inverse design for single-shot multi-channel imaging,” Opt. Express 30, 28358–28370 (2022). [CrossRef]  

16. F. Bao, X. Wang, S. H. Sureshbabu, et al., “Heat-assisted detection and ranging,” Nature 619, 743–748 (2023). [CrossRef]  

17. A. P. Raman, M. A. Anoma, L. Zhu, et al., “Passive radiative cooling below ambient air temperature under direct sunlight,” Nature 515, 540–544 (2014). [CrossRef]  

18. K. P. Gurton, A. J. Yuffa, and G. W. Videen, “Enhanced facial recognition for thermal imagery using polarimetric imaging,” Opt. Lett. 39, 3857–3859 (2014). [CrossRef]  

19. T. Xu, Y.-K. Wu, X. Luo, et al., “Plasmonic nanoresonators for high-resolution colour filtering and spectral imaging,” Nat. Commun. 1, 59 (2010). [CrossRef]  

20. F. Yesilkoy, E. R. Arvelo, Y. Jahani, et al., “Ultrasensitive hyperspectral imaging and biodetection enabled by dielectric metasurfaces,” Nat. Photonics 13, 390–396 (2019). [CrossRef]  

21. J. Bao and M. G. Bawendi, “A colloidal quantum dot spectrometer,” Nature 523, 67–70 (2015). [CrossRef]  

22. Z. Wang, S. Yi, A. Chen, et al., “Single-shot on-chip spectral sensors based on photonic crystal slabs,” Nat. Commun. 10, 1020 (2019). [CrossRef]  

23. A. Tittl, A. Leitis, M. Liu, et al., “Imaging-based molecular barcoding with pixelated dielectric metasurfaces,” Science 360, 1105–1109 (2018). [CrossRef]  

24. K. Monakhova, K. Yanny, N. Aggarwal, et al., “Spectral diffusercam: lensless snapshot hyperspectral imaging with a spectral filter array,” Optica 7, 1298–1307 (2020). [CrossRef]  

25. K. Kreutz-Delgado, J. F. Murray, B. D. Rao, et al., “Dictionary learning algorithms for sparse representation,” Neural Comput. 15, 349–396 (2003). [CrossRef]  

26. E. J. Candès, J. Romberg, and T. Tao, “Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information,” IEEE Trans. Inf. Theory 52, 489–509 (2006). [CrossRef]  

27. S. Zhang, Y. Dong, H. Fu, et al., “A spectral reconstruction algorithm of miniature spectrometer based on sparse optimization and dictionary learning,” Sensors 18, 644 (2018). [CrossRef]  

28. A. Signoroni, M. Savardi, A. Baronio, et al., “Deep learning meets hyperspectral image analysis: a multidisciplinary review,” J. Imaging 5, 52 (2019). [CrossRef]  

29. J. Lu, H. Liu, Y. Yao, et al., “HSI road: a hyper spectral image dataset for road segmentation,” in IEEE International Conference on Multimedia and Expo (ICME) (IEEE, 2020), pp. 1–6.

30. K. Usmani, G. Krishnan, T. O’Connor, et al., “Deep learning polarimetric three-dimensional integral imaging object recognition in adverse environmental conditions,” Opt. Express 29, 12215–12228 (2021). [CrossRef]  

31. Z. Wang and Z. Yu, “Spectral analysis based on compressive sensing in nanophotonic structures,” Opt. Express 22, 25608–25614 (2014). [CrossRef]  

32. Z. Yang, T. Albrow-Owen, W. Cai, et al., “Miniaturization of optical spectrometers,” Science 371, eabe0722 (2021). [CrossRef]  

33. S. Yuan, D. Naveh, K. Watanabe, et al., “A wavelength-scale black phosphorus spectrometer,” Nat. Photonics 15, 601–607 (2021). [CrossRef]  

34. H. H. Yoon, H. A. Fernandez, F. Nigmatulin, et al., “Miniaturized spectrometers with a tunable van der Waals junction,” Science 378, 296–299 (2022). [CrossRef]  

35. Z. Yang, T. Albrow-Owen, H. Cui, et al., “Single-nanowire spectrometers,” Science 365, 1017–1020 (2019). [CrossRef]  

36. B. Redding, S. F. Liew, R. Sarma, et al., “Compact spectrometer based on a disordered photonic chip,” Nat. Photonics 7, 746–751 (2013). [CrossRef]  

37. R. Cheng, C.-L. Zou, X. Guo, et al., “Broadband on-chip single-photon spectrometer,” Nat. Commun. 10, 4104 (2019). [CrossRef]  

38. S. N. Zheng, J. Zou, H. Cai, et al., “Microring resonator-assisted Fourier transform spectrometer with enhanced resolution and large bandwidth in single chip solution,” Nat. Commun. 10, 2349 (2019). [CrossRef]  

39. M. C. Souza, A. Grieco, N. C. Frateschi, et al., “Fourier transform spectrometer on silicon with thermo-optic non-linearity and dispersion correction,” Nat. Commun. 9, 665 (2018). [CrossRef]  

40. D. M. Kita, B. Miranda, D. Favela, et al., “High-performance and scalable on-chip digital fourier transform spectroscopy,” Nat. Commun. 9, 4405 (2018). [CrossRef]  

41. Y. Xiao, C. Wan, J. Salman, et al., “Planck spectroscopy,” Laser Photon. Rev. 15, 2100121 (2021). [CrossRef]  

42. S. K. Meerdink, S. J. Hook, D. A. Roberts, et al., “The ECOSTRESS spectral library version 1.0,” Remote Sens. Environ. 230, 111196 (2019). [CrossRef]  

43. A. M. Baldridge, S. J. Hook, C. Grove, et al., “The ASTER spectral library version 2.0,” Remote Sens. Environ. 113, 711–715 (2009). [CrossRef]  

44. J. Redmon, S. Divvala, R. Girshick, et al., “You only look once: unified, real-time object detection,” in IEEE Conference on Computer Vision and Pattern Recognition (2016).

45. S. Huang, M. Segovia, X. Yang, et al., “Anisotropic thermal conductivity in 2D tellurium,” 2D Mater. 7, 015008 (2019). [CrossRef]  

46. J. Xu, J. Mandal, and A. P. Raman, “Broadband directional control of thermal emission,” Science 372, 393–397 (2021). [CrossRef]  

47. C. Yang, W. Cai, and Z. M. Zhang, “Polarimetric analysis of thermal emission from both reciprocal and nonreciprocal materials using fluctuation electrodynamics,” Phys. Rev. B 106, 245407 (2022). [CrossRef]  

48. K. J. Shayegan, S. Biswas, B. Zhao, et al., “Direct observation of the violation of Kirchhoff’s law of thermal radiation,” Nat. Photonics 17, 891–896 (2023). [CrossRef]  

Supplementary Material (1)

NameDescription
Supplement 1       Supplemental Document.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (5)

Fig. 1.
Fig. 1. Long-wave infrared (LWIR) spectro-polarimetric thermal imaging. (a) Room-temperature blackbody radiation (shown in red) and atmospheric transmission spectrum (shown as a shaded area). The LWIR spectral region is crucial for thermal imaging due to its peaked room-temperature thermal radiation spectrum and the atmospheric transparency window. (b)–(d) Conventional methods for spectral imaging, such as using (b) a mosaic sensor, (c) a filter wheel, or (d) interferometry, either pose limitations or are infeasible for LWIR thermal imaging. (e) In this study, we propose a new approach for spectro-polarimetric thermal imaging, achieved by combining large-area spinning metasurfaces and compressive sensing reconstruction algorithms.
Fig. 2.
Fig. 2. Design and characterization of the spinning metasurfaces. (a)–(c) Schematics of the three different metasurface devices. (d) Optical microscope image of a fabricated metasurface. Inset, an optical image of a 25.4 mm-diameter device used for the imaging experiments, highlighting the large-area uniformity. (e) Measured polarized transmission spectra (${t_{\textit{ip}}}$ and ${t_{\textit{is}}}$) of the three metasurfaces (M1–M3), displaying strong anisotropy and distinctive dispersion. (f) Generated tunable transmission spectra by our spinning metasurface stack. The upper boundary of colored areas represents the transmission spectra corresponding to specific spinning angle combinations of the three metasurfaces. The colored areas highlight the differences between the generated transmission spectra. The low correlation between the generated transmission spectra is crucial for improving spectral reconstruction performance. Inset, an optical image of the integrated spinning-metasurface module based on motorized rotatory mounts. The overall size of the module is smaller than ${10}\;{\rm cm} \times {10}\;{\rm cm} \times {10}\;{\rm cm}$, making it a promising platform for next-generation high-contrast LWIR thermal imaging. (g) Normalized spatial transmittance of the module at two representative spinning angle combinations $\Theta = { (0^ \circ}{,0^ \circ}{,0^ \circ})$ shown at the top and $\Theta = { (90^ \circ}{,90^ \circ}{,90^ \circ})$ illustrated at the bottom. The red circles correspond to a transmittance value of 0.5. Note that the spatial transmittance remains relatively consistent across all different $\Theta$ combinations, considering that the field of view is primarily limited by the diameters of the metasurface devices. The estimated field of view of the imaging system across the LWIR range is approximately 20.5°.
Fig. 3.
Fig. 3. Schematic of the spectral reconstruction process. (a) The measured raw signal can be expressed by (b) the pre-calibrated spectral response function of the imaging system multiplied by (c) the spectrum of an imaging target. (c), (d) For the spectral reconstruction, the unknown spectrum ${P_\lambda}$ is projected onto a sparse representation basis ${D_{{k\lambda}}}$ using dictionary learning. (e) This sparse representation ${\phi _k}$ is then used for compressive sensing based reconstruction. The use of compressive sensing and dictionary learning in the reconstruction process significantly improves the reconstruction accuracy, making the spinning-metasurface-based spectro-polarimetric imaging more robust against noise and measurement errors.
Fig. 4.
Fig. 4. Spectro-polarimetric thermal imaging results. (a) An optical image of the “PURDUE” imaging target that is constructed from titanium letters on a glass substrate (${75}\;{\rm mm} \times {50}\;{\rm mm}$). Inset, a zoomed-in optical microscope image of the micro-structures in the letters, which generate distinctive spectral and polarimetric signatures. (b)–(e) Reconstructed spectra of four representative pixels (corresponding to the letter “R”, “U”, “E”, and the glass substrate, respectively) compared with the ground truth spectra measured by a Fourier-transform infrared spectrometer. (f) Reconstructed spectral frames at 6 representative wavelengths. The contrast between different frames demonstrates that the system can effectively reveal the LWIR spectral properties of various materials and structures. (g), (h) Degree-of-linear-polarization and angle-of-linear-polarization frames. Distinctive polarimetric signatures can be observed for each letter in the images. (i) Simulated spectral reconstruction results. The ground truth spectra (solid lines) are Gaussian peaks with 0.6 µm FWHM centered at different wavelengths (8.5 µm, 9.3 µm, 10.2 µm, 11 µm, 11.8 µm, 12.7 µm, and 13.5 µm). The reconstructed spectra (dotted lines) show good agreements with the ground truth.
Fig. 5.
Fig. 5. Material classification using the Spinning MetaCam. (a) Simulated spectral reconstruction results for materials commonly found in everyday life. (b) Calculated confusion matrix illustrating material classification without the use of spinning metasurfaces. The classification in this case is solely reliant on the integrated intensity of thermal radiation within the LWIR region. (c) Calculated confusion matrix presenting material classification with spinning metasurfaces incorporated. The inclusion of spectral information substantially augments the performance of material classification, improving the mean average precision from 0.293 to 0.834.

Equations (6)

Equations on this page are rendered with MathJax. Learn more.

J i ( θ i , λ ) = R ( θ i ) J M i ( λ ) R ( θ i ) = [ cos ( θ i ) sin ( θ i ) sin ( θ i ) cos ( θ i ) ] [ t ip ( λ ) 0 0 t is ( λ ) ] [ cos ( θ i ) sin ( θ i ) sin ( θ i ) cos ( θ i ) ] ,
J ( Θ , λ ) = J 1 ( θ 1 , λ ) J 2 ( θ 2 , λ ) J 3 ( θ 3 , λ ) .
I Θ = R Θ λ P λ .
P λ = D λ k ϕ k ,
ϕ r e c o n = arg min ϕ k ϕ k 1 s . t . I Θ R Θ λ P λ 2 = I Θ A Θ k ϕ k 2 < ϵ ,
P r e c o n = D λ k ϕ r e c o n .
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.