
3D far-field Lidar sensing and computational modeling for human identification

Open Access

Abstract

3D sensors offer depth sensing that can support task-specific data processing and computational modeling. Most existing methods for human identification with 3D depth sensors rely on Kinect data, where the operating range is very limited. This work considers a long-range 3D Lidar sensor for far-field imaging of human subjects in 3D Lidar full motion video (FMV) of a “walking” action. The 3D Lidar FMV data are used to develop computational models for automated human silhouette and skeleton extraction, followed by subject identification. We propose a matrix completion algorithm to handle missing data in the 3D FMV, caused by self-occlusion and occlusion by other subjects, during 3D skeleton extraction. We further study the effect of noise in the low-resolution far-field 3D Lidar data on the silhouette extraction performance of the model. This work also addresses challenges specific to far-field 3D Lidar, including learning from a limited amount of data at low resolution. Finally, we evaluate the proposed computational algorithm on a gallery of 10 subjects for human identification and show that our method is competitive with the state-of-the-art OpenPose and V2VPose skeleton extraction models when used for human identification on the same dataset.
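For readers who want to prototype the matrix completion step named above, the following is a minimal NumPy sketch, not the authors' implementation: it fills occluded joint coordinates by iterative truncated-SVD completion of a stacked joint matrix (the matrix P defined in the Equations section), with NaNs marking missing entries. The rank, iteration count, tolerance, and function name are illustrative assumptions.

```python
import numpy as np

def complete_low_rank(P, rank=3, n_iters=200, tol=1e-6):
    """Fill missing (NaN) entries of P with a rank-`rank` approximation.

    P : (M, 3N) array of stacked 3D joint coordinates across M frames,
        NaN where a joint was occluded.  Iterative truncated-SVD
    ("impute, project, repeat"); rank and iteration counts are
    illustrative choices, not the paper's settings.
    """
    mask = ~np.isnan(P)                      # observed entries
    X = np.where(mask, P, np.nanmean(P))     # crude initial fill for missing entries
    for _ in range(n_iters):
        U, s, Vt = np.linalg.svd(X, full_matrices=False)
        s[rank:] = 0.0                       # keep only the leading singular values
        X_new = (U * s) @ Vt                 # low-rank projection
        X_new[mask] = P[mask]                # observed entries stay fixed
        if np.linalg.norm(X_new - X) < tol * max(np.linalg.norm(X), 1.0):
            X = X_new
            break
        X = X_new
    return X
```

In practice the target rank and stopping tolerance would be tuned on the Lidar skeleton data; the sketch only conveys the impute–project–repeat structure common to low-rank completion methods.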

© 2023 Optica Publishing Group


Corrections

2 January 2024: A correction was made to the author listing.


More Like This
LiDAR-camera-system-based unsupervised and weakly supervised 3D object detection

Haosen Wang, Tiankai Chen, Xiaohang Ji, Feng Qian, Yue Ma, and Shifeng Wang
J. Opt. Soc. Am. A 40(10) 1849-1860 (2023)

3D object detection through fog and occlusion: passive integral imaging vs active (LiDAR) sensing

Kashif Usmani, Timothy O’Connor, Pranav Wani, and Bahram Javidi
Opt. Express 31(1) 479-491 (2023)

Enhancing 3D human pose estimation with NIR single-pixel imaging and time-of-flight technology: a deep learning approach

Carlos Osorio Quero, Daniel Durini, Jose Rangel-Magdaleno, Jose Martinez-Carranza, and Ruben Ramos-Garcia
J. Opt. Soc. Am. A 41(3) 414-423 (2024)

Data availability

Data underlying the results presented in this paper are not publicly available as the data are considered sensitive by the US Army NVESD.



Figures (8)

Fig. 1. Proposed flow diagram of the computational model for silhouette and skeleton extraction.
Fig. 2. Single-person silhouette extraction. (a) Raw data. (b) Naïve extraction. (c) Noise removed. (d) All limbs.
Fig. 3. Multiple-person silhouette extraction. (a) Raw data. (b) Two-person silhouette extraction.
Fig. 4. Silhouette size to demonstrate model operating limits.
Fig. 5. Examples of challenging sensor operation regions. (a) Subject too close to sensor. (b) Subject too far from sensor. (c) Subject at middle range with noise.
Fig. 6. Example Lidar input video frame with 3D skeleton output. (a) Single-subject skeleton output. (b) Multiple-subject skeleton output.
Fig. 7. Comparison of occlusion completion methods.
Fig. 8. ROC curves for single and multiple subject identification.

Tables (2)


Table 1. Best Accuracy for Single Subject per Frame


Table 2. Performance Comparison with Existing Methods

Equations (12)

Equations on this page are rendered with MathJax.

$$\mathrm{prop}_1(x, y, t) = \mathrm{nearfield} < \mathrm{range}(x, y, t) < \max_t \mathrm{range}(x, y, :) - \epsilon_{\mathrm{background}}.$$
$$\mathrm{prop}_2(x, y, t) = \mathrm{median}_{\mathrm{prop}_1} - \alpha_{\mathrm{close}} < \mathrm{range}(x, y, t) < \mathrm{median}_{\mathrm{prop}_1} + \alpha_{\mathrm{far}}.$$
$$\mathrm{prop}_3(x, y, t) = \left(\mathrm{median}_{\mathrm{prop}_1} - \alpha_{\mathrm{close}} < \mathrm{range}(x, y, t) < \mathrm{median}_{\mathrm{prop}_1} + \alpha_{\mathrm{relaxed}}\right)\ \mathrm{AND}\ \mathrm{intensity}(x, y, t) > \mathrm{intensity}_{\mathrm{noise}}.$$
$$\mathrm{prop}_4(x, y, t) = \mathrm{prop}_1\ \mathrm{AND}\ \left(\mathrm{prop}_2\ \mathrm{OR}\ \mathrm{prop}_3\right).$$
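As a reading aid for the four propositions above, here is a minimal NumPy sketch of single-subject silhouette gating; the input array names (rng, intensity) and all threshold values are illustrative assumptions, not the paper's code.

```python
import numpy as np

def silhouette_props(rng, intensity, nearfield=2.0, eps_bg=0.5,
                     a_close=0.3, a_far=0.3, a_relaxed=0.8, i_noise=10.0):
    """rng: (H, W, T) range video; intensity: (H, W, T) return intensity."""
    background = rng.max(axis=2, keepdims=True)           # per-pixel max range over time
    prop1 = (rng > nearfield) & (rng < background - eps_bg)

    # per-frame median range of the prop1 (foreground) pixels
    med = np.array([np.median(rng[..., t][prop1[..., t]])
                    if prop1[..., t].any() else np.inf
                    for t in range(rng.shape[2])])
    prop2 = (rng > med - a_close) & (rng < med + a_far)
    prop3 = (rng > med - a_close) & (rng < med + a_relaxed) & (intensity > i_noise)
    prop4 = prop1 & (prop2 | prop3)                        # final single-subject mask
    return prop4
```

The final mask combines the strict background-subtraction gate (prop1) with the median-range gates (prop2, prop3), mirroring the fourth proposition.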
$$\mathrm{prop}_5(x, y, t, k) = \mathrm{prop}_1(x, y, t)\ \mathrm{AND}\ \mathrm{range}(x, y, t) \in \mathrm{set}_k.$$
$$\mathrm{prop}_6(x, y, t, k) = \mathrm{prop}_5(x, y, t, k)\ \mathrm{AND}\ x > x_{\mathrm{offset}}.$$
$$\mathrm{prop}_7(x, y, t, k) = \mathrm{unionfind}\left(\mathrm{prop}_6(x, y, t, k),\ t,\ k\right).$$
$$\mathrm{prop}_8(x, y, t, k) = \mathrm{prop}_7(x, y, t, k)\ \mathrm{OR}\ \left(x > x_{\mathrm{offset}}\ \mathrm{AND}\ \mathrm{prop}_5(x, y, t, k)\right).$$
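A similarly hedged sketch of the multi-subject propositions: each range gate set_k isolates one candidate subject, and connected-component labeling (scipy.ndimage.label) stands in for the union-find step. The gate boundaries and x_offset split are illustrative, and the final recombination (prop8) is omitted for brevity.

```python
import numpy as np
from scipy import ndimage

def multi_subject_masks(rng, prop1, range_gates, x_offset=0):
    """One boolean mask per range gate (candidate subject), per frame.

    rng: (H, W, T) range video; prop1: (H, W, T) foreground mask;
    range_gates: list of (lo, hi) intervals standing in for set_k.
    """
    _, W, T = rng.shape
    cols = np.arange(W)[None, :, None]                 # x (column) index grid
    masks = []
    for lo, hi in range_gates:
        prop5 = prop1 & (rng >= lo) & (rng < hi)       # gate by range interval set_k
        prop6 = prop5 & (cols > x_offset)              # keep the region x > x_offset
        prop7 = np.zeros_like(prop6)
        for t in range(T):
            labels, n = ndimage.label(prop6[..., t])   # connected components (union-find stand-in)
            if n:
                sizes = ndimage.sum(prop6[..., t], labels, range(1, n + 1))
                prop7[..., t] = labels == (1 + int(np.argmax(sizes)))   # largest component
        masks.append(prop7)
    return masks
```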
$$\bar{y}(t) = \frac{\sum y(t)\,\big|_{\,\mathrm{silhouette}(x, y, t)}}{n},$$
$$\bar{z}(t) = \frac{\sum z(x, y, t)\,\big|_{\,\mathrm{silhouette}(x, y, t)}}{n}.$$
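The two centroid equations above reduce to per-frame means over the silhouette pixels; a minimal sketch, assuming an (H, W) y-coordinate grid and an (H, W, T) z (range) video, with names chosen for illustration:

```python
import numpy as np

def silhouette_centroid(y_grid, z, silhouette):
    """Per-frame mean y and z over silhouette pixels (the two centroid equations)."""
    T = silhouette.shape[2]
    y_bar = np.full(T, np.nan)
    z_bar = np.full(T, np.nan)
    for t in range(T):
        mask = silhouette[..., t]
        if mask.any():
            y_bar[t] = y_grid[mask].mean()      # mean y coordinate of silhouette pixels
            z_bar[t] = z[..., t][mask].mean()   # mean range of silhouette pixels
    return y_bar, z_bar
```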
$$P = \begin{bmatrix}
X_{11} & Y_{11} & Z_{11} & X_{12} & Y_{12} & Z_{12} & \cdots & X_{1N} & Y_{1N} & Z_{1N} \\
X_{21} & Y_{21} & Z_{21} & X_{22} & Y_{22} & Z_{22} & \cdots & X_{2N} & Y_{2N} & Z_{2N} \\
\vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \ddots & \vdots & \vdots & \vdots \\
X_{M1} & Y_{M1} & Z_{M1} & X_{M2} & Y_{M2} & Z_{M2} & \cdots & X_{MN} & Y_{MN} & Z_{MN}
\end{bmatrix}$$
$$X_{\mathrm{height}} = X_{\mathrm{pixels}} \cdot Z.$$
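The last equation reads as the subject's vertical pixel extent scaled by range Z to recover physical height; a one-line sketch, under the assumption that any fixed angular-pixel-pitch factor is absorbed into the units:

```python
def subject_height(x_pixels, z_range):
    """Physical height estimate: vertical pixel extent times range Z.
    Assumes any constant angular-pixel-pitch factor is folded into the units."""
    return x_pixels * z_range
```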