
Label enhanced and patch based deep learning for phase retrieval from single frame fringe pattern in fringe projection 3D measurement


Abstract

We propose a label enhanced and patch based deep learning phase retrieval approach that achieves fast and accurate phase retrieval using only several fringe patterns as the training dataset. To the best of our knowledge, this is the first time that the advantages of label enhancement and the patch strategy for deep learning based phase retrieval are demonstrated in fringe projection. In the proposed method, the enhanced labeled data in the training dataset are designed so that the deep neural network (DNN) learns the mapping between the input fringe pattern and the output enhanced fringe part. Moreover, the training data are cropped into small overlapped patches to expand the training samples for the DNN. The performance of the proposed approach is verified by experimental projection fringe patterns, with applications in dynamic fringe projection 3D measurement.

© 2019 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

Fringe projection, as a non-contact and whole-field three-dimensional (3D) shape measurement technology with high speed, high resolution, and low cost, has been widely employed in diverse fields, including biomedical, industrial and scientific, kinematic, and biometric identification applications [1–3]. The principle of this method is to measure the deformation of the projected fringe pattern modulated by the height of the tested object. The height information is related to the phase in the deformed fringe pattern, and the phase is recovered by a phase retrieval operator. Phase retrieval is a key and difficult problem in the fringe projection measurement technique [1,4]. Phase retrieval methods are mainly divided into two categories: methods using a single frame fringe pattern and phase shift methods. The phase shift methods usually require multiple fringe patterns captured at different moments [5,6]. When measuring objects in fast motion or in a temporally unstable environment, it is difficult or costly to capture several projection fringe patterns in an extremely short period of time. Compared with the latter, the former requires only one fringe pattern in a single shot, which makes it less susceptible to interference from the external environment and more suitable for 3D measurement of dynamic objects [5–7].

However, phase retrieval from a single frame fringe pattern is a challenging problem in fringe projection 3D measurement, especially for objects with edges or abrupt changes in depth, and it has attracted wide attention. Numerous methods have been proposed, such as the well-known Fourier transform (FT) method, the windowed Fourier transform (WFT) method, the wavelet transform (WT) method, the shearlet transform (ST) method, and more effective methods such as the empirical mode decomposition (EMD) method and the more recently proposed variational image decomposition (VID) and variational mode decomposition (VMD) methods [8–15]. Although extensive research efforts have been devoted to phase retrieval, it is hard to implement a phase retrieval method that is both accurate and fast because of the tradeoff between accuracy and computational efficiency in traditional methods. For instance, the FT method is simple but does not work well for objects with edges, while the more effective methods such as VID and VMD incur a large computational cost.

Recently, discriminative learning approaches such as deep learning have proven successful in many areas, ranging from computer vision (image recognition, image denoising, and image super-resolution) to optical imaging (digital microscopy and digital holography) [16–19]. Inspired by these successes, Feng, Zuo, et al. recently introduced deep learning into fringe pattern analysis and proposed a deep neural network (DNN) to perform phase retrieval from a single frame fringe pattern [20,21]. The process of phase retrieval is learned by the DNN from the input data and the output labeled data in the training dataset. Their work demonstrates that the deep-learning-based technique can provide high-accuracy phase retrieval results in a short time. Owing to the ability of deep learning to learn the mapping between input data and output labeled data, the DNN can also be applied to problems such as phase unwrapping and 3D mapping in fringe projection 3D measurement [22–24].

As a data-driven approach, the performance of deep learning based phase retrieval depends on the quality and quantity of the training data. Abundant and accurate labeled data are important in real fringe projection but difficult to acquire. Hence, using fewer samples is desirable for deep learning based phase retrieval, provided the learning and prediction performance is unchanged. In addition, noise in the fringe patterns decreases the accuracy of the phase retrieval results. To date, these two issues have not been addressed by existing deep learning based phase retrieval methods for fringe projection 3D measurement. In this paper, we develop a new phase retrieval method based on the recently proposed DnCNN model [25] to tackle both the noise problem in phase retrieval and the training sample problem. In the proposed method, the fringe pattern and the enhanced fringe part are used as the input data and output labeled data in the training dataset of the DNN, so that their mapping is learned to implement data-driven phase retrieval. Since the labeled data are enhanced, the proposed method is expected to deal with noisy fringe patterns without pre-processing or post-processing. Moreover, the proposed method needs fewer samples because we expand the samples by cropping the original samples into many overlapped small patches. Phase retrieval by the proposed method was performed, and the performance of the method was verified by experimental results. The contributions of our work are as follows:

  • (1) We propose to use the denoised and enhanced fringe part as the labeled data in the training stage. In this way, the proposed DNN can learn the denoised and enhanced fringe part from a noisy fringe pattern, so it simultaneously achieves fringe part extraction and enhancement and requires no filtering pre-processing or post-processing in phase retrieval. For the simulated data, the output labeled data are known in advance. However, for real fringe patterns, the labeled data are not exactly known. Therefore, we use the phase shift method and shearlet transform filtering to produce the enhanced labeled data. The applicability and advantages of the label enhancement are demonstrated in fringe projection.
  • (2) We propose a patch strategy that expands the training dataset by cropping the input fringe pattern and output labeled data into overlapped small patches. In this way, the samples are expanded to deal with the problem that a large training dataset is difficult to acquire for a traditional DNN. Meanwhile, the small patches decrease the computational size of the network, leading to noticeable reductions in running time and memory requirements. Although the patch strategy has been employed in image super-resolution, denoising, etc., to the best of our knowledge this is the first time it has been introduced to phase retrieval for fringe projection 3D measurement. The advantages of the patch strategy are demonstrated on real fringe patterns.

2. The proposed method

In fringe projection 3D measurement, the intensity distribution of a fringe pattern can be expressed as

$$I({x,y} )\textrm{ = }a({x,y} )+ b({x,y} )\cos ({\phi ({x,y} )\textrm{ + }2\pi {f_0}x} )\textrm{ + }noise,$$
where $a({x,y} )$ is the background, $b({x,y} )$ and $\phi ({x,y} )$ are the modulation intensity and the optical phase, ${f_0}$ is the carrier frequency, and $noise$ denotes the noise in $I({x,y} )$. Phase retrieval can be implemented by extracting the fringe part $b({x,y})\cos ({\phi ({x,y}) + 2\pi {f_0}x} )$ from the background $a({x,y} )$ and the $noise$ part [13]. However, due to discontinuous object edges and noise, the fringe part and the other parts are not easily separated. The deep learning method has been proposed to separate the fringe part from the other parts by learning the mapping between the input fringe pattern and the output fringe part with a DNN. As noted, previous works on deep learning based phase retrieval do not deal with separating the fringe part from noise, and the noise in the labeled data has received no attention. Also, in order to train the DNN effectively, scores of fringe patterns with labeled data must be prepared [20]. In this paper, we propose to extract the fringe part from both the background and the noise with fewer samples, in a new manner using deep learning, as follows.
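For concreteness, the following is a minimal NumPy sketch of generating a fringe pattern according to Eq. (1). The background, modulation, carrier frequency, and the Gaussian-bump test phase used here are illustrative assumptions rather than the values used in our experiments.

```python
import numpy as np

def simulate_fringe(size=512, f0=1/16, noise_var=0.2, seed=0):
    """Simulate I = a + b*cos(phi + 2*pi*f0*x) + noise, following Eq. (1)."""
    rng = np.random.default_rng(seed)
    y, x = np.mgrid[0:size, 0:size]
    a = 0.5 * np.ones((size, size))      # background (assumed value)
    b = 0.45 * np.ones((size, size))     # modulation intensity (assumed value)
    # illustrative smooth test phase (Gaussian bump), not the phase used in the paper
    phi = 8.0 * np.exp(-((x - size / 2) ** 2 + (y - size / 2) ** 2) / (2 * 80.0 ** 2))
    fringe_part = b * np.cos(phi + 2 * np.pi * f0 * x)   # the part the DNN must recover
    noise = np.sqrt(noise_var) * rng.standard_normal((size, size))
    return a + fringe_part + noise, fringe_part, phi
```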

2.1 The design of DNN for phase retrieval

The proposed phase retrieval method is based on extracting the fringe part from the fringe pattern by using a DNN to learn the process of fringe part extraction. The DNN implements fringe part extraction in two steps: a training step and a testing step. In the training step, the DNN is trained to learn the mapping between the input data (fringe pattern) and the output labeled data (fringe part); in the testing step, the trained DNN predicts the output fringe part for a given input fringe pattern. Figure 1 shows the diagram of the fringe part extraction, where the DnCNN model is used in our study because DnCNN integrates residual learning and batch normalization, which benefit from each other and together are effective in speeding up training and boosting denoising performance [25]. As shown in Fig. 1, in order to make the network acquire image features more efficiently, the 512×512 pixel input fringe pattern and output labeled fringe part are divided into overlapped patches of 40×40 pixels using a fixed-size window. The input patches and output label patches are used to train the DNN to learn the mapping between the fringe pattern and the output fringe part. Once the DNN is trained, it is used to predict the fringe part from a tested fringe pattern.

Fig. 1. The diagram of the fringe part extraction.

The DNN model shown in Fig. 1 contains convolutional layers (Conv), batch normalization (BN), and ReLU activations. DnCNN applies a residual learning formulation to learn the mapping function, and it uses batch normalization to accelerate the training procedure while improving the denoising results. Based on Eq. (1), the input fringe pattern for the DNN can be rewritten as $p = f + r$, where $f$ and $r$ are the fringe part and the rest of the fringe pattern $p$, respectively. $r$ contains the background part and the noise part, and is denoted as the generalized residual of the fringe pattern $p$ in this paper. Our goal is to use the DNN with residual learning to separate $f$ and $r$ [25]. The residual learning method trains the network to generate a nonlinear mapping $R(p) = r$, so that the residual part $r = R(p)$ is predicted and the fringe part $f = p - R(p)$ can be extracted. The loss function of the DNN is as follows

$$loss(\theta ) = \frac{1}{{2N}}\sum\limits_{i = 1}^N {||{R({p_i};\theta ) - ({p_i} - {f_i})} ||_F^2} ,$$
which represents the mean squared error between the expected residual $({p_i} - {f_i})$ and the network-predicted residual $R({p_i};\theta )$. $\theta$ represents the weights and biases in the network, which are updated by the back propagation of the DNN, and $\{ ({p_i},{f_i})\} _{i = 1}^N$ denotes the $N$ pairs of fringe patterns and corresponding fringe parts. After training the DNN, the predicted fringe part $f$ can be obtained. With the derived fringe part $f({x,y} )$, the wrapped phase distribution with carrier is calculated by the Hilbert transform and the arctangent operator on the fringe part as follows:
$$\phi ({x,y} )+ {\varphi _c}({x,y} )= \arctan \left( {\frac{{{\mathop{\rm Im}\nolimits} \{{H\textrm{(}f(x,y))} \}}}{{{\mathop{\rm Re}\nolimits} \{{H\textrm{(}f(x,y))} \}}}} \right),$$
where $H$ denotes the Hilbert transform, ${\mathop{\rm Re}\nolimits} \{{} \}$ and ${\mathop{\rm Im}\nolimits} \{{} \}$ denote the real and imaginary parts, respectively, and ${\varphi _c}({x,y} )$ is the carrier, which should be removed to produce the pure phase. In this paper, the unwrapped phases were obtained by the quality-guided phase unwrapping algorithm [14]. To obtain the pure unwrapped phases without the carrier term, the carrier was removed from the unwrapped phases using the Fourier carrier removal method. To sum up, Fig. 2 shows the diagram of the deep learning based method for phase retrieval, which is composed of the above-mentioned DNN training and prediction, wrapped phase retrieval, phase unwrapping, and carrier removal.
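As a concrete illustration of this step, the sketch below computes the wrapped phase from a predicted fringe part via the Hilbert transform and the arctangent of Eq. (3). Note that scikit-image's `unwrap_phase` and the least-squares plane fit are simple stand-ins for the quality-guided unwrapping [14] and the Fourier carrier removal actually used in this paper.

```python
import numpy as np
from scipy.signal import hilbert
from skimage.restoration import unwrap_phase

def wrapped_phase_from_fringe_part(fringe_part):
    """Wrapped phase (with carrier) from the predicted fringe part, per Eq. (3).

    scipy.signal.hilbert returns the analytic signal f + j*H{f}; its angle gives
    the arctangent of Eq. (3) with the correct quadrant. The transform is applied
    along the carrier (x) direction.
    """
    analytic = hilbert(fringe_part, axis=1)
    return np.angle(analytic)

def retrieve_phase(fringe_part):
    """Wrapped phase -> unwrapped phase -> carrier removal (simplified stand-ins)."""
    wrapped = wrapped_phase_from_fringe_part(fringe_part)
    unwrapped = unwrap_phase(wrapped)
    # remove the linear carrier 2*pi*f0*x by fitting and subtracting a plane
    h, w = unwrapped.shape
    y, x = np.mgrid[0:h, 0:w]
    A = np.column_stack([x.ravel(), y.ravel(), np.ones(h * w)])
    coeff, *_ = np.linalg.lstsq(A, unwrapped.ravel(), rcond=None)
    carrier = (A @ coeff).reshape(h, w)
    return unwrapped - carrier
```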

Fig. 2. The diagram of the proposed deep learning phase retrieval.

2.2 The label enhancement and patch strategy

The simulated fringe pattern is generated according to Eq. (1), with Gaussian random noise of variance 0.2 added in the simulation. Figure 3 shows the simulated training dataset with labeled data. For the simulated data, the labeled data are known. However, for the experimental data, the labeled data are not exactly known. To obtain ground-truth data as the output labeled data in real fringe projection, we exploit the four-step phase shifting method to produce the labeled data [5,14]. The labeled data $L = ({I({x,y,0} )- I({x,y,\pi } )} )/2$ are obtained from the four-step phase shifting fringe patterns, where $I({x,y,0} )$ and $I({x,y,\pi } )$ are the phase-shifted fringe patterns with phase shifts $0$ and $\pi$, respectively, in the four-step phase shift method:

$$\begin{array}{l} I({x,y,0} )\textrm{ = }a({x,y} )+ b({x,y} )\cos ({\phi ({x,y} )\textrm{ + }2\pi {f_0}x\textrm{ + }0} )\textrm{ + }noise,\\ I({x,y,\pi /2} )\textrm{ = }a({x,y} )+ b({x,y} )\cos ({\phi ({x,y} )\textrm{ + }2\pi {f_0}x\textrm{ + }\pi /2} )\textrm{ + }noise,\\ I({x,y,\pi } )\textrm{ = }a({x,y} )+ b({x,y} )\cos ({\phi ({x,y} )\textrm{ + }2\pi {f_0}x\textrm{ + }\pi } )\textrm{ + }noise,\\ I({x,y,\textrm{3}\pi /2} )\textrm{ = }a({x,y} )+ b({x,y} )\cos ({\phi ({x,y} )\textrm{ + }2\pi {f_0}x\textrm{ + 3}\pi /2} )\textrm{ + }noise. \end{array}$$
Subtracting $I({x,y,\pi } )$ from $I({x,y,0} )$ cancels the background $a({x,y} )$, so that, apart from residual noise, $L = b({x,y} )\cos ({\phi ({x,y} ) + 2\pi {f_0}x} )$. In real 3D measurement, the fringe pattern captured by the CCD or CMOS camera is corrupted by noise, which introduces errors into the 3D reconstruction result. In order to eliminate these errors, denoising algorithms are usually applied in fringe pattern pre-processing or post-processing. In this paper, we propose to train the DNN using the fringe pattern captured from the real scenario by the CMOS camera and the corresponding denoised fringe part (label enhancement), so that the trained network learns the mapping between the input fringe pattern and the enhanced fringe part and can consequently predict the enhanced fringe part of a given fringe pattern, which avoids separate denoising steps. The shearlet transform method with soft-threshold shrinkage is employed to effectively denoise the labeled fringe part $L$ while preserving the details [26].
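A minimal sketch of the label construction is shown below; the `denoise` argument is a placeholder for the shearlet-domain soft-threshold shrinkage [26] (or any other edge-preserving denoiser), which we do not reimplement here.

```python
import numpy as np

def enhanced_label(I0, Ipi, denoise=None):
    """Build the labeled fringe part L = (I(x,y,0) - I(x,y,pi)) / 2, optionally enhanced.

    Subtracting the 0- and pi-shifted patterns cancels the background a(x,y).
    `denoise` stands in for the shearlet soft-threshold shrinkage used in the paper.
    """
    L = (np.asarray(I0, dtype=np.float64) - np.asarray(Ipi, dtype=np.float64)) / 2.0
    return denoise(L) if denoise is not None else L
```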

Fig. 3. Simulated training dataset.

Figure 4 shows the experimentally obtained training dataset. In Fig. 4, we give the training datasets of the two strategies, which use the same fringe patterns but take the noisy fringe part and the denoised fringe part as the labeled data, respectively. As seen from the enlarged local areas of the wooden doll and human face images, the denoised fringe part is clearer in detail than the noisy fringe part.

Fig. 4. Experimentally obtained training dataset.

In the training stage, overlapping patches are densely cropped from the input fringe patterns and from the corresponding output labeled data, and these small patches are used to train the DNN. In the testing stage, the input fringe pattern can likewise be cropped into small overlapped patches, these patches are predicted by the DNN, and the overlapping reconstructed patches (fringe part patches) are then aggregated to produce the final output (a fringe part with the same size as the input fringe pattern). Alternatively, the input fringe pattern can be tested directly by the DNN without cropping, because a fully convolutional network can receive input data of arbitrary size. In this paper, the patches are set to 40×40 pixels with an overlap of 10 pixels in each direction, and the uncropped fringe pattern is adopted in the testing stage for its simplicity of implementation.
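The patch cropping can be sketched as follows; the interpretation of the 10-pixel overlap as a sliding step of 30 pixels is our reading of the setting above, and other stepping schemes would work similarly.

```python
import numpy as np

def crop_patches(image, patch=40, overlap=10):
    """Densely crop an image into overlapped square patches.

    With a 40x40 window and a 10-pixel overlap the sliding step is 30 pixels.
    Border pixels beyond the last full window are ignored in this sketch.
    """
    step = patch - overlap
    h, w = image.shape
    patches = []
    for top in range(0, h - patch + 1, step):
        for left in range(0, w - patch + 1, step):
            patches.append(image[top:top + patch, left:left + patch])
    return np.stack(patches)

# Input and label images are cropped with the same grid so the patches stay aligned, e.g.
# x_patches = crop_patches(fringe_pattern); y_patches = crop_patches(labeled_fringe_part)
```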

3. Results and discussion

In this section, the DNN based on the DnCNN network is implemented in Python with the PyTorch framework (version 0.4.1) on a PC with an Intel(R) Core(TM) i5-7500H CPU (3.40 GHz), 16 GB of RAM, and a GeForce GTX 1080 (NVIDIA) graphics card. The DNN model shown in Fig. 1 contains 17 convolutional layers. The kernel size of the 1st to 16th layers is 3×3×64, the kernel size of the last layer is 3×3×1, the convolution stride is 1, and zero-padding is used to keep the input and output data the same spatial size. From the 2nd layer to the 16th layer, we use batch normalization to speed up network training and improve training precision, with a numerical stability constant (eps) of 0.0001 and a momentum of 0.95. The Adam optimization algorithm is used to train the neural network with a learning rate of 0.001. In the above DNN model, the ReLU activation function is used after the convolutional layers to better fit the nonlinear mapping. Training for 1000 epochs takes about 17 hours on the simulated dataset and about 12 hours on the experimental dataset.
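The following PyTorch sketch summarizes the network and training configuration described above (17 layers, 3×3 kernels with 64 feature channels, BN with eps 0.0001 and momentum 0.95, Adam with a learning rate of 0.001, and the residual loss of Eq. (2)). Details not stated in the text, such as bias settings and batch handling, are our assumptions.

```python
import torch
import torch.nn as nn

class DnCNN(nn.Module):
    """17-layer DnCNN: Conv+ReLU, then 15x (Conv+BN+ReLU), then Conv."""
    def __init__(self, channels=1, features=64, depth=17):
        super().__init__()
        layers = [nn.Conv2d(channels, features, 3, padding=1), nn.ReLU(inplace=True)]
        for _ in range(depth - 2):
            layers += [nn.Conv2d(features, features, 3, padding=1, bias=False),
                       nn.BatchNorm2d(features, eps=1e-4, momentum=0.95),
                       nn.ReLU(inplace=True)]
        layers += [nn.Conv2d(features, channels, 3, padding=1)]
        self.net = nn.Sequential(*layers)

    def forward(self, p):
        # Residual learning: the network predicts r = R(p); the fringe part is f = p - R(p).
        return self.net(p)

model = DnCNN()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.MSELoss()  # corresponds to Eq. (2) up to a constant factor

def train_step(pattern_batch, fringe_batch):
    """One training step on a batch of (fringe pattern, fringe part) patches of shape (B, 1, H, W)."""
    optimizer.zero_grad()
    residual_pred = model(pattern_batch)
    loss = criterion(residual_pred, pattern_batch - fringe_batch)
    loss.backward()
    optimizer.step()
    return loss.item()
```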

Both simulations and real experiments were conducted in this work. In the simulation, fringe patterns with 512×512 pixels are generated as the input data of the DNN, as shown in Figs. 3 and 5, where Gaussian random noise with a variance of 0.2 is added to the fringe patterns [14]. Since DnCNN is usually tailored to a specific noise level, training and testing datasets with the same noise level are used in this study. In the real experiment, the fringe patterns were captured by a CMOS camera with 8-bit pixel depth and 512×512 pixel resolution from the tested object, onto which the projector (DLP LightCrafter 4500) projects the fringe pattern.

Fig. 5. Simulated fringe pattern (a-1), true fringe part (a-2), and true phase (a-3).

3.1 Patch based strategy validation

To validate the effectiveness of the proposed patch strategy, the DNN with the patch strategy (DNN1) was compared with the DNN without the patch strategy (DNN2). The DNN1 and DNN2 phase retrieval algorithms, implemented as shown in Fig. 2, are identical except for the training dataset. Figures 5(a-1)–5(a-3) respectively show the simulated fringe pattern, the true fringe part, and the true phase. Figure 6 shows the fringe parts extracted by DNN1 and DNN2. In Fig. 6, the training datasets with and without the patch strategy are trained using networks with the same parameters. For the training dataset with the patch strategy, although only 8 fringe patterns are used as training samples, the number of overlapped small patches (of 40×40 pixels) is as large as 45824; that is, 45824 patches were obtained from the original fringe patterns.

Fig. 6. Fringe parts from the simulated fringe pattern by the DNN with patch strategy (DNN1) and without patch strategy (DNN2). (a-1) and (a-2): Extracted fringe parts by DNN1 and DNN2; (b-1) and (b-2): the error of the fringe parts corresponding to Figs. 6(a-1) and 6(a-2).

Figures 6(a-1) and 6(a-2) respectively show the fringe parts extracted by the DNN1 and DNN2 approaches, with their errors shown in Figs. 6(b-1) and 6(b-2), respectively. Similarly, Figs. 7(a-1) and 7(a-2) respectively show the unwrapped phases obtained by the two methods, with their phase errors shown in Figs. 7(b-1) and 7(b-2). The MSEs of the fringe parts are 1.95×10⁻³ and 8.32×10⁻³ for DNN1 and DNN2, and the MSEs of the unwrapped phase are 2.77×10⁻² and 5.32×10⁻², respectively. Compared with DNN2, DNN1 extracts the fringe part with smaller error and thus preserves more details of the phase edges, as shown in Figs. 7(b-1), 7(b-2) and 7(c). The proposed patch strategy was also validated with real fringe patterns in fringe projection. Figure 8 shows the fringe parts and phase results extracted from one real fringe pattern by DNN1 and DNN2, with the results from the four-step phase shift method used as reference. As seen in Fig. 8, the results by DNN2 show more artifacts in the fringe parts and phase results than those by DNN1. There are some ripples in the phase results of both DNN1 and DNN2 because of the low quality of the fringe pattern of the measured object, which is subject to inhomogeneous reflection. To sum up, the proposed method with the patch strategy extracts fringe parts with better performance.

Fig. 7. Phase results from the simulated fringe pattern by the DNN with patch strategy (DNN1) and without patch strategy (DNN2). (a-1) and (a-2): Phase results extracted by DNN1 and DNN2; (b-1) and (b-2): phase errors of Figs. 7(a-1) and 7(a-2); (c): plots of Figs. 7(a-1) and 7(a-2) along the 255th column.

Fig. 8. Fringe parts and phase results from a real fringe pattern by the DNN with patch strategy (DNN1) and without patch strategy (DNN2). (a-1): Real fringe pattern; (a-2) and (a-3): extracted fringe parts by DNN1 and DNN2, respectively; (b-1), (b-2) and (b-3): phase results extracted by the four-step phase shift method, DNN1 and DNN2, respectively.

3.2 Label enhancement validation

Figures 9(a) and 9(b) respectively show experimental fringe patterns of a human face and a hand, neither of which is included in the training dataset. Figure 10 shows the results for these fringe patterns using the two different network training strategies. In detail, Figs. 10(a-1) and 10(a-2) are the fringe parts extracted from Fig. 9(a) by the DNN without and with label enhancement, while Figs. 10(b-1) and 10(b-2) respectively show the fringe parts extracted from Fig. 9(b) by the two approaches. The corresponding phase results are shown in Figs. 11(a-1) and 11(a-2) for Fig. 9(a) and in Figs. 11(b-1) and 11(b-2) for Fig. 9(b), respectively. The 255th column of Figs. 11(a-1) and 11(a-2) and the 200th column of Figs. 11(b-1) and 11(b-2) are plotted in Figs. 11(c-1) and 11(c-2), respectively. As seen, the DNN realizes automatic extraction of the fringe parts by learning the mapping between the input fringe pattern and the output fringe part. From the comparison we can see that Figs. 11(a-1) and 11(b-1) contain more noise than Figs. 11(a-2) and 11(b-2), respectively, indicating that the DNN with label enhancement performs fringe part enhancement automatically. It can also be seen from Figs. 11(c-1) and 11(c-2) that the prediction results of the network trained with label enhancement are smoother than those without label enhancement. Overall, the above results show that an enhanced fringe part can be extracted from a noisy fringe pattern, and the accuracy of the phase extraction result is improved by learning the mapping between the noisy fringe pattern and the denoised and enhanced fringe part.

Fig. 9. Real fringe patterns in fringe projection 3D measurement. (a): Fringe pattern of a human face; (b): fringe pattern of a human hand.

Fig. 10. Fringe part extraction by the DNN without label enhancement (DNN) and with label enhancement (DNN_Enhance). (a-1): Extracted fringe part from Fig. 9(a) by the DNN without label enhancement; (a-2): extracted fringe part from Fig. 9(a) by the DNN with label enhancement; (b-1): extracted fringe part from Fig. 9(b) by the DNN without label enhancement; (b-2): extracted fringe part from Fig. 9(b) by the DNN with label enhancement.

Fig. 11. Phase retrieval by the DNN without label enhancement (DNN) and with label enhancement (DNN_Enhance). (a-1): Extracted phase from Fig. 9(a) by the DNN without label enhancement; (a-2): extracted phase from Fig. 9(a) by the DNN with label enhancement; (b-1): extracted phase from Fig. 9(b) by the DNN without label enhancement; (b-2): extracted phase from Fig. 9(b) by the DNN with label enhancement; (c-1): plot of the phase from Figs. 11(a-1) and 11(a-2) along the 255th column; (c-2): plot of the phase from Figs. 11(b-1) and 11(b-2) along the 200th column.

3.3 Comparisons with other methods and application in dynamic object measurement

In order to further show the performance of the proposed phase retrieval method, we compare it with the FT method and the four-step phase shift method using two real experimental fringe patterns. Figures 12(a) and 12(b) respectively show the captured fringe patterns of a human face and a plastic box. Figure 13 shows the processed results for Fig. 12, where the unwrapped phases obtained by the four-step phase shift method, the FT method, and the proposed method are shown. In detail, Figs. 13(a-1)–13(a-3) respectively show the phase retrieval results for Fig. 12(a) with the four-step phase shift method, the FT method, and the proposed method, while Figs. 13(b-1)–13(b-3) respectively show the unwrapped phase results for Fig. 12(b) with the three methods. Figures 13(c-1) and 13(c-2) give the plots of the unwrapped phase data at the 255th row for Fig. 12(a) and at the 255th column for Fig. 12(b), respectively. The plots in Figs. 13(c-1) and 13(c-2) show that the phase results of the proposed method are closer to those of the phase shift method than those obtained by the FT method. As shown in the inset of Fig. 13(c-2), the proposed method preserves more details of the phase, especially for objects with abrupt changes such as the edges of the plastic box. We can conclude from Fig. 13 that the proposed method produces phase closer to that of the phase shift method than the FT method does, and yields better visual quality for the 3D human face. As to the computation time, the FT method, although simple in implementation, needs 0.49 s to achieve the fringe part extraction, while our proposed method requires only 0.05 s once the model is loaded, which is about ten times faster than the FT method.

Fig. 12. Real fringe patterns in fringe projection 3D measurement. (a): Fringe pattern of a human face; (b): fringe pattern of a plastic box.

Fig. 13. Phase results by the phase shift, FT and DNN methods. (a-1): Phase shift method for Fig. 12(a); (a-2): FT for Fig. 12(a); (a-3): proposed DNN method for Fig. 12(a); (b-1): phase shift method for Fig. 12(b); (b-2): FT for Fig. 12(b); (b-3): proposed DNN method for Fig. 12(b); (c-1): plots of the 255th row of Figs. 13(a-1), 13(a-2) and 13(a-3); (c-2): plots of the 255th column of Figs. 13(b-1), 13(b-2) and 13(b-3).

To further investigate the performance of the proposed method in dynamic fringe projection 3D measurement, experimental fringe patterns were captured by the CMOS camera at a frame rate of 100 Hz while a hand was in motion, and then tested. The phase of this set of fringe patterns was retrieved by the proposed method and by the FT method, respectively. The trained network is the same as that used for Fig. 13. Figure 14 shows the experimental fringe patterns of the hand in motion at 6 different times, and Fig. 15 shows the phases reconstructed from these fringe patterns by the proposed method and the FT method, respectively. As seen, the proposed method preserves more details of the phase than the FT method, and its results are more reasonable than those of the FT method, for example in the fingers of the hand. As to computational efficiency, once the model is loaded it can predict a set of images without repeated loading, so it achieves fringe part extraction in 0.05 s per fringe pattern of 512×512 pixels. In contrast, traditional methods such as the FT method require repeated computation for each image of the set. Therefore, the proposed method is more suitable for phase retrieval in dynamic fringe projection 3D measurement.

Fig. 14. Experimental fringe patterns of the hand in motion at 6 different times.

Fig. 15. Phase results of the hand in motion at 6 different times by the FT method and the DNN method.

4. Conclusion

Phase retrieval from a single frame fringe pattern remains one of the most challenging open problems in fringe projection 3D measurement. In this paper, a label enhanced and patch based deep learning phase retrieval approach is proposed to achieve good performance in both accuracy and computational efficiency by learning the mapping between the input fringe pattern and the desired output fringe part with a DNN. Different from previous work, this method can effectively denoise and enhance the fringe part to improve the accuracy of the phase extraction results for objects with edges. In the proposed method, the real fringe pattern and the corresponding denoised fringe part are used as the input data and output labeled data of the DNN, so that the trained network can predict an enhanced fringe part for a given fringe pattern. More importantly, we demonstrate for the first time the advantage of the patch strategy: cropping the original fringe patterns into overlapped patches expands the number of training samples. Experimental results demonstrate that the proposed DNN with the patch strategy can extract the fringe part from a training dataset containing only a few fringe patterns. The effectiveness of the proposed method is verified with experimentally obtained fringe patterns of a human face and of a hand in motion, and the results show that, compared with the FT method, the proposed method effectively preserves the edges of the 3D object in phase retrieval while being fast and accurate. This work may also benefit other phase retrieval problems, such as those in digital holography, where labeled data are difficult to obtain. In addition, since our work contributes a way of constructing the training data for phase retrieval, it can be extended to other DNNs such as U-Net and ResNet to further improve phase retrieval results.

Since our method learns the mapping between the input fringe pattern and the output labeled fringe part obtained experimentally, the performance of the proposed method could be improved in future work by removing the nonlinear error in the labeled data. One could also improve the performance by denoising the labeled data with a more effective filtering method, to deal with cases where high-level speckle noise exists in the fringe pattern, or where objects have abrupt changes and tiny details.

Funding

The Program for Innovative Research Team in University of Tianjin (TD13-5036); Natural Science Foundation of Tianjin City (16JCYBJC15400, 18JCQNJC04400, 18JCQNJC71100); National Natural Science Foundation of China (51806150, 60808020, 61078041, 61905178).

References

1. S. S. Gorthi and P. Rastogi, “Fringe projection techniques: Whither we are?” Opt. Lasers Eng. 48(2), 133–140 (2010).

2. L. Song, X. Li, Y. Yang, X. Zhu, Q. Guo, and H. Liu, “Structured-light based 3D reconstruction system for cultural relic packaging,” Sensors 18(9), 2981 (2018).

3. B. Li, Y. An, D. Cappelleri, J. Xu, and S. Zhang, “High-accuracy, high-speed 3D structured light imaging techniques and potential applications to intelligent robotics,” Int. J. Intell. Robot. Applic. 1(1), 86–103 (2017).

4. S. Zhang, “Absolute phase retrieval methods for digital fringe projection profilometry: A review,” Opt. Lasers Eng. 107, 28–37 (2018).

5. C. Zuo, S. Feng, L. Huang, T. Tao, W. Yin, and Q. Chen, “Phase shifting algorithms for fringe projection profilometry: a review,” Opt. Lasers Eng. 109, 23–59 (2018).

6. S. Zhang, “High-speed 3D shape measurement with structured light methods: a review,” Opt. Lasers Eng. 106, 119–131 (2018).

7. X. Zhu, L. Song, H. Wang, and Q. Guo, “Assessment of fringe pattern decomposition with a cross-correlation index for phase retrieval in fringe projection 3D measurements,” Sensors 18(10), 3578 (2018).

8. M. Takeda and K. Mutoh, “Fourier transform profilometry for the automatic measurement of 3-D object shapes,” Appl. Opt. 22(24), 3977–3982 (1983).

9. H. Lei, K. Qian, P. Bing, and K. Anand, “Comparison of Fourier transform, windowed Fourier transform, and wavelet transform methods for phase extraction from a single fringe pattern in fringe projection profilometry,” Opt. Lasers Eng. 48(2), 141–148 (2010).

10. A. Abid, M. Gdeisat, D. Burton, M. Lalor, and F. Lilley, “Spatial fringe pattern analysis using the two-dimensional continuous wavelet transform employing a cost function,” Appl. Opt. 46(24), 6120–6126 (2007).

11. X. Zhou, A. G. Podoleanu, Z. Yang, T. Yang, and H. Zhao, “Morphological operation-based bi-dimensional empirical mode decomposition for automatic background removal of fringe patterns,” Opt. Express 20(22), 24247–24262 (2012).

12. X. Zhu, Z. Chen, and C. Tang, “Variational image decomposition for automatic background and noise removal of fringe patterns,” Opt. Lett. 38(3), 275–277 (2013).

13. X. Zhu, C. Tang, B. Li, C. Sun, and L. Wang, “Phase retrieval from single frame projection fringe pattern with variational image decomposition,” Opt. Lasers Eng. 59, 25–33 (2014).

14. B. Li, C. Tang, X. Zhu, Y. Su, and W. Xu, “Shearlet transform for phase extraction in fringe projection profilometry with edges discontinuity,” Opt. Lasers Eng. 78, 91–98 (2016).

15. B. Li, C. Tang, X. Zhu, X. Chen, Y. Su, and Y. Cai, “A 3D shape retrieval method for orthogonal fringe projection based on a combination of variational image decomposition and variational mode decomposition,” Opt. Lasers Eng. 86, 345–355 (2016).

16. C. Dong, C. C. Loy, K. He, and X. Tang, “Image super-resolution using deep convolutional networks,” IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2016).

17. Y. Rivenson, Y. Zhang, H. Günaydin, D. Teng, and A. Ozcan, “Phase recovery and holographic image reconstruction using deep learning in neural networks,” Light: Sci. Appl. 7(2), 17141 (2018).

18. K. H. Jin, M. T. McCann, E. Froustey, and M. Unser, “Deep convolutional neural network for inverse problems in imaging,” IEEE Trans. Image Process. 26(9), 4509–4522 (2017).

19. H. Wang, M. Lyu, and G. Situ, “eHoloNet: a learning-based end-to-end approach for in-line digital holographic reconstruction,” Opt. Express 26(18), 22603–22614 (2018).

20. S. Feng, Q. Chen, G. Gu, T. Tao, L. Zhang, Y. Hu, W. Yin, and C. Zuo, “Fringe pattern analysis using deep learning,” Adv. Photon. 1(02), 1 (2019).

21. S. Feng, C. Zuo, W. Yin, G. Gu, and Q. Chen, “Micro deep learning profilometry for high-speed 3D surface imaging,” Opt. Lasers Eng. 121, 416–427 (2019).

22. J. Zhang, X. Tian, J. Shao, H. Luo, and R. Liang, “Phase unwrapping in optical metrology via denoised and convolutional segmentation networks,” Opt. Express 27(10), 14903–14912 (2019).

23. G. E. Spoorthi, S. Gorthi, and R. Gorthi, “PhaseNet: A deep convolutional neural network for two-dimensional phase unwrapping,” IEEE Signal Process. Lett. 26(1), 54–58 (2019).

24. K. Wang, Y. Li, K. Qian, J. Di, and J. Zhao, “One-step robust deep learning phase unwrapping,” Opt. Express 27(10), 15100–15115 (2019).

25. K. Zhang, W. Zuo, Y. Chen, D. Meng, and L. Zhang, “Beyond a Gaussian denoiser: residual learning of deep CNN for image denoising,” IEEE Trans. Image Process. 26(7), 3142–3155 (2017).

26. D. Labate, L. Mantovani, and P. S. Negi, “Shearlet smoothness spaces,” J. Fourier Anal. Appl. 19(3), 577–611 (2013).
