Flexible multicamera calibration method with a rotating calibration plate

Huajun Cai; Yang Song; Yuqing Shi; Zheng Cao; Zhenyan Guo; Zhenhua Li; Anzhi He

doi:10.1364/OE.402761

1. Introduction

Camera calibration plays an important role in both image measurements and machine vision tasks. Under most circumstances, camera parameters, including intrinsic, extrinsic, and distortion parameters, are obtained through calculations. Camera calibration calculates the relationship between 3D calibration points in space and 2D coordinates on the image plane. Therefore, it is essential to define a 3D world coordinate system and obtain the 3D coordinates of calibration points. The camera calibration is a prerequisite for any work that follows. It is obvious that calibration results directly influence the accuracy of image measurements.

To achieve accurate image measurement results, it is necessary to utilize a calibration object to define the 3D coordinates of the calibration points. First, the 3D world coordinates are usually defined on the calibration object. Cameras then capture images of the calibration object to obtain the 2D image coordinates corresponding to these 3D calibration points. Thus, the associations between the 3D coordinates of the calibration point in the world coordinate system and the 2D coordinates in the image coordinate system are determined. Finally, the intrinsic and extrinsic parameters of the camera are calculated by means of several calibration methods, for example, direct nonlinear minimization [1–3], close-form solution [4,5], and a two-step method [6–8].

Calibration objects are indispensable in the camera calibration process. There are approximately four types of camera calibration methods, depending on the nature of the calibration object. The first is the self-calibration method [9–11]. A camera can be calibrated directly from an image sequence, despite unknown motion and changes in some of the intrinsic parameters. When a series of images of a fixed scene is taken, the absolute conic is fixed, and once this is determined, the metric geometry can be computed [12]. Since this method does not require pre-determined calibration objects, it can quickly produce camera parameters. However, the results obtained are unreliable [13]. The second method is to employ a plane calibration plate, such as in Zhang’s method [13]. By randomly placing the plane calibration plate in different orientations, the intrinsic parameters of the camera are calculated. Although this method is easily implemented, the calibration results vary with different orientations of the calibration plate. To some extent, the calculated intrinsic and extrinsic camera parameters obtained with this method only converge to local optimal solutions that correspond to specific orientation sequences of the calibration plate. The third method is to adopt a linear translation stage to drive the plane calibration plate for translational movement, such that the world coordinate system is determined by the known movement of the plane calibration plate [14,15]. This method requires extreme precision in the installation orientation of the calibration plate and the positioning accuracy of the translation stage. The fourth method is based on a 3D calibration object [16,17], on which calibration points are printed. In this method, the 3D coordinates of the calibration points can be obtained directly, which is important for accurate camera calibration. However, this method requires high precision in the manufacturing stage of the calibration object [13]. On the other hand, if multicamera calibration is performed, the limitation of the field of view means that each camera may only be able to capture a limited number of calibration points for stereo calibration of the object, resulting in fewer calibration points that can be employed in the solution, and hence influencing the stability of the results.

The multicamera system has been widely employed in many fields, for example, in the measurement of fluids [14,15,18], 3D reconstruction [19,20], and multicamera tracking [21,22]. The multicamera system can achieve accurate measurements, but calibration is more complicated. The inward-looking [23] multicamera setup is commonly used in the experiment of background oriented schlieren (BOS) [18,24] or the flame chemiluminescence tomography (FCT) [16,25]. The cameras are usually distributed over a wide range in order to acquire sufficient data. And there exists a volume that in the field of view of all cameras. Under this circumstance, it is impossible for all cameras to see a calibration plate simultaneously, thus the unified world coordinate system is difficult to be determined. Therefore, Shen and Hornsey [23] present a calibration method based on a novel 3D target. While a 3-D laser scanning system has to be employed to determine the actual target configuration due to the manufacturing tolerances. Reference [16] utilizes a special 3D calibration object. The calibration points are printed on several known planes, such that the world coordinate system is directly determined by the calibration object. However, making an accurate calibration object is more difficult and expensive, and the calibration points are limited to some special plane. Feng et al. [26] proposed a method using a transparent glass calibration board. All cameras can capture the calibration points simultaneously from different positions and orientations. However, the refractive phenomenon from the transparent glass needs to be considered in the calibration process. Besides, this method may be invalid for cameras that are at a large angle to the calibration plate due to the refractive phenomenon. A 2D standard checkerboard is employed for the calibration of a 23-camera setup [24]. The coordinate system of each camera is transformed to the first pair of cameras one after another. The calibration process is complicated and the error may be amplified by excessive transformation coordinate system. Thus, it is necessary to develop a new multicamera calibration method.

In this paper, we propose a flexible method for multicamera calibration that is easily implemented. Our method does not require a special calibration object, but only a calibration plate. Furthermore, it does not require precision in the movement of the calibration plate. The only requirement is rotation around a fixed axis. Our method also does not require all cameras to be able to see the calibration plate simultaneously, as long as there are overlapping fields of view between adjacent groups. In Section 2, we describe how we calculated the world coordinates of rotating calibration points using multi-view stereo vision. Section 3 presents the world coordinate system and camera coordinate system determined by rotating calibration points. Section 4 shows the experiments and results.

2. World coordinates of rotating calibration points

2.1 Camera imaging model

Visual applications usually employ the pinhole model of a camera, where all the light is concentrated through the optical center of the camera. The homogeneous coordinate of a 3D point in the world coordinate system is denoted by ${M_w} = {({x_w},{y_w},{z_w},1)^T}$, and a point in the camera and pixel coordinate systems is denoted as ${M_c} = {({x_c},{y_c},{z_c},1)^T}$ and $m = {(u,v,1)^T}$, respectively, as shown in Fig. 1. Based on the pinhole model, the transformation from a 3D point to its 2D image point is given by:

(1)$$sm = {\mathbf P}{M_w},$$

(2)$${\mathbf P} = {\mathbf K}\left[ {\begin{array}{cc} {\mathbf R}&{\mathbf t}\\ {{0^T}}&1 \end{array}} \right] = \left[ {\begin{array}{cccc} {{p_{11}}}&{{p_{12}}}&{{p_{13}}}&{{p_{14}}}\\ {{p_{21}}}&{{p_{22}}}&{{p_{23}}}&{{p_{24}}}\\ {{p_{31}}}&{{p_{32}}}&{{p_{33}}}&{{p_{34}}} \end{array}} \right],$$

(3)$${\mathbf K} = \left[ {\begin{array}{cccc} {{f_u}}&0&{{u_0}}&0\\ 0&{{f_v}}&{{v_0}}&0\\ 0&0&1&0 \end{array}} \right],$$

where ${\mathbf R}$ is a 3×3 rotation matrix, and ${\mathbf t}$ is a 3×1 translation vector. These are the extrinsic parameters, and ${\mathbf K}$ is the intrinsic matrix. The scale factors along the image axes u and v are denoted by ${f_u}$ and ${f_v}$, and s is a nonzero scale factor. The principal point is denoted by $({u_0},{v_0})$, and ${\mathbf P}$ denotes the camera projection matrix.

Fig. 1. Camera imaging model.

Camera	$f_{u}$	$f_{v}$	$u_{0}$	$v_{0}$	$k_{1}$	$k_{2}$
1	3198.26	3200.13	637.93	466.81	-0.05	3.56
2	3201.84	3203.46	631.11	480.14	0.21	-39.57
3	3212.78	3213.82	665.77	449.14	0.07	-9.80
4	3209.43	3209.18	593.83	465.27	-0.17	20.67
5	3215.38	3214.03	652.95	475.53	-0.02	-1.60
6	3186.35	3184.86	627.80	458.52	0.09	-11.84
7	3213.40	3212.90	631.39	455.83	-0.09	18.31
8	3204.43	3201.16	654.25	491.51	0.07	-9.36
9	3232.82	3236.06	614.41	485.65	-0.06	12.17
10	3189.41	3191.02	643.69	475.62	0.23	-38.91
11	3184.91	3189.10	652.90	484.02	0.09	-15.10
12	3188.94	3191.51	628.97	483.45	-0.13	13.16

Camera	yaw/°	roll/°	pitch/°	tx/mm	ty/mm	tz/mm
1	73.31	-1.40	33.48	-410.24	625.83	-235.47
2	105.06	0.20	34.11	-416.34	636.10	196.36
3	74.46	-1.40	22.56	-296.36	739.23	-228.21
4	104.70	1.82	21.81	-303.71	747.83	194.24
5	74.17	-1.98	0.73	-2.13	840.90	-241.06
6	104.26	-0.17	0.31	-2.38	853.75	199.79
7	74.69	-1.55	-21.54	313.89	787.05	-233.22
8	103.91	-1.05	-22.64	312.51	800.27	198.40
9	72.70	3.04	-54.90	619.91	455.02	-254.29
10	105.86	2.30	-58.27	655.16	402.19	210.25
11	71.67	2.14	-66.19	639.83	292.81	-242.52
12	105.34	2.89	-70.88	676.91	244.15	202.78

Abstract

1. Introduction

2. World coordinates of rotating calibration points

2.1 Camera imaging model

2.2 Multi-camera system

2.3 World coordinates of rotating calibration points

3. World coordinate system and camera coordinate system

3.1 World coordinate system

3.2 Camera coordinate system

3.3 Unified coordinate system

3.4 Bundle adjustment

4. Experiments and results

4.1 Accuracy of 3D rotating points

4.2 Four-camera experiment

4.3 Eight-camera experiment

4.4 Twelve-camera experiment

5. Summary

Funding

Disclosures

References

Cited By

Figures (18)

Tables (2)

Equations (16)

Optics Express