
Geometric calibration for LiDAR-camera system fusing 3D-2D and 3D-3D point correspondences

Open Access

Abstract

Calibrating the extrinsic parameters of a system consisting of a 3D Light Detection And Ranging (LiDAR) sensor and a monocular camera is a challenging task, because accurate 3D-2D or 3D-3D point correspondences are hard to establish from the sparse LiDAR point clouds during the calibration procedure. In this paper, we propose a geometric calibration method for estimating the extrinsic parameters of the LiDAR-camera system. In this method, a novel combination of planar boards with chessboard patterns and auxiliary calibration objects is proposed. The planar chessboard provides 3D-2D and 3D-3D point correspondences, while the auxiliary calibration objects provide extra constraints for stable calibration results. A novel geometric optimization framework is then proposed to utilize these point correspondences, leading to calibration results that are robust to LiDAR sensor noise. Besides, we contribute an automatic approach to extract the point clouds of the calibration objects. In the experiments, our method shows superior performance over state-of-the-art calibration methods. Furthermore, we verify our method by computing depth maps, where improvements can also be found. These results demonstrate that the performance of our method on the LiDAR-camera system is suitable for future advanced visual applications.

© 2020 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

Multi-sensor systems equipped with cameras are vital modules in many visual and interactive applications, such as the geometric multi-camera imaging system [1,2], the light field imaging system [3,4], the binocular stereo vision measurement system [5], the spacecraft optical system [6], and the 3D Light Detection And Ranging (LiDAR) and camera (LiDAR-camera) system [7]. Among these imaging systems, the LiDAR-camera system is widely used in the field of robotic vision, for example to detect 3D objects [8] and solve navigation tasks [9]. In these applications, based on the distances measured by light beams, a LiDAR sensor can generate a sparse point cloud of the surroundings. The main advantage of LiDAR is its active illumination, which works independently of ambient light. However, the disadvantages of LiDAR are its high cost and limited resolution; for example, the Velodyne-64 LiDAR measures only 64 channels at a low refresh rate. Besides, a LiDAR sensor cannot measure RGB information. An RGB camera is relatively cheap, and it can produce high-resolution color images at a high frame rate, but it cannot measure depth information directly. In a word, a LiDAR sensor produces sparse 3D information while the camera captures dense 2D information. Fortunately, by fusing the measurements of LiDAR and camera, most of the shortcomings of LiDAR sensors can be compensated by RGB cameras and vice versa. The LiDAR-camera system can then perceive and analyze target objects from a more advanced and intelligent view. However, sensor fusion requires the extrinsic parameters between the LiDAR and the camera, so it is essential to calibrate the extrinsic parameters of these sensors in advance.

The key to LiDAR-camera system calibration is to find geometric relationships from co-observable features [10]. Although feature points in the 2D image can be easily detected, the corresponding 3D points in the sparse LiDAR point cloud are hard to identify. So, the core problem of LiDAR-camera calibration is to exploit point correspondences. To solve this problem, a general approach is to design calibration objects whose corner points can be used to establish point correspondences. Calibration objects can be divided into 1D objects, such as line objects with aligned points [11,12]; 2D objects, such as polygonal planar boards [10], ordinary planar boards [13], planar boards with chessboard patterns [14], planar boards with rectangular holes [15], and planar boards with circular holes [16]; and 3D objects, such as ordinary boxes [17]. Besides, there exist methods using no calibration objects. In [18], visual odometry and LiDAR odometry are fused to estimate the relative pose of both sensors, where the LiDAR odometry is obtained via the iterative closest point (ICP) algorithm [19]. Other methods find 3D-2D correspondences using mutual information [20]. Although many works have been proposed, they may face challenges when calibrating a low-resolution LiDAR-camera system, because the point cloud is too sparse to establish point correspondences reliably.

Motivated by this, we propose a novel combination of calibration objects and a geometric calibration method for estimating the extrinsic parameters of the LiDAR-camera system. Firstly, we design a new combination of calibration objects, which contains rectangular planar boards with chessboard patterns and auxiliary calibration objects. The improved planar chessboard can provide 3D-2D and 3D-3D point correspondences. In this paper, auxiliary calibration objects are defined as 2D planar boards temporarily found in the field of view (FOV) of the camera, which can provide extra 3D-2D point correspondences. After that, a novel geometric optimization framework is proposed, which considers all 3D-2D and 3D-3D point correspondences. The extrinsic parameters can be solved via Bundle Adjustment (BA) [21]. By fusing the information of 3D-2D and 3D-3D point correspondences, the proposed method can estimate the positions of corner points accurately, leading to calibration results that are robust to LiDAR sensor noise. Besides, we also present an automatic approach to extract the LiDAR point clouds of planar calibration objects. Experimental results and 3D reconstruction demonstrate that the proposed planar calibration objects and geometric calibration method are helpful for the LiDAR-camera system. We believe that it will contribute to future advanced visual applications.

The remainder of this paper is organized as follows. We briefly survey the field of LiDAR-camera system calibration in Sec. 2. After that, the proposed LiDAR-camera system calibration based on the novel planar boards with chessboard patterns is presented in detail in Sec. 3. In Sec. 4, experiment settings, calibration results, and reconstructions are presented and discussed. Finally, we conclude our work in Sec. 5.

2. Related works

LiDAR-camera calibration concerns two kinds of parameters: the intrinsic and the extrinsic parameters of the system. The intrinsic parameters of most LiDAR sensors are calibrated in advance by the manufacturers; alternatively, they can be estimated via the method in [13]. The intrinsic parameters of cameras can be calibrated with the traditional method [22]. In this paper, we assume that the intrinsic parameters of the LiDAR and the camera are both known in advance. As for the extrinsic parameters of LiDAR-camera systems, many insightful approaches have been proposed, such as methods based on calibration objects [10–17], a method based on odometry fusion [18], a method based on mutual information [20], and a method based on deep learning [23]. In methods based on 2D and 3D calibration objects, the corner points of the calibration objects are used to establish point correspondences, and then the relative pose of the LiDAR-camera system can be computed by the Effective Perspective-n-Points (EPnP) algorithm [24]. However, the accuracy of the EPnP algorithm depends heavily on the precision of the 3D points. Among calibration objects, the 1D calibration object is the most special due to its simplicity. Previous works [11,12] calibrate the intrinsic and extrinsic parameters of the camera using 1D calibration objects, and these methods can be extended to LiDAR-camera calibration. But detecting the endpoints of 1D calibration objects accurately in the sparse point cloud is a challenging task. Other methods [15,25] attempt to establish 3D-3D point correspondences and then apply ICP algorithms [19,26] to solve the problem. However, these methods require calibrated binocular cameras to obtain the positions of the 3D points, so they cannot be used with monocular cameras. The method based on odometry fusion needs to optimize the odometry information of the camera and the LiDAR [18], respectively, where the LiDAR odometry is obtained via the ICP algorithm [19]. However, for low-resolution LiDAR, the ICP algorithm may produce poor or even non-convergent estimates, because the point clouds are too sparse to find sufficient matched points. The mutual information method matches the camera image against images generated from the LiDAR data [20]. However, sparse point clouds generate sparse images, which decreases the accuracy of the matching results, and a Graphics Processing Unit (GPU) is also required to accelerate the calibration process [20]. Methods based on deep learning can calibrate the LiDAR-camera system without using calibration objects, odometry information, or mutual information, but their accuracy depends on the size of the training data set and the architecture of the deep learning networks. From the above discussion, it is still challenging to calibrate the LiDAR-camera system with high precision. Therefore, we propose a novel combination of calibration objects and a geometric calibration method that is robust to LiDAR sensor noise. Our work proves helpful in calibrating such multi-sensor systems.

3. Method

3.1 Problem statement

The problem of calibrating the LiDAR-camera system is presented in Fig. 1. A model of the calibration objects and the LiDAR-camera system is shown in Fig. 1(a). In this system, the LiDAR and the camera are both mounted on a base, and their relative pose is fixed. We propose a novel combination of calibration objects, which contains planar chessboards and auxiliary calibration objects. Planar chessboards are shown as the pink boards in Fig. 1(a). $P_{i}(i=1,\ldots ,8)$ denotes a corner point of the planar chessboards. The coordinate system of the i-th planar chessboard is $O_{pi}-X_{pi}Y_{pi}Z_{pi}$, where $O_{pi}$ denotes the upper-left corner point of the i-th planar chessboard. This is discussed in detail in Sec.3.2.1. The auxiliary calibration object is an arbitrary planar board, shown as the orange board in Fig. 1(a). $A_{i}(i=1,2)$ denotes a corner point of the auxiliary calibration object. For stable calibration results, it is recommended to use at least two planar chessboards and one auxiliary calibration object; the reason is given in Sec.4.2.5. An auxiliary calibration object, such as the wooden table in Fig. 1(b), is temporarily found and can be used for calibration by providing extra constraints. These calibration objects are discussed in Sec.3.2.1. The LiDAR coordinate system is represented as $O_{d}-X_{d}Y_{d}Z_{d}$, whose origin is the position of the LiDAR. The camera coordinate system is denoted as $O_{c}-X_{c}Y_{c}Z_{c}$, whose origin is the optical center of the camera. The extrinsic parameters of the LiDAR-camera system are the rotation matrix $\mathbf {R}$ and the translation vector $T$, which are shown in Fig. 1(a). $\mathbf {R}$ and $T$ define the rigid transformation between the LiDAR coordinate system $O_{d}-X_{d}Y_{d}Z_{d}$ and the camera coordinate system $O_{c}-X_{c}Y_{c}Z_{c}$. The proposed calibration method aims to estimate $\mathbf {R}$ and $T$.

Fig. 1. Representation of calibration objects and LiDAR-camera system. (a) Geometric representation of planar chessboards, auxiliary calibration objects, and the LiDAR-camera system. $\mathbf {R}$ and $T$ are the extrinsic parameters for calibration. (b) Two planar chessboards and one auxiliary calibration object. It is recommended to use at least two planar chessboards and one auxiliary calibration object. Point $P_{i}(i=1,\ldots ,8)$ is a corner point of the planar chessboards. Point $A_{i}(i=1,2)$ is a corner point of the auxiliary calibration object.

A projection model of the camera in the LiDAR-camera system is shown in Fig. 2. In $O_{c}-X_{c}Y_{c}Z_{c}$, the $Z_{c}-axis$ coincides with the camera optical axis, and the $X_{c}-axis$ and $Y_{c}-axis$ coincide with the vertical and horizontal axes of the image plane $c-uv$ [27]. Let $\mathbf {K}$ be the intrinsic matrix of the camera, given as:

$$\mathbf{K}={ \left( \begin{array}{ccc} f_{u} & 0 & u_{0}\\ 0 & f_{v} & v_{0}\\ 0 & 0 & 1 \end{array} \right )}$$

Fig. 2. Projection model of the camera in a LiDAR-camera system. $P$ and $I$ represent the 3D-2D point correspondence.

where $f_{u}$ is the focal length along the $u-axis$ in pixels, $f_{v}$ is the focal length along $v-axis$ in pixels [22]. $(u_{0},v_{0})$ denotes pixel coordinates of the principal point of the image plane (i.e., the intersection of the image plane and the optical axis) [27].

Mathematically, let us assume that there is an arbitrary point $P$ in free 3D space, as in Fig. 2. $P$ is visible to the camera and can also be measured by the LiDAR. Let $P^{d}=[X^{d},Y^{d},Z^{d}]^{T}$ represent the coordinates of $P$ in $O_{d}-X_{d}Y_{d}Z_{d}$, and $I=[u,v,1]^{T}$ represent the corresponding homogeneous image coordinates. The basic assumption here is that we do not consider lens distortion. According to the geometric properties of optical imaging [28], a 3D-2D point correspondence between $P^{d}$ and $I$ can be established as:

$$ZI=\mathbf{K}(\mathbf{R}P^{d}+T)$$
where $Z$ is the third element of vector $\mathbf {K}(\mathbf {R}P^{d}+T)$. If the coordinates of $P$ in $O_{c}-X_{c}Y_{c}Z_{c}$ are known as $P^{c}$, a 3D-3D point correspondence of $P^{d}$ and $P^{c}$ is established as:
$$P^{c}=\mathbf{R}P^{d}+T$$
In the following discussion, 3D-2D and 3D-3D point correspondences, as shown in Eqs. (2) and (3), are both considered in the proposed geometric optimization framework to calibrate the LiDAR-camera system.
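To make the two correspondence models concrete, the following minimal Python sketch evaluates Eqs. (2) and (3) for a single point (the numeric values of $\mathbf{K}$, $\mathbf{R}$, $T$, and $P^{d}$ are illustrative assumptions, not those used in the paper):

```python
import numpy as np

def project_3d_to_2d(K, R, T, P_d):
    """Eq. (2): project a LiDAR-frame point P_d to homogeneous pixel coordinates."""
    P_c = R @ P_d + T            # transform into the camera frame, Eq. (3)
    uvw = K @ P_c                # un-normalized image coordinates
    Z = uvw[2]                   # third element of K(R P_d + T)
    return uvw / Z               # I = [u, v, 1]^T

def transform_3d_to_3d(R, T, P_d):
    """Eq. (3): express a LiDAR-frame point in the camera frame."""
    return R @ P_d + T

# Example with arbitrary (illustrative) values.
K = np.array([[800.0, 0.0, 640.0],
              [0.0, 800.0, 360.0],
              [0.0, 0.0, 1.0]])
R = np.eye(3)
T = np.array([0.1, 0.0, 0.0])
P_d = np.array([1.0, 0.5, 4.0])
print(project_3d_to_2d(K, R, T, P_d))   # [u, v, 1]
```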

3.2 Methodology

3.2.1 Combination of calibration objects for LiDAR-camera system

A novel combination of calibration objects is proposed in this paper, which contains planar chessboards and auxiliary calibration objects. We use planar chessboard $1$, shown in Fig. 3, for illustration. The sizes of the planar board and chessboard patterns are known in advance. The $X_{p1}-axis$ and $Y_{p1}-axis$ coincide with the vertical and horizontal axes of the planar chessboard, respectively, and the vertical and horizontal edges of the chessboard patterns are parallel with the $X_{p}-axis$ and $Y_{p}-axis$, respectively. $C_{1}$ is the first feature point of the planar chessboard, and the remaining feature points are marked as the red points shown in Fig. 3. The configuration of planar chessboard $2$ is the same as that of chessboard $1$. However, the planar chessboard in this paper is slightly different from a general planar chessboard, because the bias vector $V_{b}$ is known. For planar chessboard $1$, $V_{b}$ denotes the coordinates of vector $O_{p1}C_{1}$ in $O_{p1}-X_{p1}Y_{p1}Z_{p1}$. If $V_{b}$ is known, the coordinates of all feature points of planar chessboard $1$ in $O_{p1}-X_{p1}Y_{p1}Z_{p1}$ are known, and then 3D-3D point correspondences can be established, as discussed in detail in Sec.3.2.3. Hence, compared with the general planar chessboard, which only provides 3D-2D point correspondences, our planar chessboard can provide not only 3D-2D but also 3D-3D point correspondences, thus achieving more accurate calibration results.
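As an illustration of how the known bias vector $V_{b}$ fixes the board-frame coordinates, the sketch below enumerates the chessboard feature points and the four board corners in $O_{p}-X_{p}Y_{p}Z_{p}$ (the square size, pattern dimensions, and board dimensions are assumed values, not the paper's; the origin is taken at the upper-left board corner as in Fig. 1):

```python
import numpy as np

def board_frame_points(V_b, square, rows, cols, board_h, board_w):
    """Feature points and board corners of one planar chessboard in O_p (Z_p = 0)."""
    # Inner feature points: C_1 sits at V_b, the rest follow the square grid.
    feats = np.array([[V_b[0] + r * square, V_b[1] + c * square, 0.0]
                      for r in range(rows) for c in range(cols)])
    # The four physical board corners, measured from the origin O_p (upper-left corner).
    corners = np.array([[0.0, 0.0, 0.0],
                        [0.0, board_w, 0.0],
                        [board_h, board_w, 0.0],
                        [board_h, 0.0, 0.0]])
    return feats, corners

# Illustrative values: 8 cm squares, 6x8 inner corners, 0.6 m x 0.9 m board.
feats, corners = board_frame_points(V_b=(0.10, 0.10), square=0.08,
                                    rows=6, cols=8, board_h=0.6, board_w=0.9)
```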

Fig. 3. Representation of the planar chessboard. Slightly different from the general planar chessboard, the vector $V_{b}$ is known accurately.

In practical applications, there might exist 3D and 2D objects temporarily found in the surroundings of the LiDAR-camera system. An object can be used as an auxiliary calibration object if it satisfies two conditions: $(i)$ it falls within the field of view (FOV) of the camera; $(ii)$ its corner points can be separated from the background point cloud. Auxiliary calibration objects can provide extra 3D-2D point correspondences, thus increasing the stability of the calibration results. However, it is difficult to accurately detect the corner points of a 3D object in the low-resolution LiDAR point cloud [17]. Compared with 3D objects, the corner points of 2D objects in the sparse LiDAR point cloud are easier to detect automatically and precisely. Hence, in this paper, we only consider 2D planar auxiliary calibration objects. For instance, the auxiliary calibration object in Fig. 1(b) is the blue wooden table. Its two corner points, $A_{1}$ and $A_{2}$, provide two 3D-2D point correspondences. A further reason for using auxiliary calibration objects is that the corner points of the planar chessboards are distributed on a limited 2D plane, which might lead to inaccurate calibration results [15]. 3D calibration objects can solve this problem, because their corner points are distributed in 3D space. However, compared with a 2D calibration object, it is more difficult to manufacture a sufficiently accurate 3D calibration object. Fortunately, a combination of planar chessboards and auxiliary calibration objects can be regarded as a special type of 3D calibration object, because its corner points are distributed in 3D space. Besides, planar chessboards and auxiliary calibration objects are easy to obtain in practical applications. Therefore, the combination of planar chessboards and auxiliary calibration objects can achieve stable and robust calibration results.

3.2.2 Corner points extraction

It is essential to extract the corner points of the calibration objects before the calibration procedure. Finding the corner points by manual operations is time-consuming and error-prone [15]. Therefore, we propose an approach to obtain the corner points of the planar target point clouds automatically, as shown in Fig. 4(a). The LiDAR sensor generates a raw point cloud, as shown in Fig. 4(b), and the image captured by the camera is shown in Fig. 1(b). By manually setting several threshold values, a threshold cut algorithm is applied to obtain the point cloud that contains only the calibration objects, as shown in Fig. 4(c). Using the K-means clustering algorithm [29], we obtain the point cloud of each planar calibration object. Each planar point cloud is then projected onto a plane estimated by the RANSAC plane fit algorithm [30]; the results are shown in Fig. 4(d). A hull extraction algorithm obtains the hull of the point cloud, shown in Fig. 4(e). The Hough line detection algorithm is applied to find lines on the hulls, and the corner points are the intersections of the detected lines; the results are shown in Fig. 4(f). However, due to the sparsity of LiDAR point clouds, the corner points estimated via the pipeline in Fig. 4(a) are not accurate enough. Hence, we propose a geometric optimization framework to achieve accurate calibration results, which is discussed in the following.
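A hedged sketch of the pipeline in Fig. 4(a), using Open3D and scikit-learn as stand-in implementations of the named steps (the threshold box, cluster count, and RANSAC settings are illustrative assumptions rather than the paper's exact parameters):

```python
import numpy as np
import open3d as o3d
from sklearn.cluster import KMeans

def extract_planar_targets(points, n_targets=3):
    """Rough pipeline of Fig. 4(a): threshold cut -> K-means -> RANSAC plane fit -> projection."""
    # 1) Threshold cut: keep points inside a hand-set box around the calibration objects.
    mask = (points[:, 0] > 0.5) & (points[:, 0] < 6.0) & (np.abs(points[:, 1]) < 3.0)
    pts = points[mask]
    # 2) K-means clustering separates the individual planar objects.
    labels = KMeans(n_clusters=n_targets, n_init=10).fit_predict(pts)
    planes = []
    for k in range(n_targets):
        cluster = pts[labels == k]
        pcd = o3d.geometry.PointCloud(o3d.utility.Vector3dVector(cluster))
        # 3) RANSAC plane fit, then project the cluster onto the fitted plane a*x+b*y+c*z+d=0.
        (a, b, c, d), _ = pcd.segment_plane(distance_threshold=0.03,
                                            ransac_n=3, num_iterations=500)
        n = np.array([a, b, c])
        proj = cluster - ((cluster @ n + d) / (n @ n))[:, None] * n
        planes.append(proj)
    # 4) Hull extraction and Hough line detection (e.g., cv2.convexHull / cv2.HoughLines on a
    #    rasterized view of each plane) then give the board edges; corners are line intersections.
    return planes
```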

Fig. 4. Representation of corner points extraction. (a) The pipeline of corner points extraction. (b) Raw point cloud produced by LiDAR. (c) Point cloud after the threshold cut algorithm. (d) Point cloud after the RANSAC plane fit algorithm. (e) Hull of the projected point cloud. The left is the hull of planar chessboard 1, the middle is the hull of planar chessboard 2, and the right is the hull of auxiliary calibration object 1. (f) Lines found by the Hough line detection algorithm. Corner points are the intersections of lines. Points $No.i(i=1,\ldots ,4)$ are the corner points of planar chessboard 1. Points $No.i(i=5,\ldots ,8)$ are the corner points of planar chessboard 2. Points $No.i(i=9,10)$ are the corner points of auxiliary calibration object 1.

3.2.3 Geometric LiDAR-camera calibration

We consider that the information of 3D-2D and 3D-3D point correspondences is helpful for obtaining calibration results with high precision. With the proposed combination of calibration objects shown in Fig. 1(a), we present a geometric calibration method that considers both 3D-2D and 3D-3D point correspondences. The framework is shown in Fig. 5. The geometric calibration method is divided into three steps: 3D-2D optimization, 3D-3D optimization, and points merging. The proposed method needs at least four corner points, because the 3D-2D optimization scheme exploits the Effective PnP (EPnP) algorithm [24], which requires at least four corner points.

Fig. 5. Structure of the proposed geometric calibration method.

The scheme of 3D-2D optimization is discussed first. The coordinates of the corner points of the planar chessboards in $O_{d}-X_{d}Y_{d}Z_{d}$ are marked as $\{P_{i,p}^{d}\}_{i=1,\ldots ,N_{p}}$, and their corresponding pixel coordinates are $\{I_{i,p}\}_{i=1,\ldots ,N_{p}}$. The coordinates of the corner points of the auxiliary calibration objects in $O_{d}-X_{d}Y_{d}Z_{d}$ are marked as $\{P_{i,a}^{d}\}_{i=1,\ldots ,N_{a}}$, and their corresponding pixel coordinates are $\{I_{i,a}\}_{i=1,\ldots ,N_{a}}$. $P_{i,p}^{d}$ and $P_{i,a}^{d}$ are obtained via the pipeline in Fig. 4(a), while $I_{i,p}$ and $I_{i,a}$ are directly obtained from the image captured by the camera. 3D-2D point correspondences are established from $I_{i,p}$, $P_{i,p}^{d}$, and $I_{i,a}$, $P_{i,a}^{d}$, where the relation is described by Eq. (2). The representation of a 3D-2D point correspondence is shown in Fig. 2. Raw estimates of $\mathbf {R}$ and $T$, marked as $\mathbf {R}_{raw}$ and $T_{raw}$, are computed from these 3D-2D point correspondences by means of the EPnP algorithm [24]. After that, $\mathbf {R}_{raw}$ and $T_{raw}$ are refined by minimizing the following function [21]:

$$\min_{(\mathbf{R}_{raw},T_{raw})} \sum_{i=1}^{N_{p}} \Vert e_{i,p} \Vert _{2} + \sum_{i=1}^{N_{a}} \Vert e_{i,a} \Vert _{2}$$
where $e_{i,s}=\Vert I_{i,s,est} - I_{i,s} \Vert _{2}\,\,(s=p,a)$ denotes the reprojection error of a corner point of the planar chessboards or auxiliary calibration objects. $I_{i,p,est}$ and $I_{i,a,est}$ are computed via Eq. (2) using $\mathbf {K}$, $\mathbf {R}_{raw}$, $T_{raw}$, and $P_{i,p}^{d}$, $P_{i,a}^{d}$, respectively. In practical applications, due to LiDAR measurement error, some corner points are estimated with large errors, and their reprojection errors are also large. For stable calibration results, a filter strategy is applied to reject corner points with large reprojection errors. After that, $\mathbf {R}_{raw}$ and $T_{raw}$ are refined again by minimizing Eq. (4) using the remaining corner points.
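The 3D-2D optimization step can be sketched with OpenCV as follows (a minimal sketch, with `solvePnP`/`solvePnPRefineLM` standing in for the EPnP and BA steps described above; the rejection threshold `reproj_thresh` is an assumed value, not from the paper):

```python
import numpy as np
import cv2

def calibrate_3d2d(P_d, I_px, K, reproj_thresh=8.0):
    """EPnP initialization, LM refinement, and a simple large-residual filter."""
    P_d = P_d.astype(np.float64).reshape(-1, 1, 3)
    I_px = I_px.astype(np.float64).reshape(-1, 1, 2)
    dist = np.zeros(5)                          # lens distortion neglected, as in the paper
    # Raw estimate R_raw, T_raw via EPnP (needs >= 4 points).
    _, rvec, tvec = cv2.solvePnP(P_d, I_px, K, dist, flags=cv2.SOLVEPNP_EPNP)
    # Refine by minimizing the total reprojection error, Eq. (4).
    rvec, tvec = cv2.solvePnPRefineLM(P_d, I_px, K, dist, rvec, tvec)
    # Reject corner points whose reprojection error is large, then refine again.
    proj, _ = cv2.projectPoints(P_d, rvec, tvec, K, dist)
    err = np.linalg.norm(proj.reshape(-1, 2) - I_px.reshape(-1, 2), axis=1)
    keep = err < reproj_thresh
    rvec, tvec = cv2.solvePnPRefineLM(P_d[keep], I_px[keep], K, dist, rvec, tvec)
    R_raw, _ = cv2.Rodrigues(rvec)
    return R_raw, tvec.reshape(3)
```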

In the scheme of 3D-3D optimization, the camera extracts the feature points of the chessboard patterns, shown in Fig. 3. Only the planar chessboards can be used for 3D-3D optimization, because the auxiliary calibration objects do not have chessboard patterns. We first use one planar chessboard for illustration. As the pattern size is known, the coordinates of all feature points in $O_{p}-X_{p}Y_{p}Z_{p}$ are known. The rotation matrix $\mathbf {R}_{p}$ and translation vector $T_{p}$ of the camera in $O_{p}-X_{p}Y_{p}Z_{p}$ are then computed via the EPnP algorithm and BA optimization. After that, the coordinates of feature point $C_{1}$ in $O_{c}-X_{c}Y_{c}Z_{c}$, marked as $C_{1}^{c}$, are determined using $\mathbf {R}_{p}$ and $T_{p}$. As the bias vector $V_{b}$ and the size of the planar chessboard are known accurately, the coordinates of the corner points $\{P_{i,p}\}_{i=1,\ldots ,4}$ in $O_{c}-X_{c}Y_{c}Z_{c}$, marked as $\{P_{i,p}^{c}\}_{i=1,\ldots ,4}$, can be computed from $C_{1}^{c}$ with high precision. Finally, 3D-3D point correspondences, as shown in Eq. (3), can be established from the two point sets $\{P_{i,p}^{d}\}_{i=1,\ldots ,4}$ and $\{P_{i,p}^{c}\}_{i=1,\ldots ,4}$. The representation of the 3D-3D point correspondences of one planar chessboard in the ideal situation is shown in Fig. 6. Moreover, the general case in which $N$ planar chessboards are used for calibration is presented as Algorithm 1 in Appendix A.
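The per-board computation of $\{P_{i,p}^{c}\}$ can be sketched as follows, with OpenCV standing in for the pattern detection and the EPnP/BA pose estimation (the point-ordering assumption and the board-layout inputs are our own illustrative choices):

```python
import numpy as np
import cv2

def chessboard_corners_in_camera(img_gray, K, pattern_size, square, V_b, board_corners_p):
    """Estimate (R_p, T_p) of the board frame, then express the board corners in O_c."""
    found, img_pts = cv2.findChessboardCorners(img_gray, pattern_size)
    assert found, "chessboard pattern not detected"
    cols, rows = pattern_size
    # Feature points in O_p: C_1 sits at V_b, spacing equals the square size.
    # NOTE: this ordering must match the ordering returned by findChessboardCorners,
    # which is assumed here for brevity.
    obj_pts = np.array([[V_b[0] + r * square, V_b[1] + c * square, 0.0]
                        for r in range(rows) for c in range(cols)], dtype=np.float64)
    img_pts = img_pts.reshape(-1, 1, 2).astype(np.float64)
    dist = np.zeros(5)                       # lens distortion neglected, as in the paper
    _, rvec, tvec = cv2.solvePnP(obj_pts, img_pts, K, dist, flags=cv2.SOLVEPNP_EPNP)
    rvec, tvec = cv2.solvePnPRefineLM(obj_pts, img_pts, K, dist, rvec, tvec)
    R_p, _ = cv2.Rodrigues(rvec)
    # Board corner coordinates P^c_{i,p} = R_p P^p_{i,p} + T_p in the camera frame.
    return (R_p @ board_corners_p.T).T + tvec.reshape(3)
```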

Fig. 6. Representation of 3D-3D point correspondences for one planar chessboard in the ideal case. One planar chessboard provides four corner points. Red and blue arrows denote the coordinates of the corner points in the LiDAR and camera coordinate systems, respectively. 3D-3D point correspondences can be established from $P_{i,p}^{d}$ and $P_{i,p}^{c}$ via Eq. (3).

Using Algorithm 1, 3D-3D point correspondences between $\{P_{i,p}^{d}\}_{i=1,\ldots ,N_{p}}$ and $\{P_{i,p}^{c}\}_{i=1,\ldots ,N_{p}}$ are established. According to Eq. (3), the extrinsic parameters $\mathbf {R}$ and $T$ can be obtained by minimizing the following function [15]:

$$\min_{(\mathbf{R},T)} \sum_{i=1}^{N_{p}} \Vert P_{i,p}^{c} - (\mathbf{R}P_{i,p}^{d}+T) \Vert_{2}$$
where $\mathbf {R}$ and $T$ can be computed in closed form as $\mathbf {R}_{c}$ and $T_{c}$ via the method in [26]; the derivation is given in Appendix B. In the ideal situation, $\mathbf {R}_{raw}=\mathbf {R}_{c}$ and $T_{raw}=T_{c}$, which are both the true values of the extrinsic parameters of the LiDAR-camera system. However, in practical applications, due to the noisy LiDAR point cloud, $P^d_{i,p}$ is not accurate enough, and $\mathbf {R}_{raw}$, $T_{raw}$, $\mathbf {R}_{c}$ and $T_{c}$ are all disturbed by the measurement error of $P^d_{i,p}$. This means that the calibration results of the 3D-2D and 3D-3D optimization schemes are not accurate enough. To pursue more precise calibration results, we present the scheme of points merging.

The scheme of points merging is discussed now. Using the results $\mathbf {R}_{c}$ and $T_{c}$ obtained in the 3D-3D optimization scheme, we can back-reconstruct the coordinates of the corner points $\{P_{i}^{d}\}_{i=1,\ldots ,N_{p}}$, marked as $\{P_{i,back}^{d}\}_{i=1,\ldots ,N_{p}}$, in $O_{d}-X_{d}Y_{d}Z_{d}$ as:

$$P_{i,back}^{d} = \mathbf{R}_{c}^{T}(P_{i}^{c} - T_{c}) \,\, i=1,\ldots,N_{p}$$
Let $P^d_{i,true}$ be the coordinates of the true i-th corner point in the LiDAR coordinate system. As discussed above, due to the measurement noise in the LiDAR point cloud, $P^d_{i,p}$ and $P^d_{i,back}$ are not accurate enough. For more accurate calibration results, we attempt to estimate the optimal approximation of $P^d_{i,true}$ from $P^d_{i,p}$ and $P^d_{i,back}$. In this paper, we consider the first-order optimal approximation of $P^d_{i,true}$, and then $P^d_{i,opt}$ can be represented as:
$$P_{i,opt}^{d} = w_{i}P_{i,p}^{d} + (1-w_{i})P_{i,back}^{d} \,\, i=1,\ldots,N_{p}$$
where $w_{i}$ is a weight parameter to be estimated. The geometrical meaning of $P_{i,opt}^{d}$ is shown in Fig. 7, where we use the i-th corner point of a planar chessboard for illustration. The initial value of $w_{i}$ can be set as:
$$w_{i} = exp(-e_{i,p}^{2}/\sigma^{2})$$

Fig. 7. Representation of points merging for the i-th corner point of a planar chessboard. In the LiDAR coordinate system, green and red points denote the coordinates of the detected and back-reconstructed i-th corner points, respectively. The purple point $P^d_{i,true}$ denotes the coordinates of the true i-th corner point. The yellow point $P^d_{i,opt}$ lies on the line through $P^d_{i,back}$ and $P^d_{i,p}$. $P^d_{i,opt}$ is the closest point to $P^d_{i,true}$, which can be regarded as the first-order optimal approximation of $P^d_{i,true}$.

where $e_{i,p}$ is the reprojection error of the i-th corner point computed in the 3D-2D optimization scheme. The smaller $e_{i,p}$ is, the closer $w_{i}$ is to $1.0$, meaning that $P_{i,opt}^{d}$ is closer to $P_{i,p}^{d}$. $\sigma$ can be set to $0.1$ in practical applications. Finally, the parameters $\{w_{i}\}_{i=1,\ldots ,N_{p}}, \mathbf {R}, T$ can be optimized by minimizing the following function:

$$\min_{(\mathbf{R},T,\{w_{i}\}_{i=1,\ldots,N_{p}})} \sum_{i=1}^{N_{p}} \Vert I_{i,p,opt}(w_{i}) - I_{i,p} \Vert_{2}$$
where the initial values of $\mathbf {R}$ and $T$ are $\mathbf {R}_{raw}$ and $T_{raw}$, respectively. $I_{i,p,opt}(w_{i})$ is computed from $\mathbf {K}$, $P_{i,opt}^{d}$, $\mathbf {R}$ and $T$ via Eq. (2). Equation (9) fuses the information of the 3D-2D and 3D-3D point correspondences, so the extrinsic parameters of the LiDAR-camera system are estimated with minimal reprojection errors. Finally, accurate extrinsic parameters $\mathbf {R}$ and $T$ are computed by minimizing Eq. (9) using the Levenberg-Marquardt (LM) optimization algorithm [31].
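A possible implementation of the points merging scheme of Eqs. (7)-(9) is sketched below with SciPy's Levenberg-Marquardt solver (the rotation-vector parameterization and variable packing are our own choices; SciPy's 'lm' method also requires at least as many residuals as variables, i.e. roughly $N_{p}\geq 6$ here):

```python
import numpy as np
import cv2
from scipy.optimize import least_squares

def merge_and_refine(K, P_d, P_back, I_px, R_raw, T_raw, e_p, sigma=0.1):
    """Jointly optimize R, T and the per-point weights w_i by minimizing Eq. (9)."""
    w0 = np.exp(-(e_p ** 2) / sigma ** 2)            # Eq. (8): initial weights
    rvec0, _ = cv2.Rodrigues(R_raw)
    x0 = np.concatenate([rvec0.ravel(), T_raw.ravel(), w0])

    def residuals(x):
        rvec, T, w = x[:3], x[3:6], x[6:]
        R, _ = cv2.Rodrigues(rvec.reshape(3, 1))
        # Eq. (7): first-order merge of detected and back-reconstructed corners.
        P_opt = w[:, None] * P_d + (1.0 - w[:, None]) * P_back
        uvw = (K @ (R @ P_opt.T + T[:, None])).T     # Eq. (2)
        I_est = uvw[:, :2] / uvw[:, 2:3]
        return (I_est - I_px).ravel()                # stacked residuals of Eq. (9)

    sol = least_squares(residuals, x0, method='lm')  # Levenberg-Marquardt [31]
    R_opt, _ = cv2.Rodrigues(sol.x[:3].reshape(3, 1))
    return R_opt, sol.x[3:6], sol.x[6:]
```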

4. Experiments

4.1 Simulations

4.1.1 Experiment settings

Several simulated experiments are implemented to evaluate the performance of the proposed method. We set up a virtual LiDAR-camera system. The intrinsic parameters of the virtual camera and the extrinsic parameters of the virtual LiDAR-camera system are shown in Table 1. The rotation matrix $\textbf {R}$ of this LiDAR-camera system is set as a $3\times 3$ identity matrix. Let $N (N \geq 4)$ denote the number of corner points of all calibration objects. The lower bound of $N$ is set to four because the EPnP algorithm [24] in the 3D-2D optimization scheme requires at least four corner points. In the following experiments, we evaluate the accuracy of the calibration results by measuring the reprojection errors of all corner points. For point $No.i (i=1,2,\ldots ,N)$, the reprojection error $E_{i}$ is computed as:

$$E_{i} = \Vert I_{i} - I_{i,est} \Vert_{2}$$


Table 1. Parameters of the virtual LiDAR-camera system.

where $I_{i,est}$ is computed from $\mathbf {K}$, $P_{i}^{d}$, $\mathbf {R}$ and $T$ via Eq. (2). The mean reprojection error is computed as $\bar {E}_{p} = \frac {1}{N}\sum _{i=1}^{N}E_{i}$. Besides, in the simulations the ground truths of $\textbf {R}$ and $T$ are known exactly. Let $\xi \in se(3)$ represent the extrinsic parameters $\textbf {R}$, $T$. The norm-2 $se(3)$ error of $\xi$, marked as $E_{\xi }$, is also used for evaluation.
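The two evaluation metrics can be computed as in the sketch below (the $se(3)$ error here is our reading of the norm-2 error, using an axis-angle rotation logarithm and the raw translation difference):

```python
import numpy as np
import cv2

def mean_reprojection_error(K, R, T, P_d, I_px):
    """Mean of E_i in Eq. (10) over all corner points."""
    uvw = (K @ (R @ P_d.T + T[:, None])).T
    I_est = uvw[:, :2] / uvw[:, 2:3]
    return np.linalg.norm(I_est - I_px, axis=1).mean()

def se3_error(R_est, T_est, R_gt, T_gt):
    """Norm-2 error of the relative transform between estimate and ground truth."""
    dR = R_gt.T @ R_est                        # relative rotation
    dT = R_gt.T @ (T_est - T_gt)               # relative translation
    rvec, _ = cv2.Rodrigues(dR)                # axis-angle (rotation logarithm)
    # Translation taken directly rather than through the full se(3) logarithm,
    # which is adequate for the small errors considered here.
    return np.linalg.norm(np.concatenate([rvec.ravel(), dT.ravel()]))
```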

4.1.2 Performance with respect to noise level

This experiment investigates the performance with respect to measurement noise. In the calibration process, there are two sources of measurement noise: noise on the pixels and noise on the point cloud. Pixel noise comes from corner point detection in the image, and these errors affect the accuracy of the extrinsic parameters. According to the pinhole model in Eq. (2), noise on the point cloud can be converted into pixel noise. Hence, we add zero-mean Gaussian pixel noise with standard deviation $\delta$ to the corner points to verify the robustness of our method. The noise level (i.e., $\delta$) varies within $[0.1, 0.5]$ pixels. 500 independent trials are performed, and the mean errors are recorded as $\bar {E}_{p}$ and $E_{\xi }$, respectively. The results of $\bar {E}_{p}$ and $E_{\xi }$ are shown in Fig. 8 and Fig. 9, respectively. The curves show that the calibration errors increase as the noise level increases. This means that improving the accuracy of the coordinates of the corner points in the LiDAR coordinate system is essential for obtaining high-precision calibration results.

Fig. 8. Results of reprojection errors of the proposed method at different noise levels.

Fig. 9. Results of the norm-2 $se(3)$ error of the proposed method at different noise levels.

4.1.3 Performance with respect to corner points

This experiment investigates the performance with respect to the number of corner points. In practical applications, as the number of calibration objects is limited, the number of corner points is also limited, so exploring the relationship between the number of corner points and the calibration error is important. The results are shown in Fig. 8 and Fig. 9. When $N$ increases from 4 to 10, the calibration errors decrease noticeably, because a larger number of corner points provides more robustness to noise [22]. However, the change rate of the mean reprojection error is not obvious when $N\geq 14$, because the measurement errors cannot be eliminated completely. Due to the measurement noise, using a large number of calibration objects may still not yield extremely accurate calibration results. So, it is wise to use an appropriate number of calibration objects to get relatively accurate results.

4.2 Real data experiments

4.2.1 Experiment settings

To verify the performance of our method, we set up a LiDAR-camera system as shown in Fig. 10. It consists of a Velodyne-64 LiDAR and a Kinect v2 camera with a resolution of $1920 \, pixels\;\times\;1080 \, pixels$. The intrinsic parameters of the LiDAR have been calibrated by the manufacturer in advance. The intrinsic matrix $\mathbf {K}$ of the camera is calibrated using chessboard patterns via the classical method [22], and is shown in Table 2. In the experiment, the proposed combination of calibration objects, containing two planar chessboards and one auxiliary calibration object, is used to calibrate the LiDAR-camera system, as shown in Fig. 1(b). The planar chessboards are manually placed in the FOV of the camera, and their feature points are extracted with sub-pixel accuracy. The auxiliary calibration object is temporarily found in the surroundings. We obtain the corner points of these calibration objects by means of the approach discussed in Sec.3.2.2, and mark these points as $No.i \,\, (i=1,\ldots ,10)$. For point $No.i$, its coordinates in $O_{d}-X_{d}Y_{d}Z_{d}$, denoted $P_{i}^{d}$, are obtained via the method in Sec.3.2.2, and its coordinates in the image coordinate system, denoted $I_{i}$, are extracted from Fig. 1(b). In the following experiments, we still evaluate the accuracy of the calibration results by measuring the reprojection errors, as shown in Eq. (10).

Fig. 10. LiDAR-camera system in experiments. (a) Velodyne-64 LiDAR. (b) Kinect v2 camera.


Table 2. Intrinsic parameters of the camera in LiDAR-camera system.

4.2.2 RANSAC planar fit versus reprojection error performance

This experiment investigates the performance with respect to the RANSAC plane fit. Specifically, we compare calibration with and without the RANSAC plane fit algorithm. First, the corner points of the calibration objects are estimated by the pipeline shown in Fig. 4(a) without the plane fit step, and the extrinsic parameters are estimated via the EPnP algorithm and BA optimization. This raw method is marked as $3D2D$. Then, the corner points of the calibration objects are estimated by the whole pipeline shown in Fig. 4(a), and the extrinsic parameters are again estimated via the EPnP algorithm and BA optimization. This method is marked as $3D2DFit$. The reprojection errors of both methods are shown in Table 3. The reprojection error of $P_{8}$ computed via $3D2D$ is larger than $90.0$ pixels, while $E_{8}$ computed via $3D2DFit$ is smaller than $30.0$ pixels. Compared with $3D2D$, the reprojection errors of most points are decreased in $3D2DFit$, except $P_{4}$. On average, the reprojection errors of $3D2DFit$ are $52.57\%$ lower than those of $3D2D$. Although the planar chessboards and auxiliary calibration objects are both planar, their raw point clouds are in fact not planar, as shown in Fig. 11. Processed by the RANSAC plane fit algorithm, the point clouds of these calibration objects are smoother and closer to the true point clouds, which increases the accuracy of the corner point positions and eventually leads to more accurate calibration results. Therefore, it is recommended to denoise the planar point clouds with the RANSAC plane fit algorithm.

Fig. 11. Three-view drawing of raw point clouds of the first planar chessboard. (a) Frontal plane of this calibration object. (b) Profile plane of this calibration object. (c) Top view of this calibration object. The raw point clouds of the planar object are obviously not planar in practical applications.


Table 3. Reprojection errors of the different LiDAR-camera calibration methods discussed in Sec.4.2, Sec.4.3 and Sec.4.4 (Unit: Pixels). $P_{i}$ denotes the $i$-th corner point of the calibration objects. The term $none$ means the point has no reprojection error. $St.Dev$ denotes the standard deviation of the reprojection errors. $Gain$ denotes the current method's accuracy improvement rate relative to method $3D2D$, calculated from their reprojection errors.

4.2.3 Point filter strategy versus reprojection error performance

This experiment investigates the performance with respect to the point filter strategy. From Table 3, the reprojection errors of $P_{4}$, $P_{7}$ and $P_{8}$ in $3D2DFit$ are larger than the mean reprojection error of this method. The reason is that the coordinates of $P_{4}^{d}$, $P_{7}^{d}$ and $P_{8}^{d}$ in $O_{d}-X_{d}Y_{d}Z_{d}$ are not estimated accurately. As shown in Fig. 12, due to the sparsity of the LiDAR point clouds, the distances between these measured points and the corresponding true corner points are relatively large. For stable calibration results, these badly estimated points are rejected, and the extrinsic parameters are refined again via the EPnP algorithm and BA optimization. This method is named $3D2DFilter$, which is essentially $3D2DFit$ plus the point filter strategy; $3D2DFilter$ thus represents the scheme of 3D-2D optimization. The improvement brought by $3D2DFilter$ is shown in Table 3. Compared with $3D2DFit$, the reprojection errors of $71.4\%$ of the corner points of the calibration objects are decreased in $3D2DFilter$. Also, the mean reprojection error of $3D2DFilter$ is $66.96\%$ lower than that of $3D2DFit$. However, the reprojection errors of $P_5$ and $P_{10}$ in $3D2DFilter$ are larger than the corresponding reprojection errors in $3D2DFit$. Due to the slight lens distortion of the camera, the pixel coordinates of several corner points far from the image center, such as $P_5$, $P_6$, $P_9$ and $P_{10}$, are not accurate enough. Although we decrease the sum of the reprojection errors of all corner points via Eq. (4), the optimized result reaches a compromise between the accurate and inaccurate corner points. So, several corner points, such as $P_5$ and $P_{10}$, have larger reprojection errors in $3D2DFilter$ than in $3D2DFit$. Hence, the strategy of filtering bad corner points can reduce the mean reprojection error, but by itself it fails to achieve more robust and accurate calibration results.

Fig. 12. Points with large reprojection errors. (a) In the first planar chessboard, $P_{4}$ has a large reprojection error. (b) In the second planar chessboard, $P_{7}$ and $P_{8}$ have large reprojection errors. The distances between these measured points and the corresponding true corner points are large.

4.2.4 Optimization framework versus reprojection error performance

This experiment investigates the performance with respect to the proposed optimization framework. Although $3D2DFilter$ achieves more stable calibration results than the previously discussed methods, it is difficult for $3D2DFilter$ to obtain more accurate results because the information of the rejected corner points, such as $P_{4}$, $P_{7}$ and $P_{8}$, cannot be used. It is noted that the proposed planar chessboards can provide 3D-3D point correspondences, so the extrinsic parameters of the LiDAR-camera system can be computed via Eq. (5). This method is named $3D3D$. In this experiment, the accuracy of the coordinates of the corner points in $O_{d}-X_{d}Y_{d}Z_{d}$ is improved by fusing the size information of the planar chessboards and auxiliary calibration objects, leading to stable calibration results. In Table 3, the reprojection errors of $3D3D$ are $39.92\%$ lower than those of $3D2DFilter$. More importantly, $3D3D$ refines $P_{4}^{d}$, $P_{7}^{d}$ and $P_{8}^{d}$, which opens the possibility of achieving more accurate calibration results. After that, we evaluate the performance of the points merging scheme. The method is named $MergeSimple$ if the weights of the corner points are not optimized but kept at the initial values given by Eq. (8). The mean reprojection error of $MergeSimple$ is $15.68\%$ lower than that of $3D3D$, because the merged information is robust to LiDAR sensor noise. If we optimize the weights of all corner points, the method is named $MergeOpt$; $MergeOpt$ is the proposed calibration method. As can be seen in Table 3, the reprojection errors of $90\%$ of the corner points of the calibration objects in $MergeOpt$ are less than 2.0 pixels, and the mean reprojection error of $MergeOpt$ is the smallest among all discussed methods. The result of $MergeOpt$ is also shown in Fig. 13. It can be found that, using the proposed optimization framework, the distances between the estimated corner points and the corresponding true corner points are decreased. Therefore, calibration results are more stable and accurate when our optimization framework is applied.

Fig. 13. Results of our optimization framework. White points are the hull of the planar chessboards. Red points are the back-reconstructed corner points of the planar chessboards. (a)-(b) Front and vertical views of the calibration results without using our optimization framework. (c)-(d) Front and vertical views of the calibration results using our optimization framework. Using our optimization framework, the distances between the estimated corner points and the corresponding true corner points are decreased.

4.2.5 Numbers of calibration objects versus reprojection error performance

This experiment investigates the performance with respect to the number of calibration objects. We have two planar chessboards and one auxiliary calibration object in the experimental configuration. The mean error of each combination of calibration objects is shown in Table 4. It can be found that the accuracy of calibration increases as the number of calibration objects increases, which verifies the conclusions in Sec. 4.1.3. Comparing the real experimental results in Table 4 with the theoretical results (noise level $\sigma$=0.5) of Sec. 4.1.3, the curves relating the number of corner points to the mean reprojection error are shown in Fig. 14. Due to measurement error, the errors of the real experiment are larger than the theoretical errors. For the theoretical curve, the change rate (CR) of the reprojection errors is less than $10\%$ when the number of corner points is larger than 12; besides, the CR of the reprojection errors decreases as the number of corner points increases. According to the trend of the theoretical error curve, it can be concluded that the CR of the reprojection errors in the real experimental curve is less than $10\%$ when the number of corner points is larger than 10. Therefore, in practical applications, using a combination of calibration objects that provides at least 10 corner points is helpful for obtaining stable and relatively accurate calibration results. So, it is recommended to use at least two planar chessboards and at least one auxiliary calibration object for LiDAR-camera system calibration.

Fig. 14. Relationship between the number of corner points and the reprojection error in LiDAR-camera calibration. The blue curve is the theoretical result ($\sigma =0.5$). The black dotted curve is the real experimental result. $CR$ denotes the change rate of reprojection errors.


Table 4. Reprojection errors (Unit: Pixels) of different combinations of calibration objects. The term $none$ means that the current combination of calibration objects fails to compute the extrinsic parameters of the LiDAR-camera system.

4.2.6 Model comparisons versus reprojection error performance

This experiment compares the performance of different LiDAR-camera calibration methods. Our method is compared with the method of Park et al. [10] and the method of Dhall et al. [15]. The Park method [10] uses the 3D-2D point correspondences provided by the planar chessboards and auxiliary calibration objects to estimate the intrinsic parameters of the camera and the extrinsic parameters of the LiDAR-camera system. The Dhall method [15] uses the 3D-3D point correspondences of the proposed planar chessboards to calibrate the extrinsic parameters of the LiDAR-camera system. In their method, a stereo camera is used to provide the coordinates of the corner points in the camera coordinate system. Instead of a stereo camera, the camera of the LiDAR-camera system in our experimental configuration is a depth camera, which can also provide 3D information. The reprojection errors of these methods are shown in Fig. 15. The reprojection errors of all corner points in our method are smaller than those of the other two methods, because the Park method needs to estimate extra intrinsic parameters that are already known in advance, and the Dhall method only uses 3D-3D point correspondences. The mean reprojection errors of our method and the other two methods are 1.392, 2.078 and 1.902 pixels, respectively. Our method fuses 3D-2D and 3D-3D point correspondences, which achieves accurate calibration results.

Fig. 15. Reprojection errors of our and compared methods.

4.3 Verification on depth map

Computing depth maps from raw LiDAR point clouds is a natural application of LiDAR-camera systems. After estimating the extrinsic parameters using the proposed method, we can obtain the depth map from the LiDAR point cloud using the pinhole model, as shown in Eq. (2). During the camera projection, points that are outside the FOV of the camera are filtered out. The resulting depth maps are shown in Fig. 16, where the color changes from green to red as the depth increases. Two red boxes ($i$=1,2) are shown in the RGB image in Fig. 16(a). Without using any calibration method, the depth map is computed via the initial extrinsic parameters, as shown in Fig. 16(c). The depth map computed via the extrinsic parameters calibrated by our method is shown in Fig. 16(b). Compared to the depth map without calibration, the depth map obtained with our method is highly aligned with the RGB image. With reference to the RGB image, the pixel coordinates of boxes $1$ and $2$ in Fig. 16(c) show significant offsets, with a mean pixel offset rate of nearly 40$\%$. For instance, $P$ is a corner point in the RGB image, and $P_{1}$, $P_{2}$ are the corresponding points in the two depth maps. It can be found that $P$ is highly aligned with $P_{1}$, while there is an offset between $P$ and $P_{2}$. After LiDAR-camera calibration using our method, there are nearly no pixel offsets. Besides, it can be seen that most of the edge information remains in the depth map computed via our method. These results demonstrate that the proposed calibration method contributes to advanced applications of LiDAR-camera systems.
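A minimal sketch of this projection (the image size matches the Kinect v2 resolution; the z-buffering strategy and the near-plane cutoff are our own choices, not specified in the paper):

```python
import numpy as np

def lidar_to_depth_map(points_d, K, R, T, width=1920, height=1080):
    """Project LiDAR points into the image plane via Eq. (2) and keep the nearest depth."""
    P_c = (R @ points_d.T + T[:, None]).T          # LiDAR frame -> camera frame
    P_c = P_c[P_c[:, 2] > 0.1]                     # drop points behind / very close to the camera
    uvw = (K @ P_c.T).T
    u = np.round(uvw[:, 0] / uvw[:, 2]).astype(int)
    v = np.round(uvw[:, 1] / uvw[:, 2]).astype(int)
    z = P_c[:, 2]
    depth = np.full((height, width), np.inf)
    inside = (u >= 0) & (u < width) & (v >= 0) & (v < height)   # filter points outside the FOV
    for ui, vi, zi in zip(u[inside], v[inside], z[inside]):
        depth[vi, ui] = min(depth[vi, ui], zi)                  # z-buffer: keep the nearest return
    return depth
```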

Fig. 16. Results of the depth map using the proposed calibration method. (a) RGB image captured by the camera in the LiDAR-camera system. (b) Result of the depth map using our calibration method. (c) Result of the depth map without using any calibration method. $P$, $P_{1}$, and $P_{2}$ are corresponding points in the three images.

4.4 Limitations

There are several limitations in our work. First, the proposed method requires at least four corner points for calibration, because the EPnP algorithm [24] requires at least four corner points; during the calibration procedure, the number of corner points should therefore be no smaller than four. Second, we do not consider the optical distortion of the camera. We can calibrate the lens distortion coefficients of the camera in advance and undistort the image via the lens distortion model [22]. Third, we suppose that the auxiliary calibration objects are only 2D planar objects. However, in practical applications, there might exist 3D objects, such as boxes, which also provide 3D-2D point correspondences. We can automatically detect the corner points of 3D auxiliary calibration objects via a geometry-based optimization scheme [17].

5. Conclusion

In this paper, we proposed a novel geometric method for estimating the extrinsic parameters of the LiDAR-camera system. In this work, we first design a new combination of calibration objects, containing planar chessboards and auxiliary calibration objects, which can provide 3D-2D and 3D-3D point correspondences. Auxiliary calibration objects can be temporarily found in the surroundings. After that, a novel geometric optimization framework is proposed to merge the information of all 3D-2D and 3D-3D point correspondences. Simulations, real data experiments, and depth map applications demonstrate that our method is stable, accurate, and outperforms other state-of-the-art methods. Our method can benefit advanced applications based on the LiDAR-camera system. To achieve more precise calibration results, we will further study calibration schemes using both 2D and 3D auxiliary calibration objects in future work.

Appendices

A. Establish 3D-3D point correspondences for $N$ planar chessboards

The pseudo-code for establishing 3D-3D point correspondences for $N$ planar chessboards is shown in Algorithm 1.

[Algorithm 1 (figure): establishing 3D-3D point correspondences for $N$ planar chessboards.]
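Since the algorithm itself is only available as a figure, the following sketch restates the presumable per-board loop in Python, reusing the single-board routine sketched in Sec. 3.2.3 (the `boards` container and its field names are hypothetical names introduced for illustration, not the authors' listing):

```python
import numpy as np

def build_3d3d_correspondences(boards, K):
    """Collect the pairs (P^d_{i,p}, P^c_{i,p}) over N planar chessboards."""
    P_d_all, P_c_all = [], []
    for b in boards:
        # Corners of board b in the camera frame, via the pattern pose (EPnP + refinement).
        P_c = chessboard_corners_in_camera(b.img_gray, K, b.pattern_size,
                                           b.square, b.V_b, b.board_corners_p)
        P_d_all.append(b.corners_lidar)   # corners extracted from the LiDAR point cloud
        P_c_all.append(P_c)
    # Stacked point sets used in Eq. (5) and the closed-form solution of Appendix B.
    return np.vstack(P_d_all), np.vstack(P_c_all)
```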

B. Closed-form solution in Eq. (5)

$\mathbf {R}$ and $T$ in Eq. (5) can be computed via the method in [26]. The derivation is as follows. Let $\bar {P}_{p}^{c}$ and $\bar {P}_{p}^{d}$ denote $\frac {1}{N_{p}}\sum _{i=1}^{N_{p}} P_{p,i}^{c}$ and $\frac {1}{N_{p}}\sum _{i=1}^{N_{p}} P_{p,i}^{d}$, respectively. Then $Q_{p,i}^{c}$ and $Q_{p,i}^{d}$ can be computed as:

$$Q_{p,i}^{c} = P_{p,i}^{c} - \bar{P}_{p}^{c}; \,\,\,\, Q_{p,i}^{d} = P_{p,i}^{d} - \bar{P}_{p}^{d}$$
Let the $3\times 3$ matrix $\mathbf {W} = \sum _{i=1}^{N_{p}} Q_{p,i}^{c}Q_{p,i}^{dT}$. The matrix $\mathbf {W}$ can be decomposed as $\mathbf {U}\mathbf {\Sigma }\mathbf {V}^{T}$ via the SVD. Let $det(\mathbf {A})$ denote the determinant of a matrix $\mathbf {A}$. If $det(\mathbf {U}\mathbf {V})=1$, $\mathbf {R}$ is equal to $\mathbf {U}\mathbf {V}^{T}$. If $det(\mathbf {U}\mathbf {V})=-1$, $\mathbf {R}$ is equal to $\mathbf {U}Diag(1,1,-1)\mathbf {V}^{T}$, where $Diag(a,b,c)$ is a diagonal matrix with entries $a$, $b$ and $c$. Finally, $T$ can be computed as:
$$T = \bar{P}_{p}^{c} - \mathbf{R}\bar{P}_{p}^{d}$$
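The closed-form step can be written as in the sketch below (a standard SVD-based absolute-orientation solver following the derivation above; variable names are our own):

```python
import numpy as np

def closed_form_rt(P_d, P_c):
    """Closed-form solution of Eq. (5): find R, T with P_c ≈ R P_d + T (rows are points)."""
    mean_d, mean_c = P_d.mean(axis=0), P_c.mean(axis=0)
    Q_d, Q_c = P_d - mean_d, P_c - mean_c                      # centered point sets
    W = Q_c.T @ Q_d                                            # W = sum_i Q^c_i (Q^d_i)^T
    U, S, Vt = np.linalg.svd(W)
    D = np.diag([1.0, 1.0, np.sign(np.linalg.det(U @ Vt))])    # guard against reflections
    R = U @ D @ Vt
    T = mean_c - R @ mean_d
    return R, T
```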

Funding

Equipment pre-research project (305050203, 41415020202, 41415020404); National Natural Science Foundation of China (U1913602).

Acknowledgments

The authors thank Siying Ke for providing many print suggestions. The authors also appreciate anonymous reviewers for providing valuable and inspiring comments and suggestions.

Disclosures

The authors declare no conflicts of interest.

References

1. F. Abedi, Y. Yang, and Q. Liu, “Group geometric calibration and rectification for circular multi-camera imaging system,” Opt. Express 26(23), 30596–30613 (2018). [CrossRef]  

2. L. Lilin, P. Zhiyong, and T. Dongdong, “Super multi-view three-dimensional display technique for portable devices,” Opt. Express 24(5), 4421–4430 (2016). [CrossRef]  

3. Z. Cai, X. Liu, X. Peng, and B. Z. Gao, “Ray calibration and phase mapping for structured-light-field 3D reconstruction,” Opt. Express 26(6), 7598–7613 (2018). [CrossRef]  

4. Z. Cai, X. Liu, X. Peng, Y. Yin, A. Li, J. Wu, and B. Z. Gao, “Ray calibration and phase mapping for structured-light-field 3D reconstruction,” Opt. Express 24(18), 20324–20334 (2016). [CrossRef]  

5. Y. Cui, F. Zhou, Y. Wang, L. Liu, and H. Gao, “Precise calibration of binocular vision system used for vision measurement,” Opt. Express 22(8), 9134–9149 (2014). [CrossRef]  

6. M. Wang, Y. Cheng, B. Yang, S. Jin, and H. Su, “On-orbit calibration approach for optical navigation camera in deep space exploration,” Opt. Express 24(5), 5536–5554 (2016). [CrossRef]  

7. H. Di, H. Hua, Y. Cui, D. Hua, B. Li, and Y. Song, “Correction technology of a polarization lidar with a complex optical system,” J. Opt. Soc. Am. A 33(8), 1488–1494 (2016). [CrossRef]  

8. H. Cho, Y.-W. Seo, B. V. K. V. Kumar, and R. Rajkumar, “A multi-sensor fusion system for moving object detection and tracking in urban driving environments,” in Proceedings of IEEE International Conference on Robotics and Automation, (IEEE, 2014), pp. 1836–1843.

9. P. Moghadam, W. S. Wijesoma, and D. Feng, “Improving path planning and mapping based on stereo vision and lidar,” in Proceedings of International Conference on Control, Automation, Robtics and Vision, (Academic, 2008), pp. 384–389.

10. Y. Park, S. Yun, C. Won, K. Cho, K. Um, and S. Sim, “Calibration between color camera and 3d lidar instruments with a polygonal planar board,” Sensors 14(3), 5333–5353 (2014). [CrossRef]  

11. Z. Zhang, “Camera calibration with one-dimensional objects,” IEEE Trans. Pattern Anal. Machine Intell. 26(7), 892–899 (2004). [CrossRef]  

12. F. Wu, Z. Hu, and H. Zhu, “Camera calibration with moving one-dimensional objects,” Pattern Recognit. 38(5), 755–765 (2005). [CrossRef]  

13. F. M. Mirzaei, D. G. Kottas, and S. I. Roumeliotis, “3d lidar-camera intrinsic and extrinsic calibration: Identifiability and analytical least-squares-based initialization,” Int. J. Robotics Res. 31(4), 452–467 (2012). [CrossRef]  

14. A. Geiger, F. Moosmann, O. Car, and B. Schuster, “Automatic camera and range sensor calibration using a single shot,” in Proceedings of IEEE International Conference on Robotics and Automation, (IEEE, 2012), pp. 3936–3943.

15. A. Dhall, K. Chelani, V. Radhakrishnan, and K. M. Krishna, “Lidar-camera calibration using 3d-3d point correspondences,” in arXiv:1705.09785, (2017), pp. 1–19.

16. C. Guindel, J. Beltrán, D. Martin, and F. Garcia, “Automatic extrinsic calibration for lidar-stereo vehicle sensor setups,” (IEEE, 2017), pp. 1–6.

17. Z. Pusztai and L. Hajder, “Accurate calibration of lidar-camera systems using ordinary boxes,” in Proceedings of IEEE International Conference on Computer Vision Workshops, (IEEE, 2017), pp. 394–402.

18. I. Ryoichi, O. Takeshi, and I. Katsushi, “Lidar and camera calibration using motion estimated by sensor fusion odometry,” in Proceedings of International Conference on Intelligence Robots and Systems, (IEEE, 2018), pp. 7342–7349.

19. Y. Ge, C. R. Maurer, and J. M. Fitzpatrick, “Surface-based 3-d image registration using the iterative closest point algorithm with a closest point transform,” Proc. SPIE 2710, 358–367 (1996). [CrossRef]  

20. G. Pandey, J. R. Mcbride, S. Savarese, and R. M. Eustice, “Automatic targetless extrinsic calibration of a 3d lidar and camera by maximizing mutual information,” in Proceedings of the Twenty-Sixth Conference on Artificial Intelligence, (Academic, 2012), pp. 1–7.

21. B. Triggs, P. F. Mclauchlan, R. I. Hartley, and A. W. Fitzgibbon, “Bundle adjustment - a modern synthesis,” in Proceedings of Workshop on Vision Algorithms, (Academic, 2000), pp. 298–372.

22. Z. Zhang, “A flexible new technique for camera calibration,” IEEE Trans. Pattern Anal. Machine Intell. 22(11), 1330–1334 (2000). [CrossRef]  

23. G. Iyer, R. K. Ram, J. K. Murthy, and K. M. Krishna, “Calibnet: Self-supervised extrinsic calibration using 3d spatial transformer networks,” in Proceedings of International Conference on Intelligence Robots and Systems, (IEEE, 2018), pp. 1–8.

24. V. Lepetit, F. Moreno-Noguer, and P. Fua, “EPnP: An accurate O(n) solution to the PnP problem,” Int. J. Comput. Vis. 81(2), 155–166 (2009). [CrossRef]  

25. M. Hassanein, A. Moussa, and N. El-Sheimy, “A new automatic system calibration of multi-cameras and lidar sensors,” Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci. XLI-B1, 589–594 (2016). [CrossRef]  

26. B. K. P. Horn, H. M. Hilden, and S. Negahdaripour, “Closed-form solution of absolute orientation using orthonormal matrices,” J. Opt. Soc. Am. A 5(7), 1127–1135 (1988). [CrossRef]  

27. Y. Li, J. Zhang, and W. Hu, “Method for pan-tilt camera calibration using single control point,” J. Opt. Soc. Am. A 32(1), 156–163 (2015). [CrossRef]  

28. R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision, 2nd ed. (Cambridge University Press, 2006).

29. Y. Zhao and G. Karypis, “Empirical and theoretical comparisons of selected criterion functions for document clustering,” Mach. Learn. 55(3), 311–331 (2004). [CrossRef]  

30. B. Oehler, J. Stueckler, J. Welle, D. Schulz, and S. Behnke, “Efficient multi-resolution plane segmentation of 3d point clouds,” in Proceedings of International Conference on Intelligent Robotics and Applications, (Academic, 2011), pp. 145–156.

31. J. J. Moré, “The Levenberg-Marquardt algorithm: implementation and theory,” Numer. Analysis 630, 105–116 (1977). [CrossRef]  
