Automotive augmented reality 3D head-up display based on light-field rendering with eye-tracking

Jin-ho Lee; Igor Yanusik; Yoonsun Choi; Byongmin Kang; Chansol Hwang; Juyong Park; Dongkyung Nam; Sunghoon Hong

doi:10.1364/OE.404318

1. Introduction

Recent years have witnessed rapid developments in augmented reality (AR) technologies in terms of both software and hardware, particularly in the case of head-mounted displays and mobile devices. In the automotive sector, AR technologies can improve the driver comfort and safety and also act as an infotainment platform when applied in the form of next-generation head-up displays (HUDs).

Cars today are typically equipped with HUDs with a limited horizontal field of view (FoV) of up to 5°. In these systems, the virtual image is implemented as a conventional 2D projection on the car windshield with a virtual image distance of up to 3 m. Even when the size of the virtual image is enlarged, the typically represented information for such HUDs involves the replication of instrument clusters with limited navigation extension. A recent key improvement in the automotive sector is the development of HUDs with AR support, which affords the ability to merge driving-related information with the actual (“real”) scene. Such information can be obtained from various sensors to aid the driver in avoiding collisions or receive additional information for driver assistance.

In this regard, many studies have focused on implementing AR HUDs from the perspectives of different design criteria. Thus, the FoV and image brightness can be improved by using microelectromechanical-systems-based laser scanning [1], digital light processing [2], or 2D computer-generated holography (CGH) technologies [3]. AR HUD systems can also be made more compact by applying waveguide [4], holographic optical element [5], or metasurface [6] technologies instead of mirror-based projection. However, the proposed solutions do not address improvements of AR content matching with real scene to avoid visual conflicts. One effective approach to address this issue involves implementing multi-depth images while merging a set of virtual images with the real scene. Several such methods have been implemented to address the problem [7–9]; however, the number of image planes is currently limited and therefore insufficient for natural overlap. Additionally, flickering can also occur when mechanical steering is adopted. These disadvantages may be overcome by using varifocal optical elements as described in Ref. [10–12]; however, it is difficult to practically apply these components considering factors such as the required size/dimensions, compatibility with curved windshields, and large eyebox size. In future, CGH-based fully dynamic holographic 3D displays [13] could be an ultimate solution to address all the cues of natural multi-depth image perception. However, CGH application is complex both in terms of hardware and rendering and further requires additional computational time; thus, this technology is currently impractical for high-speed automotive solution.

One approach that may bridge the gap between conventional systems and holographic 3D displays is the integration of an autostereoscopic 3D display [14] into the HUD. In essence, stereoscopy works based on binocular disparity: the illusion of depth can be created from two 2D images whose features are slightly offset from each other, and the brain merges these two images into a single 3D perspective in an acceptable depth range [15]. Autostereoscopic 3D displays allow for the formation of several viewing positions at which the viewer can observe valid stereo-pair images. As a drawback, resolution of a perceived 3D image decreases depending on the number of viewing points. However, a method to improve pixel-resources utilization for a large number of viewing points was proposed by implementing a light-field model [16]. When combined with an eye-tracking, direct light-field rendering for valid stereo-pairs is possible without a significant loss of the 3D image resolution [17]. Although finally exploiting binocular disparity, this type of display can be considered as a light-field display because stereo-pair images are assigned to light rays uniformly distributed according to the light-field model. In this regard, Table 1 compares the currently available technologies that can be used as potential car solutions.

Table 1. Head-up display (HUD) technology comparison.

View Table | View all tables in this article

Against this backdrop, we propose the adoption of stereoscopy-based 3D images with continuous depth by integrating a light-field display into the HUD optics. With our approach, such virtual images may be observed with low crosstalk when using matching HUD optics with viewing zones formed by the light-field display [18]. First, we derive certain equations to replace the complex optics by an effective lens model, and subsequently, we translate the driver’s actual eye position to the corresponding virtual one. Based on the results of this substitution, we proceed with our lenticular lens design and the corresponding light-field rendering. Next, we use simulations with ray-tracing software and conduct experiments with a developed prototype to demonstrate how closely the simulations and experimental results match in terms of the crosstalk level and low-crosstalk margin based on our initial assumptions.

2. Configuration and operating principle

Figure 1 shows the schematic of our AR 3D HUD with an eye-tracking camera. The camera acquires information on the eye position and tracks it continuously. In front of the driver, the HUD generates a virtual image through the windshield by means of mirror projection optics according to the perceived eye position. The 3D depth is adjusted within a suitable range based on environmental information from external sensors. Therefore, virtual objects are seamlessly integrated into the real world and made visible at various selected depths.

Fig. 1. Schematics of (a) light-field virtual image projection and (b) depth creation of augmented reality (AR) 3D head-up display (HUD) images with eye-tracking camera.

Name	Description
Visualization 1	This is the demonstration video of AR 3D HUD.
Visualization 2	This is the demonstration video of AR 3D HUD.

Type/Parameter	Conventional	Multi-plane	Holographic	Light-field
AR depth matching	No (Single depth)	Discrete depth	Real depth	Continious depth
Algorithm complexity	Low	Medium	High	Medium
Hardware complexity	Low	High	Medium	Low

Parameter	Design value
FoV	10° (H) × 5° (V)
Virtual image distance	7000 mm
Look-down angle	-2.5°
Look-over angle	0.7°
Eyebox size	140 mm × 80 mm
Optical resolution, mean (range)	0.6 arcmin (0.25–0.9 arcmin)
Optical distortion, mean (range)	0.6% (0.3–1.0%)
Horizontal BM as convergence or divergence, mean (range)	0.4 mrad (0.1–0.6 mrad)
Vertical BM as dipvergence, mean (range)	0.2 mrad (0.1–0.6 mrad)
Brightness drop over eyebox, mean (range)	3.7% (1.9–5.5%)

Parameter	Value
Magnification of virtual eyebox (M_VEB)	-1.28
Horizontal off-axis shift of real eyebox center (x_EB)	-7.396 mm
Vertical off-axis shift of real eyebox center (y_EB)	14.501 mm
Horizontal off-axis shift of display center (x_D)	5.683 mm
Vertical off-axis shift of display center (y_D)	-21.178 mm

Abstract

1. Introduction

2. Configuration and operating principle

3. System design and methods

3.1 Projection optics

3.2 Lenticular lens

3.3 Light-field rendering

3.4 Ray-tracing simulation

4. Experiment

5. Conclusion

Acknowledgments

Disclosures

References

Supplementary Material (2)

Cited By

Figures (18)

Tables (3)

Equations (18)

Optics Express