Abstract
We propose a near-eye light field display that reconstructs a light field at high synthesis speed by exploiting multi-layer light field display technology and human visual features. The resolution distribution of the reconstructed light field is set to match human visual acuity, which decreases with increasing visual eccentricity. We compress the light field information by using different sampling rates in different visual eccentricity regions, and we propose a new optimization method for the compressed light field that dramatically reduces the amount of computation. The results demonstrate that the acceleration of the proposed scheme is substantial and grows as the spatial resolution increases. The synthesis scheme is verified, and its key aspects are analyzed, by simulation and an experimental prototype.
© 2017 Optical Society of America
1. Introduction
Virtual reality (VR) displays block the viewer's normal sight of the real world and present a computer-rendered virtual scene. VR has promising applications in commercial, architectural, medical, and educational domains, such as video games, architecture visualization, and model visualization.
Since the first head-mounted display (HMD) was developed in 1968, numerous VR schemes have emerged. These display technologies can be roughly classified into two types, based on binocular parallax or on light fields. For binocular-parallax-based displays, the representative devices are Google Glass and Oculus Rift, which present a three-dimensional scene by displaying different views to the left and right eyes. Nevertheless, since only two views are presented, the reconstructed virtual scene suffers from the vergence-accommodation conflict [1]. In recent years, near-eye light field display technologies have been developed that increase the amount of presented information to eliminate the vergence-accommodation conflict and enhance immersion. Nvidia introduced a multi-view near-eye light field display concept that employs a high-resolution OLED layer and a microlens array to reconstruct a light field [2]. Although it achieves an approximate monocular focusing effect, a tradeoff between spatial and angular resolution is imposed by the limited light field information a single OLED layer can provide, which leads to low resolution. Based on fiber scanning, Schowengerdt et al. proposed a near-eye light field display that uses a fast scanning fiber to present a multi-view scene [3]. Here the amount of information is limited by the characteristics of the fiber, resulting in a tradeoff between refresh rate and resolution. In addition, a near-eye light field display strategy that uses multi-layer LCDs as light modulators to reconstruct the light field has been exploited [4]. This strategy factorizes the original light field data into patterns displayed on the LCDs, making it possible to provide a large amount of information with a simple device. It has very high information utilization and presents a high-resolution light field supporting accommodation.
However, the light field factorization of this method is computationally complex and time-consuming, and the resulting latency degrades performance and harms users' sense of immersion [5, 6].
In order to realize near-eye 3D display with a good sense of immersion, it is necessary to quickly reconstruct a 3D scene with a large amount of information and accommodation effect by a simple hardware device. Inspired by the multi-layer light field display technology, we propose a near eye light field display design with super-fast synthesis speed based on human visual features.
Human visual acuity is not uniform across the visual field. The highest acuity is confined to the fovea, which is responsible for sharp vision, while the large peripheral area delivers low-resolution information about the surroundings [7]. In this paper, we propose to reconstruct a multi-resolution light field whose resolution distribution is consistent with the distribution of human visual acuity. The proposed system provides an immersive, high-resolution light field supporting retinal blur at high reconstruction speed. A human-vision-based algorithm is used to synthesize the light field. Experimental results show that the acceleration of the proposed scheme is evident and grows as the spatial resolution increases. The key aspects of our design are verified by simulation and a prototype device.
2. The display principle
In this section, the principle of the proposed near-eye light field display is discussed. The first part introduces the original light field rendering method. The second part explains the human-vision-based reconstruction algorithm, showing that the light field data can be compressed by setting its resolution distribution identical to human visual acuity, and that the reconstruction can be accelerated by synthesizing a multi-resolution light field.
2.1 The light field rendering
The general steps of multi-layer display technology are to render or capture the original light field data, factorize the light field into 2D patterns, and display these patterns on multi-layer LCD equipment.
Figure 1 shows the arrangement of viewpoints. The origin of the coordinate system is located at the center of the eyeball, and the scene lies in front of the observer along the Z axis. A reference plane is arranged for rendering the original light field: the position of a light ray is described by its intersection with the reference plane and by the viewpoint. As Takaki et al. mentioned [8], at least two light rays should fall into the eye to achieve accommodation. On that account, several viewpoints over the eye are arranged as shown in Fig. 1. To simulate the distribution of viewpoints in different viewing directions, each viewpoint is approximated as a point distributed on the eyeball, as shown in Fig. 1(b). The viewpoints are distributed at equal angular intervals in the X and Y directions, respectively. The position of each viewpoint () is described by Eqs. (1)-(3).
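Eqs. (1)-(3) are not reproduced here; under the assumption that each viewpoint lies on the eyeball sphere along a direction tilted from the visual axis by integer multiples of the angular intervals, the arrangement might be sketched as follows (function and parameter names are ours):

```python
import numpy as np

def viewpoint_positions(R, n, d_theta_x, d_theta_y):
    """Place (2n+1)^2 viewpoints on an eyeball sphere of radius R at equal
    angular intervals about the visual (Z) axis. Hypothetical
    parameterization: Eqs. (1)-(3) are assumed to put each viewpoint along
    the direction tilted by (i*d_theta_x, j*d_theta_y) from the axis."""
    points = []
    for i in range(-n, n + 1):
        for j in range(-n, n + 1):
            # Direction tilted from the visual axis (Z) by the two angles.
            d = np.array([np.tan(i * d_theta_x), np.tan(j * d_theta_y), 1.0])
            points.append(R * d / np.linalg.norm(d))  # project onto the sphere
    return np.array(points)

vp = viewpoint_positions(R=12.0, n=2, d_theta_x=0.01, d_theta_y=0.01)
```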
Here, R represents the radius of the eyeball, and are the indices of viewpoints, () and () are the angular intervals between each viewpoint and the direction of the visual axis, and the subscripts x and y denote the directions along the X and Y axes. The target light field data is composed of the perceived images obtained at these viewpoints, where () is the pixel index of each perceived image.

2.2 The factorization of the light field
In this paper, the display system consists of a backlight, dual-layer LCDs, and a lens, wherein the lens images the LCDs and the backlight as distant, magnified virtual LCDs and backlight, respectively (Fig. 2). The imaging relationship can be described as
where f is the focal length of the lens, and d and are the object distance and the image distance, respectively. The components of a conventional dual-layer display are dual-layer LCDs and a backlight [9]. The front LCD and the rear LCD modulate the light rays emitted by the backlight to create a discrete light field , where () and () correspond to the pixel indices of the front LCD and the rear LCD, respectively. The matrices and correspond to the display patterns of the front and rear LCDs. The light field can be expressed as the outer product of the matrices and , as described by Eq. (5).
The light field is imaged by the lens to form a magnified light field , which can be written as
where and are the imaged patterns of and , respectively. The synthesis can be cast as seeking factorization patterns of that minimize the weighted Euclidean distance to , as described by Eq. (7). The update rule of the factorization is given by [10]
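Since Eqs. (7)-(9) are not reproduced here, the sketch below implements a standard weighted rank-1 multiplicative update of the kind given in [10]; variable names are ours, and the front- and rear-LCD patterns are represented as vectors of a rank-1 factorization:

```python
import numpy as np

def factorize_rank1(L, W, iters=50, eps=1e-9):
    """Weighted rank-1 factorization L ~ f g^T minimizing the weighted
    Euclidean distance ||W o (L - f g^T)||^2 via the multiplicative update
    rule of weighted NMF [10] (o denotes the Hadamard product).
    f and g play the roles of the front- and rear-LCD patterns."""
    rng = np.random.default_rng(0)
    m, n = L.shape
    f = rng.random(m)          # initial pixel values: random in (0, 1)
    g = rng.random(n)
    WL = W * L                 # Hadamard product, computed once
    for _ in range(iters):
        f *= (WL @ g) / ((W * np.outer(f, g)) @ g + eps)
        g *= (WL.T @ f) / ((W * np.outer(f, g)).T @ f + eps)
    return f, g
```

For display, the returned patterns would still need to be rescaled into the LCDs' [0, 1] transmittance range.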
where W is a weight tensor assigning a weight to every pixel value in the system, and the symbol represents the Hadamard product. The initial pixel values of the dual-layer LCDs are random values between zero and one. Figure 3 shows the geometric relationship between , , and the imaged reference plane. As shown in Fig. 3(b), the values of and are optimized by the light ray passing through them. The corresponding pixels of light ray l can be calculated by geometric mapping. For example, the position () of the intersection A on is given by Eq. (10), where () is the coordinate of the point on the reference plane, and the direction and position of a light ray are determined by () and (). The position of intersection C on can be calculated in the same way. By Eqs. (7)-(9) and the results of Eqs. (10) and (11), the light field can be factorized into matrices. In this paper, we conduct a rank-1 light field factorization. As Eqs. (8) and (9) show, high resolution means a mass of information to be updated, resulting in large time consumption. The reconstruction process is accelerated by combining the aforementioned algorithm with human visual features.

2.3 Human visual features
It is a universal experience that we perceive sharp detail in the viewing direction while the surroundings are less distinct. In fact, the human eye captures only part of the scene at any moment and changes viewing direction to acquire the whole. Visual acuity represents the ability of visual resolution, which declines with increasing visual eccentricity [11]. Visual acuity can be expressed as the ability to resolve letters of different sizes at different visual eccentricities. By measuring reading speed for sequences of letters of distinct sizes in central vision and at different peripheral eccentricities, Chung et al. showed that reading speed remains invariant with visual eccentricity as long as the print is appropriately scaled in size [12]. That work formulates the angular resolution at maximum reading speed as a function of visual eccentricity
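A sketch of such a linear acuity model, together with the stepwise polyline simplification used later for synthesis, is given below; the fovea constant, slope, and breakpoints are placeholders, not values from [12]:

```python
import numpy as np

def acuity_linear(e, omega0=1.0, k=0.44):
    """Angular resolution grows linearly with eccentricity e (degrees):
    omega(e) = omega0 * (1 + k * e). omega0 and k are placeholder
    constants, not the values of Eq. (12)."""
    return omega0 * (1.0 + k * np.asarray(e, dtype=float))

def acuity_polyline(e, edges=(5, 15, 30, 60)):
    """Piecewise-constant (staircase) approximation of the linear model:
    within each sub-region the resolution is held at the linear model's
    value at the region's inner edge, so the approximation never demands
    less detail than the linear model allows (cf. the polyline of Fig. 4)."""
    e = np.asarray(e, dtype=float)
    inner = np.zeros_like(e)
    for edge in edges:
        inner = np.where(e >= edge, edge, inner)  # inner boundary of region
    return acuity_linear(inner)
```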
where is the angular resolution at visual eccentricity , and is the angular resolution at the fovea. Equation (12) is depicted by the red line in Fig. 4, which shows the angular resolution declining linearly with increasing visual eccentricity. In this paper, the red line is replaced with a polyline (the green line in Fig. 4) to reduce the computational complexity of the synthesis.

2.4 The reconstruction algorithm based on human visual features
Unlike a light field with uniform resolution, a multi-resolution light field does not have the same number of pixels per row in each view. Therefore, owing to the multi-resolution property, the original light field must be resampled, and the sampled information of each view is collected to form a matrix.
Figure 5 shows the sampling process for each view. As shown in Fig. 5(a), for every view, the entire visual field is first divided into N sub-regions according to the discretized and the corresponding visual eccentricity . Second, for each sub-region, the sampling unit size is set equal to the spatial resolution derived from the corresponding . The sampled information in each sub-region is recorded in a column-wise manner. Lastly, the sub-regions' information is arrayed from outside to inside to form a new matrix (the compressed image), where is the pixel index of matrix S. The numbers of rows and columns of S are equal, and the extra pixel values of S are set to zero. For every view, is the matrix that records the values of the sampling points. All the of each view are gathered as the compressed light field . The process of converting to is defined as light field compression.
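The per-view compression described above might be sketched as follows; the concentric square-ring geometry, sampling steps, and function names here are illustrative assumptions rather than the paper's exact scheme:

```python
import numpy as np

def compress_view(img, steps=(4, 2, 1)):
    """Foveated sampling of one view: the image is split into concentric
    square sub-regions around the center, and region i (outermost first)
    is sampled every steps[i] pixels. Samples are gathered column-wise per
    region, then the regions are concatenated from outside to inside, as
    in Fig. 5. Returns the sample values and their original indices."""
    h, w = img.shape
    cy, cx = h // 2, w // 2
    yy, xx = np.mgrid[0:h, 0:w]
    # Chebyshev distance to the center defines the square sub-region rings.
    ecc = np.maximum(np.abs(yy - cy), np.abs(xx - cx))
    edges = np.linspace(ecc.max(), 0, len(steps) + 1).astype(int)
    values, indices = [], []
    for i, step in enumerate(steps):
        ring = (ecc <= edges[i]) & (ecc > edges[i + 1])
        if i == len(steps) - 1:
            ring = ecc <= edges[i]        # innermost region keeps the center
        keep = ring & (yy % step == 0) & (xx % step == 0)
        cols, rows = np.nonzero(keep.T)   # transpose -> column-wise order
        values.append(img[rows, cols])
        indices.append(np.stack([rows, cols], axis=1))
    return np.concatenate(values), np.concatenate(indices)
```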
The whole compressed light field is , which can be factorized into the compressed front LCD and rear LCD , where and correspond to the pixel indices of and .
We construct a map image of the reference plane to record the original indices () of the sampling points, which allows us to leverage the GPU for fast sampling. The construction process of is the same as that of ; by contrast, the information recorded in is the original index () of each sampling point instead of its value. The map images of and are formed in the same way as that of the reference plane. Furthermore, we construct a map image of to record the indices of the sampling points on , formed in the same way as . The map images and are utilized in the reconstruction process.
As mentioned in Section 2.2, in order to factorize with Eqs. (7)-(9), the indices of the sampling light rays' corresponding pixels on the compressed front LCD and rear LCD are required. However, this correspondence cannot be directly derived from Eqs. (10) and (11). Figure 6 explains the process of obtaining the correspondence.
For each view, first, the index () of sampling point B in the original light field is obtained from the map of the reference plane (Fig. 6(a)), and the light ray is positioned by B and the viewpoint. Second, the intersections of light ray B with and , corresponding to and , are calculated by Eqs. (10) and (11) (Fig. 6(b)). Third, the index () of A on is obtained from map (Fig. 6(c)), and the index () of C on is obtained in the same way from the map of . In this way, the sampling points on and corresponding to can be ascertained. Combining this method with Eqs. (7)-(9), can be quickly factorized into and .
The factorized patterns cannot be displayed until and are converted to the multi-resolution forms and , respectively. This conversion is defined as decompression and is shown in Fig. 7. The pixel values of and stored in and can be retrieved quickly through the map images and , respectively. In summary, the whole algorithm flow for factorizing into the multi-resolution patterns and is depicted in Fig. 8.
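Decompression thus reduces to a single gather through a precomputed index map, which is what makes the GPU implementation fast. A minimal numpy analogue for one sub-region (names and layout are ours, not the paper's):

```python
import numpy as np

def build_map(h, w, step):
    """Map image for a single sub-region sampled every `step` pixels:
    each output pixel stores the position, in the compressed (column-wise)
    sample vector, of the sample covering its step x step block."""
    yy, xx = np.mgrid[0:h, 0:w]
    sy, sx = yy // step, xx // step          # which sample block we fall in
    n_rows = -(-h // step)                   # samples per column (ceil div)
    return sx * n_rows + sy                  # column-wise flattened index

def decompress(values, map_img):
    """Decompression is a single gather through the map image."""
    return values[map_img]
```

On the GPU the same gather is one texture fetch per output pixel, with the map stored as a texture.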
As discussed above, and record the indices of pixels on the original light field or the LCDs and on the compressed light field, respectively. The information is recorded in image format to take full advantage of efficient GPU implementations. However, since a recorded index may be larger than 255, the map images use the portable network graphics format (.png) to ensure accuracy. The relationship between the RGBA values of the map images and the recorded pixel indices is described by Eqs. (13)-(16).
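Eqs. (13)-(16) are not reproduced here; one plausible byte-level packing consistent with the description (two 8-bit channels per index coordinate, an assumption of ours) is:

```python
def pack_index(u, v):
    """Pack a pixel index (u, v) into RGBA bytes. The exact channel
    assignment of Eqs. (13)-(16) is not reproduced in the text; this
    assumes the low/high bytes of u go to R/G and those of v to B/A,
    which supports indices up to 65535 per coordinate."""
    return (u % 256, u // 256, v % 256, v // 256)

def unpack_index(r, g, b, a):
    """Inverse of pack_index: recover (u, v) from the RGBA bytes."""
    return (r + 256 * g, b + 256 * a)
```

Any lossless four-channel 8-bit format would do; PNG is chosen because lossy compression would corrupt the indices.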
Here R, G, B, and A correspond to the red, green, blue, and alpha values of a map-image pixel, and () is the index of the recorded pixel. Equations (13)-(16) make the map image capable of recording an ultra-high-resolution image.

3. Experiment
In this section, the details of constructing the near-eye light field display based on visual features are described, and the acceleration of our method is compared with that of the conventional dual-layer LCD light field display.
3.1 Hardware and software
We built a prototype (Fig. 9) to validate the proposed approach. The prototype includes dual LCDs with a pixel pitch of 51 µm, a uniform backlight, and a lens. The focal length of the lens is 54.25 mm and its diameter is 40 mm. A light field with a resolution of 608×760 and 5×5 views is set as the target. The reconstructed light field is observed by a CCD with an FOV of .
According to the optical imaging formula, the two physical LCDs are located 48.051 mm and 53.355 mm behind the lens, and the virtual LCDs are imaged at 370.00 mm and 2000.00 mm, respectively. The lens is placed 21.8 mm away from the CCD. The original light field rendering, light field factorization, and display are written in GLSL and C++. The algorithms run on an Intel Core i3 PC with an Intel HD Graphics 4400 GPU. A model composed of 4 cubes marked a, b, c, and d is used to demonstrate accommodation and multi-resolution. Their relative positions are shown in Fig. 10; the distances from cubes a, b, c, and d to the observer are 370 mm, 770 mm, 1185 mm, and 2000 mm, respectively.
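The LCD placement follows the Gaussian lens formula of Eq. (4). A sketch, assuming the common convention in which a negative image distance denotes a virtual image on the object side (the prototype's quoted distances may be measured from a different reference point, so the values here are only illustrative):

```python
def image_distance(f, d_o):
    """Gaussian lens equation 1/d_o + 1/d_i = 1/f, distances in mm.
    For an object inside the focal length (d_o < f) the result is
    negative, i.e. a magnified virtual image, as used by the prototype
    to push the LCD images out to arm's length and beyond."""
    return 1.0 / (1.0 / f - 1.0 / d_o)
```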
The light field is synthesized by the proposed reconstruction algorithm. We took 4 sets of images of the synthesized light field, with the observer gazing at each of the 4 cubes in turn and keeping the corresponding cube within the depth of field.
3.2 Optical distortion correction
Generally speaking, optical systems exhibit distortion that degrades display fidelity. The general form of the distortion is expressed as a combination of radial basis functions and tangential components [13], which can be modeled as a nonlinear polynomial. A pre-distortion pattern generated by computational distortion correction can compensate for the distortion. In this paper, a polynomial geometric mapping between the optical object, the ideal image, and the distorted image is established, so that with pre-distortion applied, the input images displayed on the dual LCDs are observed as the ideal images.
The distortion correction proceeds as follows. First, a calibration image composed of regularly arranged grid points is created. This calibration image is then displayed on the front LCD, with the rear LCD set to white, to obtain a distorted image. Another distorted image is acquired by exchanging the displayed images of the front and rear LCDs. The mapping between the optical object and the distorted image is set up as
Here, (, ) and (, ) are the coordinates of a grid point and the corresponding distorted point, respectively, m and n are the powers of the polynomials, and and are the coefficients of the corresponding polynomials, which can be calculated by the least-squares method.
Since the distortion near the optical axis is minimal, the central region of the distorted image can be taken as a linearly magnified copy of the same region of the calibration image. The ideal image of the object is therefore a linearly magnified image, and the magnification is calculated from the positions of the central grid points in the calibration and distorted images. An ideal magnified input image is acquired by applying this magnification to the input image. Afterwards, Eqs. (17) and (18) are applied to the ideal input image to obtain the pre-distortion image, as described by Eqs. (19) and (20).
Here, (, ) and (, ) are the coordinates of the pre-distortion image and the ideal input image, respectively. The distortion of the dual-layer LCD system is compensated by applying this approach to both LCD layers.
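The polynomial fit of Eqs. (17)-(20) can be sketched as an ordinary least-squares problem; the full bivariate polynomial basis below is an assumed form, since the equations themselves are not reproduced here:

```python
import numpy as np

def poly_basis(points, degree=3):
    """Design matrix of bivariate monomials x^m * y^n with m + n <= degree."""
    x, y = points[:, 0], points[:, 1]
    return np.stack([x**m * y**n
                     for m in range(degree + 1)
                     for n in range(degree + 1 - m)], axis=1)

def fit_distortion(grid, distorted, degree=3):
    """Fit the mapping from ideal grid points to their distorted
    observations, one least-squares problem per output coordinate,
    as in the least-squares solution of Eqs. (17) and (18)."""
    A = poly_basis(grid, degree)
    cx, *_ = np.linalg.lstsq(A, distorted[:, 0], rcond=None)
    cy, *_ = np.linalg.lstsq(A, distorted[:, 1], rcond=None)
    return cx, cy

def apply_mapping(points, cx, cy, degree=3):
    """Evaluate the fitted polynomial mapping at arbitrary points."""
    A = poly_basis(points, degree)
    return np.stack([A @ cx, A @ cy], axis=1)
```

The same fitted mapping, applied to the ideal input image, yields the pre-distortion image of Eqs. (19) and (20).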
4. Results and analysis
4.1 Accommodation
Depth discrimination is a fundamental issue for 3D display technology, and accommodation elicited by retinal blur can improve depth discrimination performance. The experimental results, shown in Fig. 11, demonstrate the ability of the proposed approach to achieve retinal blur. In Fig. 11(a), a set of images was taken by altering the CCD's focal depth from 370 mm to 2000 mm. Each object is in reasonable focus when the CCD is focused at its intended focal depth, and out of focus when the CCD is focused elsewhere.
4.2 The perceived images
The perceived sharpness of the real world varies with the observer's focal depth and viewing direction. Figure 11(b) shows perceived images of cubes a and d recorded within the depth of field but from different viewing directions. Although each object is within the depth of field, the sharpness of the perceived scene changes as the viewing direction varies. Consequently, sharp images are perceived only when the objects are both in reasonable focus and within the fovea. This property is consistent with human visual features.
Figure 12 shows the perceived results when the camera is focused on the gazed cube. The object within the fovea is perceived sharply, whereas the peripheral parts are perceived at low resolution. Figure 13 shows the maps when the viewer gazes at cube a. Figure 14 shows example multi-resolution layer patterns, which the proposed method reconstructs 3.47 times faster than the conventional algorithm. Combining Fig. 12(a) with the division of the light field reflected by map S2 in Fig. 13, it is clear that the sharpness of the reconstructed light field declines with increasing visual eccentricity. Comparing Fig. 12(a) with Fig. 12(d) suggests that the gazed cube d appears less sharp than cube a, because the image quality of multi-layer LCDs is influenced by diffraction effects [4, 14, 15].
4.3 Acceleration
Time consumption is an important issue for VR. To estimate the computational acceleration of this method, two sets of comparison experiments were conducted. To analyze the acceleration at different spatial resolutions, a set of experiments was conducted by altering the spatial resolution with a constant number of viewpoints. Ten light fields with 2×2 viewpoints and different spatial resolutions were reconstructed by the proposed algorithm and by the conventional dual-layer reconstruction algorithm. The spatial resolutions were set to different integer multiples of 276×276. The runtime was measured over 5 iterations, and the recorded time is the runtime of one iteration.
The results are plotted in Figs. 15 and 16. Figure 15 plots the runtime of the two algorithms at different spatial resolutions. Figure 16 plots the ratio of the conventional algorithm's runtime to that of the proposed method, which reflects the acceleration directly. In summary, as the spatial resolution increases, the time consumption of the conventional algorithm grows much more rapidly than that of the proposed algorithm, so a higher resolution results in a greater acceleration.
To examine the influence of angular and spatial resolution on acceleration, another set of experiments was conducted by altering both while keeping the total amount of light field information constant. Table 1 shows the runtime taken by the two methods for one iteration of each light field reconstruction. The proposed method provides an obvious acceleration, and a more prominent acceleration is gained at higher spatial resolution. In conclusion, the acceleration of the proposed scheme is evident and increases rapidly with the spatial resolution.
As discussed previously, the proposed display system provides a sharp image in the fovea and a gradually blurred scene in the peripheral area. When watching a scene, a viewer changes viewing direction to bring the region of interest into foveal vision. This watching behavior comprises fixations and saccades [16]. Saccades are fast, discontinuous eye movements, and fixations are the intervals between saccades. The typical duration of a fixation is 200-300 ms [17]. In this paper, the reduced synthesis time is much shorter than that, indicating that the entire scene can be updated within a fixation whenever the viewing direction changes, achieving a high-resolution near-eye light field display without perceptible latency.
5. Conclusion
VR display, as a next-generation display technology, is gaining prevalence. For a near-eye light field display, updating the displayed images in accordance with the rotation of the eye or head is necessary, because it is hard to keep the head or eyes motionless. Moreover, high resolution with low or no latency is crucial for immersion and has not yet been adequately addressed. In this paper, we introduced a new near-eye light field display that reconstructs, at super-fast synthesis speed, a multi-resolution light field whose resolution distribution is consistent with human visual features for the corresponding viewing direction. A human-vision-based reconstruction algorithm is introduced to reconstruct a high-resolution light field supporting accommodation. The prototype and experimental results indicate that the reconstruction of the proposed system is markedly faster than that of the conventional reconstruction system. With this method and an eye-tracking device, a real-time multi-layer near-eye light field display with higher image quality becomes feasible.
Acknowledgments
This work is supported by the National Natural Science Foundation of China (61575175), the National Key Research and Development Program of China (2016YFB1001502), and the National Basic Research Program of China (2013CB328802).
References and links
1. E. Peli, “Visual and optometric issues with head-mounted displays,” in IS & T/OSA Optics & Imaging in the Information Age, (The Society for Imaging Science and Technology, 1996), pp. 364–369.
2. D. Lanman and D. Luebke, “Near-eye light field displays,” ACM Trans. Graph. 32(6), 1–10 (2013). [CrossRef]
3. B. T. Schowengerdt, H. G. Hoffman, C. M. Lee, C. D. Melville, and E. J. Seibel, “57.1: Near-to-Eye Display using Scanning Fiber Display Engine,” SID Symposium Digest Tech. Papers 41(1), 848–851 (2010). [CrossRef]
4. F. C. Huang, K. Chen, and G. Wetzstein, “The light field stereoscope: immersive computer graphics via factored near-eye light field displays with focus cues,” ACM Trans. Graph. 34(4), 60 (2015).
5. R. B. Welch, T. T. Blackmon, A. Liu, B. A. Mellers, and L. W. Stark, “The effects of pictorial realism, delay of visual feedback, and observer interactivity on the subjective sense of presence,” Presence (Camb. Mass.) 5(3), 263–273 (1996). [CrossRef]
6. S. Uno and M. Slater, “The sensitivity of presence to collision response,” in Virtual Reality Annual International Symposium (IEEE, 1997), pp. 95–103. [CrossRef]
7. A. H. Chan and A. J. Courtney, “Foveal acuity, peripheral acuity and search performance: A review,” Int. J. Industrial Ergonomics 18(2), 113–119 (1996). [CrossRef]
8. Y. Takaki, “High-density directional display for generating natural three-dimensional images,” Proc. IEEE 94(3), 654–663 (2006). [CrossRef]
9. D. Lanman, M. Hirsch, Y. Kim, and R. Raskar, “Content-adaptive parallax barriers: optimizing dual-layer 3D displays using low-rank light field factorization,” ACM Trans. Graph. 29(6), 1–10 (2010). [CrossRef]
10. N. Ho, P. Van Dooren, and V. Blondel, “Weighted nonnegative matrix factorization and face feature extraction,” Image Vis. Comput. 2007, 1–17 (2007).
11. R. Rosén, Peripheral Vision: Adaptive Optics and Psychophysics (KTH Royal Institute of Technology, 2013).
12. S. T. Chung, J. S. Mansfield, and G. E. Legge, “Psychophysics of reading. XVIII. The effect of print size on reading speed in normal peripheral vision,” Vision Res. 38(19), 2949–2962 (1998). [CrossRef] [PubMed]
13. Z. Zhang, “A flexible new technique for camera calibration,” IEEE Trans. Pattern Anal. Mach. Intell. 22(11), 1330–1334 (2000). [CrossRef]
14. A. Maimone, G. Wetzstein, M. Hirsch, D. Lanman, R. Raskar, and H. Fuchs, “Focus 3D: Compressive accommodation display,” ACM Trans. Graph. 32(5), 153 (2013). [CrossRef]
15. A. Maimone and H. Fuchs, “Computational augmented reality eyeglasses,” in Mixed and Augmented Reality (ISMAR) (IEEE, 2013), pp. 29–38.
16. S. Pannasch, J. R. Helmert, K. Roth, A. K. Herbold, and H. Walter, “Visual fixation durations and saccade amplitudes: Shifting relationship in a variety of conditions,” J. Eye Mov. Res. 2(2), 1–19 (2008).
17. K. Rayner, “Eye movements in reading and information processing: 20 years of research,” Psychol. Bull. 124(3), 372–422 (1998). [CrossRef] [PubMed]