Abstract
We examine methods used for the construction of image descriptors for the global visual indoor localization problem based on the aggregation of local features and deep-learning convolutional neural networks. We propose a criterion for estimating the quality of image matching results and a technique for selecting reference frames in a test sample consisting of images obtained by the sequential motion of the camera. Additionally, we present an evaluation of the efficiency and speed of the considered methods, which determined that methods based on convolutional neural networks provide more reliable image matching than techniques based on local features, although the former exhibit lower processing speed.
© 2017 Optical Society of America
PDF Article
More Like This
Cited By
You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.
Contact your librarian or system administrator
or
Login to access Optica Member Subscription