Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Study of the ability of neural networks to extract and use semantic information when they are trained to reconstruct noisy images

Not Accessible

Your library or personal account may give you access

Abstract

Subject of study. This paper discusses how deep convolutional neural networks can be used to improve images obtained under noisy conditions when supplementary information concerning objects on the image is input in the form of segmentation masks. Several methods of using semantic information during the operation of the network are studied. First, by introducing a mask along with the image at the input of the network, and second, by creating a loss function by using a segmentation mask. Method. Several experimental series are carried out both with and without using various types of semantic information. The experiments were run with several noise intensities. The image-reconstruction quality as a whole is analyzed, along with the reconstruction quality of a region of images corresponding to an entire class. The class of road signs was chosen as the target class since it possesses less variability than many other classes, and this gives an advantage to the network when it is reconstructed in the presence of semantic information compared with in its absence. The COCO dataset with marked segmentation maps was used during the study. Test surroundings for reconstructing the images, with the possibility of visualizing the results of the testing, were developed to analyze the semantic properties of all the objects contained in the COCO set, and this made it possible to draw certain useful conclusions concerning how various properties of the objects influence the accuracy with which they are reconstructed. Main results. We show that, under very noisy conditions, a reconstruction network trained using supplementary information in the form of segmentation masks can do a better job of reconstructing objects that correspond to the masks (by 3.5%), while lowering the capability of the network to reconstruct the entire image is only insignificantly reduced (by 0.4%). However, no such quality increment is obtained for weak and medium noise. Practical significance. Our goal in this paper was not to create a finished algorithm and neural-network architecture, but only to study the possible properties of such algorithms; we therefore supplied semantic reference markers to the input of the neural networks. In developing such a method, a segmenter network that would automatically extract information from a noisy image can be added (the process itself can be iterative in this case—the image is improved after being segmented, and a refined segmentation mask is constructed from the improved image).

© 2022 Optica Publishing Group

PDF Article
More Like This
Semantic ghost imaging based on recurrent-neural-network

Yuchen He, Sihong Duan, Yuan Yuan, Hui Chen, Jianxing Li, and Zhuo Xu
Opt. Express 30(13) 23475-23484 (2022)

Physics-informed deep neural network for image denoising

Emmanouil Xypakis, Valeria de Turris, Fabrizio Gala, Giancarlo Ruocco, and Marco Leonetti
Opt. Express 31(26) 43838-43849 (2023)

Semantic-guided polarization image fusion method based on a dual-discriminator GAN

Ju Liu, Jin Duan, Youfei Hao, Guangqiu Chen, and Hao Zhang
Opt. Express 30(24) 43601-43621 (2022)

Cited By

You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.