Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Multi-class GAN for generating multi-class images in object recognition

Not Accessible

Your library or personal account may give you access

Abstract

The current generative adversarial network (GAN) is limited in the application of data augmentation in object recognition. The training of the GAN is unstable, and the generated image quality is poor. Methods such as progressive growing of GANs and multi-scale gradient GAN solve these problems. The packed GAN (PacGAN) solves the problem of mode collapse during training. However, these methods can generate only one type of image at a time, and the training time is long. To solve the above problems, this paper proposes the multi-class GAN (Mc-GAN). It uses an augmented discriminator to train multiple generators at the same time. Through iterative training, the discriminator can accurately judge the output of each generator and guide it to generate the corresponding image. This paper analyzes the optimization process of the objective function of Mc-GAN. Experiments show that the method can generate high-quality images and reduce training time, and it can be used for data augmentation in object recognition. It effectively improves the practicality of GAN.

© 2022 Optica Publishing Group

Full Article  |  PDF Article
More Like This
Multi-class remote sensing object recognition based on discriminative sparse representation

Xin Wang, Siqiu Shen, Chen Ning, Fengchen Huang, and Hongmin Gao
Appl. Opt. 55(6) 1381-1394 (2016)

Semantic-guided polarization image fusion method based on a dual-discriminator GAN

Ju Liu, Jin Duan, Youfei Hao, Guangqiu Chen, and Hao Zhang
Opt. Express 30(24) 43601-43621 (2022)

Data availability

Data underlying the results presented in this paper are available in the Celeba Dataset, Ref. [21], and ImageNet Dataset, Ref. [22].

21. Z. Liu, P. Luo, X. Wang, and X. Tang, “Deep learning face attributes in the wild,” in IEEE International Conference on Computer Vision (2015), pp. 3730–3738.

22. D. Jia, D. Wei, R. Socher, L. Li, L. Kai, and F. Li, “ImageNet: a large-scale hierarchical image database,” in IEEE Conference on Computer Vision & Pattern Recognition (2009).

Cited By

You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Figures (5)

You do not have subscription access to this journal. Figure files are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Tables (6)

You do not have subscription access to this journal. Article tables are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Equations (20)

You do not have subscription access to this journal. Equations are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.