UMA-Net: an unsupervised representation learning network for 3D point cloud classification

Jie Liu; Jie Liu; Yu Tian; Yu Tian; Guohua Geng; Guohua Geng; Guohua Geng; Haolin Wang; Haolin Wang; Da Song; Da Song; Kang Li; Kang Li; Mingquan Zhou; Mingquan Zhou; Xin Cao; Xin Cao; Xin Cao

doi:10.1364/JOSAA.456153

Journal of the Optical Society of America A
Vol. 39,
Issue 6,
pp. 1085-1094
(2022)
•https://doi.org/10.1364/JOSAA.456153

UMA-Net: an unsupervised representation learning network for 3D point cloud classification

Jie Liu, Yu Tian, Guohua Geng, Haolin Wang, Da Song, Kang Li, Mingquan Zhou, and Xin Cao

Not Accessible

Your library or personal account may give you access

Get PDF
Email
Share
Get Citation
Copy Citation Text
Jie Liu, Yu Tian, Guohua Geng, Haolin Wang, Da Song, Kang Li, Mingquan Zhou, and Xin Cao, "UMA-Net: an unsupervised representation learning network for 3D point cloud classification," J. Opt. Soc. Am. A 39, 1085-1094 (2022)

Export Citation
- BibTex
- Endnote (RIS)
- HTML
- Plain Text
Citation alert
Save article

Check for updates

Abstract

The success of deep neural networks usually relies on massive amounts of manually labeled data, which is both expensive and difficult to obtain in many real-world datasets. In this paper, a novel unsupervised representation learning network, UMA-Net, is proposed for the downstream 3D object classification. First, the multi-scale shell-based encoder is proposed, which is able to extract the local features from different scales in a simple yet effective manner. Second, an improved angular loss is presented to get a good metric for measuring the similarity between local features and global representations. Subsequently, the self-reconstruction loss is introduced to ensure the global representations do not deviate from the input data. Additionally, the output point clouds are generated by the proposed cross-dim-based decoder. Finally, a linear classifier is trained using the global representations obtained from the pre-trained model. Furthermore, the performance of this model is evaluated on ModelNet40 and applied to the real-world 3D Terracotta Warriors fragments dataset. Experimental results demonstrate that our model achieves comparable performance and narrows the gap between unsupervised and supervised learning approaches in downstream object classification tasks. Moreover, it is the first attempt to apply the unsupervised representation learning for 3D Terracotta Warriors fragments. We hope this success can provide a new avenue for the virtual protection of cultural relics.

Full Article | PDF Article

More Like This

TGPS: dynamic point cloud down-sampling of the dense point clouds for Terracotta Warrior fragments

Jie Liu, Da Song, Guohua Geng, Yu Tian, Mengna Yang, Yangyang Liu, Mingquan Zhou, Kang Li, and Xin Cao
Opt. Express 31(6) 9496-9514 (2023)

Simplification method for 3D Terracotta Warrior fragments based on local structure and deep neural networks

Guohua Geng, Jie Liu, Xin Cao, Yangyang Liu, Wei Zhou, Fengjun Zhao, Linzhi Su, Kang Li, and Mingquan Zhou
J. Opt. Soc. Am. A 37(11) 1711-1720 (2020)

TDNet: transformer-based network for point cloud denoising

Xueli Xu, Guohua Geng, Xin Cao, Kang Li, and Mingquan Zhou
Appl. Opt. 61(6) C80-C88 (2022)

Previous Article Next Article

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Cited By

You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Figures (11)

You do not have subscription access to this journal. Figure files are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Tables (9)

You do not have subscription access to this journal. Article tables are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Equations (15)

You do not have subscription access to this journal. Equations are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Label	Arm	Body	Head	Leg	Total
Train	2656	2720	2720	2496	10144
Test	476	504	428	444	1852

Method	Re.	Supervised	M40	M10
3D-GAN [37]	V	F	83.30	91.00
VIPGAN [38]	Mv	F	91.98	94.05
FoldingNet (M40) [10]	P	F	84.36	91.85
FoldingNet [10]	P	F	88.40	94.40
$l$ -GAN (M40) [16]	P	F	87.27	92.18
$l$ -GAN [16]	P	F	85.70	95.30
Multi-Task [22]	P	F	89.10	—
L2G-AE [17]	P	F	90.64	95.37
PointNet [4]	P	T	89.20	—
PointNet $+ +$ [5]	P	T	90.70	—
ShellNet [9]	P	T	93.10	—
PointWeb [8]	P	T	92.30	—
AMS-Net [29]	P	T	92.94	95.83
DGCNN [7]	P	T	92.90	—
SO-Net [15]	P(2048)	T	90.90	—
AMS-Net [29]	P, N	T	93.52	95.91
Ours	P	F	92.06	95.60

		Acc. (%)
Encoder	Single-scale	90.64
Encoder	Multi-scale	92.06
Decoder	MLPs	91.65
	FlodingNet	92.06
	Cross-dim block	91.97
Loss	$L_{R}$	89.47
	$L_{R} + L_{N}$	91.86
	$L_{R} + L_{A}$	89.59
	$L_{R} + L_{IA}$	92.06
$N_{shell}^{i}$	[4,8,16]	92.00
	[8,16,32]	92.06
	[16,32,32]	90.88

Model	FLOPs	Throughput	Acc. (%)
DGCNN	39.924 G	257.27pc/s	92.9
SSG PointNet $+ +$	13.814 G	113.42pc/s	90.5
MSG PointNet $+ +$	64.473 G	68.78pc/s	91.7
Method in [28]	91.666 G	457.59pc/s	—
Ours	8.039 G	53.84pc/s	92.06

Method	Data Type	Supervised	OA(%)
Method in [26]	I	T	84.34
Method in [39]	P	T	87.64
PointNet [4]	P	T	88.93
Method in [28]	P, I	T	91.41
AMS-Net [29]	P	T	95.68
Ours	P	F	93.90

Method	Arm	Body	Head	Leg
Method in [28] (G)	82.51	96.45	92.36	84.41
Method in [28] (T)	77.75	92.75	91.50	76.25
Method in [28]	87.55	87.55	94.37	88.41
AMS-Net [29]	92.40	98.10	98.00	94.20
Ours	91.00	94.73	94.58	92.54

Abstract

Data availability

Cited By

Figures (11)

Tables (9)

Equations (15)

Journal of the Optical Society of America A