3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis

General point clouds have been increasingly investigated for different tasks, and recently Transformer-based networks are proposed for point cloud analysis. However, there are barely related works for medical point clouds, which are important for disease detection and treatment. In this work, we propose an attention-based model specifically for medical point clouds, namely 3D medical point Transformer (3DMedPT), to examine the complex biological structures. By augmenting contextual information and summarizing local responses at query, our attention module can capture both local context and global content feature interactions. However, the insufficient training samples of medical data may lead to poor feature learning, so we apply position embeddings to learn accurate local geometry and Multi-Graph Reasoning (MGR) to examine global knowledge propagation over channel graphs to enrich feature representations. Experiments conducted on IntrA dataset proves the superiority of 3DMedPT, where we achieve the best classification and segmentation results. Furthermore, the promising generalization ability of our method is validated on general 3D point cloud benchmarks: ModelNet40 and ShapeNetPart. Code is released.

PDF Abstract


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
3D Point Cloud Classification IntrA 3DMedPT F1 score (5-fold) 0.936 # 1
3D Part Segmentation IntrA 3DMedPT IoU (V) 94.82 # 1
DSC (V) 97.29 # 1
IoU (A) 82.39 # 1
DSC (A) 89.71 # 1
3D Point Cloud Classification ModelNet40 3DMedPT Overall Accuracy 93.4 # 30
Number of params 1.54M # 65