ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding

1Key Laboratory of Signal Detection and Processing, Xinjiang University    2Joint International Research Laboratory of Silk Road Multilingual Cognitive Computing, Xinjiang University    3Department of Computer Science and Technology, Nanjing University
Corresponding authors | NeurIPS 2025

Abstract

State Space models (SSMs) like PointMamba provide efficient feature extraction for point cloud self-supervised learning with linear complexity, surpassing Transformers in computational efficiency. However, existing PointMamba-based methods rely on complex token ordering and random masking, disrupting spatial continuity and local semantic correlations. We propose ZigzagPointMamba to address these challenges. The key to our approach is a simple zigzag scan path that globally sequences point cloud tokens, enhancing spatial continuity by preserving the proximity of spatially adjacent point tokens. Yet, random masking impairs local semantic modeling in self-supervised learning. To overcome this, we introduce a Semantic-Siamese Masking Strategy (SMS), which masks semantically similar tokens to facilitate reconstruction by integrating local features of original and similar tokens, thus overcoming dependence on isolated local features and enabling robust global semantic modeling. Our pre-training ZigzagPointMamba weights significantly boost downstream tasks, achieving a 1.59% mIoU gain on ShapeNetPart for part segmentation, a 0.4% higher accuracy on ModelNet40 for classification, and 0.19%, 1.22%, and 0.72% higher accuracies respectively for the classification tasks on the OBJ-BG, OBJ-ONLY, and PB-T50-RS subsets of ScanObjectNN.

Method Overview and Results

Poster

Additional Qualitative Results

Reconstruction Quality Analysis

Qualitative analysis of mask predictions on ShapeNet validation set
Figure: Qualitative analysis of mask predictions from ZigzagPointMamba on ShapeNet validation set. From left to right: Input point cloud, Masked version, Reconstructed result, and additional object examples.

Part Segmentation Comparison

Comparison of part segmentation results between PointMamba and ZigzagPointMamba
Figure: Qualitative comparison of part segmentation results. Top: Ground Truth, Middle: PointMamba predictions, Bottom: ZigzagPointMamba predictions. Objects include laptop, lamp, guitar, airplane, and table.

BibTeX

@inproceedings{diao2025zigzagpointmamba,
  title={ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding},
  author={Diao, Linshuang and Song, Sensen and Qian, Yurong and Ren, Dayong},
  booktitle={Advances in neural information processing systems},
  year={2025}
}