SparseVoxNet: 3-D object recognition with sparsely aggregation of 3-D dense blocks

Ahmad Karambakhsh, Bin Sheng, Ping Li, Huating Li, Jinman Kim, Younhyun Jung, C. L. Philip Chen

Research output: Journal article publicationJournal articleAcademic researchpeer-review

1 Citation (Scopus)


Automatic recognition of 3-D objects in a 3-D model by convolutional neural network (CNN) methods has been successfully applied to various tasks, e.g., robotics and augmented reality. Three-dimensional object recognition is mainly performed by analyzing the object using multi-view images, depth images, graphs, or volumetric data. In some cases, using volumetric data provides the most promising results. However, existing recognition techniques on volumetric data have many drawbacks, such as losing object details on converting points to voxels and the large size of the input volume data that leads to substantial 3-D CNNs. Using point clouds could also provide very promising results; however, point-cloud-based methods typically need sparse data entry and time-consuming training stages. Thus, using volumetric could be a more efficient and flexible recognizer for our special case in the School of Medicine, Shanghai Jiao Tong University. In this article, we propose a novel solution to 3-D object recognition from volumetric data using a combination of three compact CNN models, low-cost SparseNet, and feature representation technique. We achieve an optimized network by estimating extra geometrical information comprising the surface normal and curvature into two separated neural networks. These two models provide supplementary information to each voxel data that consequently improve the results. The primary network model takes advantage of all the predicted features and uses these features in Random Forest (RF) for recognition purposes. Our method outperforms other methods in training speed in our experiments and provides an accurate result as good as the state-of-the-art.

Original languageEnglish
Pages (from-to)1-15
Number of pages15
JournalIEEE Transactions on Neural Networks and Learning Systems
Publication statusAccepted/In press - May 2022


  • 3-D convolutional network
  • 3-D recognition
  • Convolutional neural networks
  • Data models
  • Feature extraction
  • Object recognition
  • Shape
  • Solid modeling
  • SparseNet
  • surface normal
  • Training
  • volumetric representation

ASJC Scopus subject areas

  • Software
  • Computer Science Applications
  • Computer Networks and Communications
  • Artificial Intelligence


Dive into the research topics of 'SparseVoxNet: 3-D object recognition with sparsely aggregation of 3-D dense blocks'. Together they form a unique fingerprint.

Cite this