Three-dimensional working pose estimation in industrial scenarios with monocular camera

Yantao Yu, Heng Li, Jiannong Cao, Xiaochun Luo

Research output: Journal article publicationJournal articleAcademic researchpeer-review

Abstract

Three-dimensional (3D) pose data has drawn great attention owing to its wide range of applications. Internet of Things (IoT)-based techniques have been introduced to collect 3D pose data. Though previous studies have yielded significant results, researchers have yet to use 3D pose estimation in real-life applications. Since wearable sensors might be intrusive and infra-red depth cameras are sensitive to sunlight, monocular-camera-based computer vision algorithms provide a possible solution. Previous algorithms are trained and tested with simple daily postures. There are industrial scenarios where the poses are more complex and irregular. An example is the poses of workers on construction sites, such as lifting, climbing, and rebar tying. These postures differ drastically from daily postures and vary from person to person. For instance, some workers prefer bending rebar tying, while others prefer squatting rebar tying. As a result, previous monocular-camera-based-3D poses estimation methods have proved to be inapplicable to industrial scenarios. Thus, this paper developed a monocular-camera-based 3D estimation method which is suitable for industry working poses. A residual artificial neural network (RANN) with flexible complexity and weighted training loss was designed. A 3D pose dataset, which consists of diversified working poses in worksites, was built to test the performance of the network in complex scenarios. Compared with previous 3D pose capture methods, the mean per joint position error reduced by 31.42%. The latency was 0.24 s. Thus we conclude that the proposed monocular-camera-based method has great potential in industrial application scenarios.

Original languageEnglish
JournalIEEE Internet of Things Journal
DOIs
Publication statusAccepted/In press - 2020

Keywords

  • 3D working pose estimation
  • Artificial neural networks
  • Cameras
  • computer vision
  • deep learning.
  • Internet of Things
  • Pose estimation
  • Three-dimensional displays
  • Training
  • Two dimensional displays
  • worker

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems
  • Hardware and Architecture
  • Computer Science Applications
  • Computer Networks and Communications

Cite this