Tracking the full body of a human from intensity images is a challenging task because appearance-based features vary widely with illumination and pose. To overcome this problem, the authors propose a graph-based approach using depth images for pose-invariant human body tracking.
The authors build a human skeleton for each frame of a combined depth and RGB (RGB-D) video. First, the human body is segmented from the depth map and represented as a 3D point cloud. The points form the vertices of a graph whose edges connect spatially neighboring points. From this graph, a geodesic distance map is computed from the body center to all other body points (see the sketch below). Landmarks such as the head, knees, and shoulders are detected from this map, and a skeleton is then fitted to the detected landmarks. Because some body parts may be occluded in the depth map, the authors predict the positions of those parts using motion estimation based on optical flow computed from the RGB frames.
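The paper itself gives no code; as a rough illustration of the geodesic-map step, the following Python sketch builds a k-nearest-neighbor graph over the body point cloud and runs Dijkstra's algorithm from the body center. The choice of k, the random stand-in point cloud, and the simple farthest-point landmark heuristic are all illustrative assumptions, not the authors' exact method.

```python
import numpy as np
from scipy.spatial import cKDTree
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import dijkstra

def build_body_graph(points, k=8):
    """Connect each 3D point to its k nearest spatial neighbors,
    weighting edges by Euclidean distance."""
    tree = cKDTree(points)
    dists, idxs = tree.query(points, k=k + 1)  # column 0 is the point itself
    n = len(points)
    rows = np.repeat(np.arange(n), k)
    return csr_matrix((dists[:, 1:].ravel(), (rows, idxs[:, 1:].ravel())),
                      shape=(n, n))

def geodesic_distance_map(points, center_idx, k=8):
    """Geodesic distance from the body center to every other point,
    computed as shortest paths on the neighborhood graph."""
    graph = build_body_graph(points, k)
    return dijkstra(graph, directed=False, indices=center_idx)

# Extremities (head, hands, feet) tend to be the points farthest from the
# body center along the surface, so the largest geodesic distances give
# landmark candidates.
points = np.random.rand(2000, 3)  # stand-in for a segmented body point cloud
center = np.argmin(np.linalg.norm(points - points.mean(axis=0), axis=1))
dist_map = geodesic_distance_map(points, center)
dist_map[~np.isfinite(dist_map)] = -np.inf  # ignore disconnected stray points
landmark_candidates = np.argsort(dist_map)[-20:]
```

The appeal of geodesic rather than Euclidean distance is that surface distances from the torso to the extremities stay roughly constant as the pose changes, which is what makes the landmark detection pose-invariant.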
The proposed algorithm was tested on depth data captured by a time-of-flight (ToF) camera as well as a Kinect sensor. The authors found the Kinect data to be more stable than the ToF data, which is encouraging for researchers who use the Kinect for its low cost. However, the algorithm is computationally expensive and fails when the subject rotates fully about the vertical axis.
Overall, this is a solid marker-free approach to human skeleton tracking. The strategy can be applied in areas such as surveillance, virtual reality, and medical diagnostics. Researchers working on depth-based human pose estimation and activity tracking would find this paper particularly informative.