Journal of Software, Vol 7, No 7 (2012), 1577-1584, Jul 2012
doi:10.4304/jsw.7.7.1577-1584

Human Action Recognition algorithm based on Minimum Spanning Tree of CPA Models

Yi Ouyang, Jianguo Xing

Abstract


Human pose recognition algorithm for monocular video was proposed to model human part parameters using video features combination with 3D motion capture data. Firstly Constant Part Appearance (CPA) Models and three-dimensional motion data projection constraint graph structure was defined. To simplify the reasoning process, a constraint graph of the spanning tree construction algorithm and the balancing algorithm were proposed. Combination with the proposed function mechanism, spanning tree of constraint graph and Metropolis-Hastings method, human motion under monocular video can be tracking and recognition, and inferring the 3D motion parameters. By using data-driven (Markov chain Monte Carlo MCMC) and constrain map, human motion limb recognition algorithm is proposed, and the method can be applied to data-driven online human behavior recognition. Experimental results show that the proposed method can recognize human motion action automatically and accurately in monocular video. 


Keywords


human action; Markov chain; belief propagation; 3D human action estimation

References


 

[1] Agarwal, A.; Triggs. Recovering. 3D human pose from monocular images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006, 28(1): 44 - 58 .
http://dx.doi.org/10.1109/TPAMI.2006.21
PMid:16402618

[2] C. Sminchisescu, A. Kanaujia, and D. Metaxas. Learning joint top-down and bottom-up processes for 3d visual inference, EEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), 2006: 1743-1752

[3] LEE, M.W. and R. NEVATIA. Human pose tracking in monocular sequence using multilevel structured models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 31(1). 27-38.

[4] Weiwei Guo Patras, I. Queen Mary. Discriminative 3D human pose estimation from monocular images via topological preserving hierarchical affinity clustering, Computer Vision Workshops (ICCV Workshops), 2009 IEEE 12th International Conference on, 2009: 9 -15.

[5] Ankur Agarwal, Bill Triggs. 3d human pose from silhouettes by relevance vector regression, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'04), 2004(2):882-888.

[6] A. Elgammal and C.S.Lee. Inferring 3d body pose from silhouettes using activity manifold learning, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'04),2004 (2):681-688

[7] Lv F, Nevatia R. Single view human action recognition using key pose matching and viterbi path searching, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'04),2007(4): 1−8 .

[8] P. Felzenszwalb, D. McAllester, D. Ramaman. A discriminatively trained, multiscale, deformable part model, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'08),2008:1-8.
PMid:18631853

[9] P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan. Object detection with discriminatively trained part-based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009,32(9):1627-45.
http://dx.doi.org/10.1109/TPAMI.2009.167
PMid:20634557

[10] M. Lee and I. Cohen. Proposal maps driven mcmc for estimating human body pose in static images, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'04),2004: 334-341

[11] M. Lee and R. Nevatia. Dynamic human pose estimation using markov chain monte carlo approach motion, IEEE Workshop on Motion and Video Computing, 2005. WACV/MOTIONS'05, 2005(2):168-175.

[12] WANG, Y., N.L. ZHANG, and T. CHEN. Latent tree models and approximate inference in Bayesian networks . Journal of Artificial Intelligence Research, 2008,32(1). 879-900

[13] D. Ramanan, D.A. Forsyth, and A. Zisserman. Strike a pose:tracking people by finding stylized poses, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'04),2004 (1):271-278.

[14] Z.W. Tu and S.C. Zhu, Image segmentation by data-driven markov chain monte carlo, IEEE Trans. Pattern Analysis and Machine Intelligence, 2002(24)5:657-672.

[15] T. Zhao and R. Nevatia. Tracking multiple humans in crowded environment, Proceedings of. IEEE Conf. Computer Vision and Pattern Recognition, 2004 (2):406-413.

[16] S. Zhu, R. Zhang, and Z. Tu. Integrating bottom-up/top-down for object recognition by data driven markov chain monte carlo, Proceedings of. IEEE Conf. Computer Vision and Pattern Recognition,2000(1): 738-745.

[17] EVERINGHAM, M. AND VAN~GOOL, L. AND WILLIAMS, C. K. I. AND WINN, J. AND ZISSERMAN, EB/OL.2008.A.The PASCAL VOC2008 Results.

[18] CMU,Cmu motion capture library.EB/OL.2007 http:,mocap.cs.cmu.edu.


Full Text: PDF


Journal of Software (JSW, ISSN 1796-217X)

Copyright @ 2006-2013 by ACADEMY PUBLISHER – All rights reserved.