Static hand gesture recognition based on multi-feature weighted fusion is proposed to solve the problems of singularity and omission in convolutional neural network for feature extraction. Firstly, the Fourier and Hu moments of the segmented gesture image are extracted and fused as the local features. Besides, a dual-channel convolutional neural network is designed to extract the deep features of the gesture image, which are further treated by dimensionality reduction by principal component analysis. Secondly, the extracted local and deep features are weighted and fused as effective description for hand gesture recognition. Finally, gesture images are classified with Softmax classifier. Experimental results verify the proposed method, and the recognition accuracy reaches over 99% on the image dataset.
[1] Wang C, Liu Z, Chan SC. Superpixel-based hand gesture recognition with Kinect depth camera. IEEE Transactions on Multimedia, 2015, 17(1): 29–39. [doi: 10.1109/TMM.2014.2374357
[2] Ren Z, Yuan JS, Meng JJ, et al. Robust part-based hand gesture recognition using Kinect sensor. IEEE Transactions on Multimedia, 2013, 15(5): 1110–1120. [doi: 10.1109/TMM.2013.2246148
[3] Hikawa H, Kaida K. Novel FPGA implementation of hand sign recognition system with SOM-Hebb classifier. IEEE Transactions on Circuits and Systems for Video Technology, 2015, 25(1): 153–166. [doi: 10.1109/TCSVT.2014.2335831
[4] Neto P, Pereira D, Pires JN, et al. Real-time and continuous hand gesture spotting: An approach based on artificial neural networks. Proceedings of 2013 IEEE International Conference on Robotics and Automation. Karlsruhe, Germany. 2013. 178–183.
[6] Paulraj MP, Yaacob S, Bin Zanar Azalan MS, et al. A phoneme based sign language recognition system using skin color segmentation. Proceedings of 2010 6th International Colloquium on Signal Processing & its Applications. Mallaca City, Malaysia. 2010. 1–5.
[7] Fernando M, Wijjayanayake J. Novel approach to use HU moments with image processing techniques for real time sign language communication. International Journal of Image Process, 2015, 9(6): 335–345
[9] Pigou L, Dieleman S, Kindermans PJ, et al. Sign language recognition using convolutional neural networks. Proceedings of the European Conference on Computer Vision. Zurich, Switzerland. 2015. 572–578.
[10] Bobić V, Tadić P, Kvaščev G. Hand gesture recognition using neural network based techniques. Proceedings of 2016 13th Symposium on Neural Networks and Applications. Belgrade, Serbia. 2016. 1–4.
[11] Bheda V, Radpour D. Using deep convolutional networks for gesture recognition in American sign language. arXiv preprint arXiv: 1710.06836, 2017.
[12] Rao GA, Kishore PVV, Kumar DA, et al. Neural network classifier for continuous sign language recognition with selfie video. Far East Journal of Electronics and Communications, 2017, 17(1): 49–71. [doi: 10.17654/EC017010049
[13] Chai XJ, Liu ZP, Yin F, et al. Two streams recurrent neural networks for large-scale Continuous gesture recognition. Proceedings of the 23rd International Conference on Pattern Recognition. Cancun, Mexico. 2016. 31–36.
[16] Rahmany I, Nemmala MEA, Khlifa N, et al. Automatic detection of intracranial aneurysm using LBP and Fourier descriptor in angiographic images. International Journal of Computer Assisted Radiology and Surgery, 2019, 14(8): 1353–1364. [doi: 10.1007/s11548-019-01996-0
[17] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. https://arxiv.org/abs/1409.1556. (2014-09-04)[2015-04-10].
[18] Mrówczyńska M, Sztubecki J, Greinert A. Compression of results of geodetic displacement measurements using the PCA method and neural networks. Measurement, 2020, 158: 107693. [doi: 10.1016/j.measurement.2020.107693
[19] Pansare JR, Gawande SH, Ingle M. Real-time static hand gesture recognition for American Sign Language (ASL) in complex background. Journal of Signal and Information Processing, 2012, 3(3): 364–367. [doi: 10.4236/jsip.2012.33047
[21] Pugeault N, Bowden R. Spelling it out: Real-time ASL fingerspelling recognition. Proceedings of 2011 IEEE International Conference on Computer Vision Workshops. Barcelona, Spain. 2011. 1114–1119.
[22] Guo S, Huang WL, Wang LM, et al. Locally supervised deep hybrid model for scene recognition. IEEE Transactions on Image Processing, 2017, 26(2): 808–820. [doi: 10.1109/TIP.2016.2629443