Review on Image-based Wildlife Detection and Recognition
CSTR:
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [95]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    Wildlife monitoring is essential for wildlife conservation and ecosystem maintenance, and wildlife detection and identification is the core technology to achieve monitoring. In recent years, with the rapid development and widespread application of computer vision technology, image-based non-contact methods have attracted extensive attention in the field of wildlife monitoring, and researchers have proposed various methods to solve different problems in this field. However, the complexity of wild environment still poses challenges for accurate detection and identification of wildlife. In order to promote research in this field, the existing image-based wildlife monitoring methods are reviewed in this study, which mainly include three sections: wildlife image acquisition methods, wildlife image preprocessing methods, and wildlife detection and recognition algorithms. These methods are discussed and classified according to the different processing mechanisms of image datasets and wildlife detection and recognition algorithms. Finally, the research hotspots and existing problems of wildlife monitoring based on deep learning are analyzed and summarized, and the prospect for future research priorities is proposed in the study.

    Reference
    [1] 于华. 近50年全球野生动物减少2/3. 生态经济, 2020, 36(11): 5–8.
    [2] 梁成陆. 野生动物保护的意义及举措. 现代农业科技, 2021(10): 184–185.
    [3] 陈梦霄, 王晓玲, 穆丽光, 等. 内蒙古赛罕乌拉国家级自然保护区狍的日活动节律及其季节变化研究. 四川动物, 2019, 38(2): 214–219.
    [4] 何金权. 广东东源康禾自然保护区陆生野生动物资源现状及保护对策. 内蒙古林业调查设计, 2018, 41(5): 33–35.
    [5] Allan BM, Arnould JPY, Martin JK, et al. A cost-effective and informative method of GPS tracking wildlife. Wildlife Research, 2013, 40(5): 345–348.
    [6] Xu WT, Zhang X, Yao LN, et al. A multi-view CNN-based acoustic classification system for automatic animal species identification. Ad Hoc Networks, 2020, 102: 102115.
    [7] Steen KA, Villa-Henriksen A, Therkildsen OR, et al. Automatic detection of animals in mowing operations using thermal cameras. Sensors, 2012, 12(6): 7587–7597.
    [8] Cilulko J, Janiszewski P, Bogdaszewski M, et al. Infrared thermal imaging in studies of wild animals. European Journal of Wildlife Research, 2013, 59(1): 17–23.
    [9] Pavlíček J, Jarolímek J, Jarolímek J, et al. Automated wildlife recognition. AGRIS On-line Papers in Economics and Informatics, 2018, 10(1): 51–60.
    [10] Seymour AC, Dale J, Hammill M, et al. Automated detection and enumeration of marine wildlife using unmanned aircraft systems (UAS) and thermal imagery. Scientific Reports, 2017, 7(1): 45127.
    [11] Chamoso P, Raveane W, Parra V, et al. UAVs applied to the counting and monitoring of animals. Proceedings of the 5th International Symposium on Ambient Intelligence. Springer, 2014. 71–80.
    [12] 程跃红, 张卫东, 彭晓辉, 等. 卧龙国家级自然保护区热水河温泉周边有蹄类野生动物红外相机监测初报. 农业与技术, 2016, 36(1): 178–180.
    [13] 刘雪华, 武鹏峰, 何祥博, 等. 红外相机技术在物种监测中的应用及数据挖掘. 生物多样性, 2018, 26(8): 850–861.
    [14] 陈善安, 胡春鹤, 王远, 等. 基于无线传感器网络的野生动物图像监测系统设计. 现代制造技术与装备, 2017(3): 64–66.
    [15] Xie JJ, Li AQ, Zhang JG, et al. An integrated wildlife recognition model based on multi-branch aggregation and squeeze-and-excitation network. Applied Sciences, 2019, 9(14): 2794.
    [16] Zhao P, Liu SM, Zhou Y, et al. Estimating animal population size with very high-resolution satellite imagery. Conservation Biology, 2021, 35(1): 316–324.
    [17] 肖文宏, 周青松, 朱朝东, 等. 野生动物监测技术和方法应用进展与展望. 植物生态学报, 2020, 44(4): 409–417.
    [18] Linchant J, Lisein J, Semeki J, et al. Are unmanned aircraft systems (UASs) the future of wildlife monitoring? A review of accomplishments and challenges. Mammal Review, 2015, 45(4): 239–252.
    [19] Swanson A, Kosmala M, Lintott C, et al. Snapshot serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna. Scientific Data, 2015, 2: 150026.
    [20] Villa AG, Salazar A, Vargas F. Towards automatic wild animal monitoring: Identification of animal species in camera-trap images using very deep convolutional neural networks. Ecological Informatics, 2017, 41: 24–32.
    [21] 杜科. 基于Attention-CNN-GRU的野生动物监测图像分类[硕士学位论文]. 北京: 北京林业大学, 2019. 1–70.
    [22] Timm M, Maji S, Fuller T. Large-scale ecological analyses of animals in the wild using computer vision. Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Salt Lake City: IEEE, 2018. 1977–19772.
    [23] Trammell MJ, Oberoi P, Egenrieder J, et al. Contextual label smoothing with a phylogenetic tree on the iNaturalist 2018 challenge dataset. Journal of the Washington Academy of Sciences, 2019, 105(1): 23–46.
    [24] Wah C, Branson S, Welinder P, et al. The caltech-UCSD birds-200-2011 dataset. Technical Report, Pasadena: California Institute of Technology, 2011.
    [25] Chen RL, Little R, Mihaylova L, et al. Wildlife surveillance using deep learning methods. Ecology and Evolution, 2019, 9(17): 9453–9466.
    [26] Rey N, Volpi M, Joost S, et al. Detecting animals in African savanna with UAVs and the crowds. Remote Sensing of Environment, 2017, 200: 341–351.
    [27] 陈建促, 王越, 朱小飞, 等. 融合多特征图的野生动物视频目标检测方法. 计算机工程与应用, 2020, 56(7): 221–227.
    [28] Chen K, Song H, Loy CC, et al. Discover and learn new objects from documentaries. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu: IEEE, 2017. 1111–1120.
    [29] Lampert CH, Nickisch H, Harmeling S. Attribute-based classification for zero-shot visual object categorization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36(3): 453–465.
    [30] Xian YQ, Lampert CH, Schiele B, et al. Zero-shot learning—A comprehensive evaluation of the good, the bad and the ugly. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019, 41(9): 2251–2265.
    [31] Favorskaya M, Pakhirka A. Animal species recognition in the wildlife based on muzzle and shape features using joint CNN. Procedia Computer Science, 2019, 159: 933–942.
    [32] Tabak MA, Norouzzadeh MS, Wolfson DW, et al. Machine learning to classify animal species in camera trap images: Applications in ecology. Methods in Ecology and Evolution, 2019, 10(4): 585–590.
    [33] Nguyen H, Maclagan SJ, Nguyen TD, et al. Animal recognition and identification with deep convolutional neural networks for automated wildlife monitoring. Proceedings of the 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA). Tokyo: IEEE, 2017. 40–49.
    [34] Song YL, Wang HP, Li S, et al. CNN based wildlife recognition with super-pixel segmentation for ecological surveillance. Proceedings of the 8th IEEE Annual International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER). Tianjin: IEEE, 2018. 132–137.
    [35] Ferreira BA, De Villiers JP, De Freitas A. Dashcam based wildlife detection and classification using fused data sets of digital photographic and simulated imagery. Proceedings of the 23rd IEEE International Conference on Information Fusion (FUSION). Rustenburg: IEEE, 2020. 1–8.
    [36] Okafor E, Smit R, Schomaker L, et al. Operational data augmentation in classifying single aerial images of animals. Proceedings of the 2017 IEEE International Conference on Innovations in Intelligent Systems and Applications (INISTA). Gdynia: IEEE, 2017. 354–360.
    [37] 曾陈颖. 面向珍稀野生动物保护的图像监测与识别技术研究[硕士学位论文]. 北京: 北京林业大学, 2015. 1–69.
    [38] Hong NM, Thanh NC. A single image dehazing method based on adaptive gamma correction. Proceedings of the 6th NAFOSTED Conference on Information and Computer Science (NICS). Hanoi: IEEE, 2019. 558–562.
    [39] 张军国, 程浙安, 胡春鹤, 等. 野生动物监测光照自适应Retinex图像增强算法. 农业工程学报, 2018, 34(15): 183–189.
    [40] Qiao BW, Zhou ZF, Yang HT, et al. Bird species recognition based on SVM classifier and decision tree. Proceedings of the 1st International Conference on Electronics Instrumentation & Information Systems (EIIS). Harbin: IEEE, 2017. 1–4.
    [41] Noviyanto A, Arymurthy AM. Beef cattle identification based on muzzle pattern using a matching refinement technique in the SIFT method. Computers and Electronics in Agriculture, 2013, 99: 77–84.
    [42] Yu XY, Wang JP, Kays R, et al. Automated identification of animal species in camera trap images. Eurasip Journal on Image and Video Processing, 2013, 2013(1): 52.
    [43] Uddin MS, Akhi AY. Horse detection using Haar like features. International Journal of Computer Theory and Engineering, 2016, 8(5): 415–418.
    [44] Alsahwa B, Maussang F, Garello R, et al. Marine life airborne observation using HOG and SVM classifier. Proceedings of the OCEANS 2016 MTS/IEEE Monterey. Monterey: IEEE, 2016. 1–5.
    [45] Gonzalez LF, Montes GA, Puig E, et al. Unmanned aerial vehicles (UAVs) and artificial intelligence revolutionizing wildlife monitoring and conservation. Sensors, 2016, 16(1): 97.
    [46] Christiansen P, Steen KA, Jørgensen RN, et al. Automated detection and recognition of wildlife using thermal cameras. Sensors, 2014, 14(8): 13778–13793.
    [47] Kamencay P, Trnovszky T, Benco M, et al. Accurate wild animal recognition using PCA, LDA and LBPH. Proceedings of the 2016 ELEKTRO. Strbske Pleso: IEEE, 2016. 62–67.
    [48] Feng WZ, Zhang JG, Hu CH, et al. A novel saliency detection method for wild animal monitoring images with WMSN. Journal of Sensors, 2018, 2018: 3238140.
    [49] Okafor E, Pawara P, Karaaba F, et al. Comparative study between deep learning and bag of visual words for wild-animal recognition. Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence (SSCI). Athens: IEEE, 2016. 1–8.
    [50] Norouzzadeh MS, Nguyen A, Kosmala M, et al. Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning. Proceedings of the National Academy of Sciences of the United States of America, 2017, 115(25): E5716–E5725.
    [51] Tang YX, Wang J, Gao BY, et al. Large scale semi-supervised object detection using visual and semantic knowledge transfer. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas: IEEE, 2016. 2119–2128.
    [52] Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4): 640–651.
    [53] Ronneberger O, Fischer P, Brox T. U-Net: Convolutional networks for biomedical image segmentation. Proceedings of the 18th International Conference on Medical Image Computing and Computer-assisted Intervention. Munich: Springer, 2015. 234–241.
    [54] Badrinarayanan V, Kendall A, Cipolla R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481–2495.
    [55] Parham J, Stewart C, Crall J, et al. An animal detection pipeline for identification. Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV). Lake Tahoe: IEEE, 2018. 1075–1083.
    [56] Feng WZ, Hu CH, Wang Y, et al. A novel hierarchical coding progressive transmission method for WMSN wildlife images. Sensors, 2019, 19(4): 946.
    [57] 宫一男, 谭孟雨, 王震, 等. 基于深度学习的红外相机动物影像人工智能识别: 以东北虎豹国家公园为例. 兽类学报, 2019, 39(4): 458–465.
    [58] 刘文定, 李安琪, 张军国, 等. 基于ROI-CNN的赛罕乌拉国家级自然保护区陆生野生动物自动识别. 北京林业大学学报, 2018, 40(8): 123–131.
    [59] 蔡前舟, 郑伯川, 曾祥银, 等. 结合长尾数据解决方法的野生动物目标检测. 计算机应用, 2022, 42(4): 1284–1291.
    [60] 王越, 陈建促, 朱小飞, 等. 基于熵和运动偏移量的野生动物遮挡检测方法. 计算机应用研究, 2019, 36(12): 3869–3873, 3877.
    [61] Kellenberger B, Marcos D, Tuia D. Detecting mammals in UAV images: Best practices to address a substantially imbalanced dataset with deep learning. Remote Sensing of Environment, 2018, 216: 139–153.
    [62] Ren XB, Han TX, He ZH. Ensemble video object cut in highly dynamic scenes. Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition. Portland: IEEE, 2013. 1947–1954.
    [63] Kellenberger B, Marcos D, Lobry S, et al. Half a percent of labels is enough: Efficient animal detection in UAV imagery using deep CNNs and active learning. IEEE Transactions on Geoscience and Remote Sensing, 2019, 57(12): 9524–9533.
    [64] Li Y, Tong GF, Gao HS, et al. Pano-RSOD: A dataset and benchmark for panoramic road scene object detection. Electronics, 2019, 8(3): 329.
    [65] Tong GF, Chen HR, Li Y, et al. Object detection for panoramic images based on MS-RPN structure in traffic road scenes. IET Computer Vision, 2019, 13(5): 500–506.
    [66] Real E, Aggarwal A, Huang YP, et al. Regularized evolution for image classifier architecture search. Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Honolulu: AAAI, 2019. 4780–4789.
    [67] Howard AG, Zhu ML, Chen B, et al. MobileNets: Efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861, 2017.
    [68] Huang G, Liu Z, van der MaatenL, et al. Densely connected convolutional networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017. 2261–2269.
    [69] Zhang XY, Zhou XY, Lin MX, et al. ShuffleNet: An extremely efficient convolutional neural network for mobile devices. Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018. 6848–6856.
    [70] Chollet F. Xception: Deep learning with depthwise separable convolutions. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017. 1800–1807.
    [71] Ren SQ, He KM, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137–1149.
    [72] Dai JF, Li Y, He KM, et al. R-FCN: Object detection via region-based fully convolutional networks. Proceedings of the 30th International Conference on Neural Information Processing Systems. Barcelona: Curran Associates Inc., 2016. 379–387.
    [73] He KM, Gkioxari G, Dollár P, et al. Mask R-CNN. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(2): 386–397.
    [74] Wang XL, Cheng P, Liu XC, et al. Focal loss dense detector for vehicle surveillance. Proceedings of the 2018 International Conference on Intelligent Systems and Computer Vision (ISCV). Fez: IEEE, 2018. 1–5.
    [75] Jeong J, Park H, Kwak N. Enhancement of SSD by concatenating feature maps for object detection. Proceedings of the 2017 British Machine Vision Conference. London: BMVC, 2017.
    [76] Lin TY, Goyal P, Girshick R, et al. Focal loss for dense object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(2): 318–327.
    [77] Zhao QJ, Sheng T, Wang YT, et al. M2Det: A single-shot object detector based on multi-level feature pyramid network. Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Honolulu: AAAI, 2019. 9259–9266.
    [78] Zhang SF, Wen LY, Bian X, et al. Single-shot refinement neural network for object detection. Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018. 4203–4212.
    [79] Li YH, Chen YT, Wang NY, et al. Scale-aware trident networks for object detection. Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV). Seoul: IEEE, 2020. 6053–6062.
    [80] Fu CY, Liu W, Ranga A, et al. DSSD: Deconvolutional single shot detector. arXiv:1701.06659, 2017.
    [81] Shen ZQ, Liu Z, Li JG, et al. DSOD: Learning deeply supervised object detectors from scratch. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV). Venice: IEEE, 2017. 1937–1945.
    [82] Akata Z, Reed S, Walter D, et al. Evaluation of output embeddings for fine-grained image classification. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston: IEEE, 2015. 2927–2936.
    [83] Jiang HJ, Wang RP, Shan SG, et al. Transferable contrastive network for generalized zero-shot learning. Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV). Seoul: IEEE, 2019. 9764–9773.
    [84] Xu BR, Zeng ZG, Lian C, et al. Semi-supervised low-rank semantics grouping for zero-shot learning. IEEE Transactions on Image Processing, 2021, 30: 2207–2219.
    [85] Chao WL, Changpinyo S, Gong BQ, et al. An empirical study and analysis of generalized zero-shot learning for object recognition in the wild. Proceedings of the 14th European Conference on Computer Vision. Amsterdam: Springer, 2016. 52–68.
    [86] Li J, Lan XG, Liu Y, et al. Compressing unknown images with product quantizer for efficient zero-shot classification. Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach: IEEE, 2019. 5458–5467.
    [87] Akata Z, Perronnin F, Harchaoui Z, et al. Label-embedding for attribute-based classification. Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition. Portland: IEEE, 2013. 819–826.
    [88] Xian YQ, Lorenz T, Schiele B, et al. Feature generating networks for zero-shot learning. Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018. 5542–5551.
    [89] Zhang HF, Long Y, Guan Y, et al. Triple verification network for generalized zero-shot learning. IEEE Transactions on Image Processing, 2019, 28(1): 506–517.
    [90] Shao J, Li XR. Generalized zero-shot learning with multi-channel Gaussian mixture VAE. IEEE Signal Processing Letters, 2020, 27: 456–460.
    [91] Yu YL, Ji Z, Han JG, et al. Episode-based prototype generating network for zero-shot learning. Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2020. 14032–14041.
    [92] Chen XY, Lan XG, Sun FC, et al. A boundary based out-of-distribution classifier for generalized zero-shot learning. Proceedings of the 16th European Conference on Computer Vision. Glasgow: Springer, 2020. 572–588.
    [93] Huang S, Lin JK, Huangfu LW. Class-prototype discriminative network for generalized zero-shot learning. IEEE Signal Processing Letters, 2020, 27: 301–305.
    [94] Ding JY, Hu X, Zhong XR. A semantic encoding out-of-distribution classifier for generalized zero-shot learning. IEEE Signal Processing Letters, 2021, 28: 1395–1399.
    [95] Zhou FT, Huang S, Xing Y. Deep semantic dictionary learning for multi-label image classification. Proceedings of the 35th AAAI Conference on Artificial Intelligence. AAAI, 2021. 3572–3580.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

柯澳,王宇聪,胡博宇,林琦,李勇,双丰.基于图像的野生动物检测与识别综述.计算机系统应用,2024,33(1):22-36

Copy
Share
Article Metrics
  • Abstract:1685
  • PDF: 6223
  • HTML: 3435
  • Cited by: 0
History
  • Received:July 24,2023
  • Revised:August 21,2023
  • Online: November 24,2023
  • Published: January 05,2023
Article QR Code
You are the first991229Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063