Multi-view Stereo Depth Perception for Embedded Platform

doi:10.15888/j.cnki.csa.009078

Volume 32, Issue 5, 2023, pp. 105-111


Author:
SHAN Bing
School of Mechanical Engineering, Nanjing University of Science & Technology, Nanjing 210094, China; Suzhou Institute of Nano-tech and Nano-bionics (SINANO), Chinese Academy of Sciences, Suzhou 215123, China; Key Laboratory of Multifunctional Nanomaterials and Smart Systems, Chinese Academy of Sciences, Suzhou 215123, China
HU Yi-Min
Suzhou Institute of Nano-tech and Nano-bionics (SINANO), Chinese Academy of Sciences, Suzhou 215123, China; Key Laboratory of Multifunctional Nanomaterials and Smart Systems, Chinese Academy of Sciences, Suzhou 215123, China
ZHANG Long
School of Mechanical Engineering, Nanjing University of Science & Technology, Nanjing 210094, China
LI Jia-Dong
Suzhou Institute of Nano-tech and Nano-bionics (SINANO), Chinese Academy of Sciences, Suzhou 215123, China; Key Laboratory of Multifunctional Nanomaterials and Smart Systems, Chinese Academy of Sciences, Suzhou 215123, China

                    
Abstract:

Current neural-network-based multi-view stereo (MVS) depth estimation algorithms have large parameter counts and high memory consumption, which makes them hard to deploy on embedded platforms with limited computing power. This study therefore proposes Mobile-MVS2D, an MVS depth perception network based on the MVS2D epipolar attention mechanism and MobileNetV3-Small. The network has an encoder-decoder structure and uses MobileNetV3-Small as the encoder for feature extraction. The epipolar attention mechanism couples scale information across the feature layers of the source image and the reference image, and SE-Net modules and skip connections are introduced in the decoding stage to enrich decoded feature detail and improve prediction accuracy. Experimental results show that the model achieves high accuracy on the depth map evaluation metrics of the ScanNet dataset. Combined with visual SLAM, the model produces more accurate 3D reconstructions and shows excellent robustness. On the Jetson Xavier NX, inference on a 640×480 image group at Float16 precision takes only 0.17 s and consumes only 1 GB of GPU memory, so the model meets the needs of embedded platforms with limited computing power.
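
The abstract names SE-Net as one of the decoder components. As a rough illustration of what a squeeze-and-excitation gate computes, here is a minimal pure-Python sketch. It is not the authors' implementation: the function name `squeeze_excite`, the weight layout, the plain sigmoid gate, and the toy values are all assumptions made for this example.

```python
import math


def squeeze_excite(features, w1, b1, w2, b2):
    """Apply a squeeze-and-excitation gate to a C x H x W feature map.

    features: list of C channels, each a list of H rows of W floats.
    w1, b1: reduction layer (C -> R); w2, b2: expansion layer (R -> C).
    All weights are illustrative placeholders, not trained values.
    """
    # Squeeze: global average pooling gives one scalar per channel.
    squeezed = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
        for ch in features
    ]
    # Excitation, step 1: fully connected reduction followed by ReLU.
    hidden = [
        max(0.0, sum(w * s for w, s in zip(w_row, squeezed)) + b)
        for w_row, b in zip(w1, b1)
    ]
    # Excitation, step 2: fully connected expansion followed by a sigmoid.
    gates = [
        1.0 / (1.0 + math.exp(-(sum(w * h for w, h in zip(w_row, hidden)) + b)))
        for w_row, b in zip(w2, b2)
    ]
    # Scale: reweight every channel by its gate, a value in (0, 1).
    scaled = [
        [[g * v for v in row] for row in ch]
        for ch, g in zip(features, gates)
    ]
    return scaled, gates


# Toy input (hypothetical): two 2x2 channels and a one-unit bottleneck.
toy_features = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[3.0, 3.0], [3.0, 3.0]],
]
toy_scaled, toy_gates = squeeze_excite(
    toy_features,
    w1=[[1.0, 1.0]], b1=[0.0],
    w2=[[0.0], [1.0]], b2=[0.0, 0.0],
)
```

In a decoder, a gate like this lets the network emphasize informative channels after features from a skip connection are merged. How Mobile-MVS2D wires it in is not specified in the abstract.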

Key words: multi-view stereo (MVS); embedded; attention mechanism; 3D reconstruction

Get Citation

SHAN Bing, HU Yi-Min, ZHANG Long, LI Jia-Dong. Multi-view Stereo Depth Perception for Embedded Platform. Computer Systems & Applications, 2023, 32(5): 105-111

History

Received: September 27, 2022
Revised: October 27, 2022
Online: March 17, 2023
