本文已被:浏览 3次 下载 232次
Received:June 03, 2024 Revised:June 28, 2024
Received:June 03, 2024 Revised:June 28, 2024
中文摘要: 卷烟激光喷码识别是烟草稽查工作的重要手段. 本文提出一种基于双态非对称网络的烟码识别方法, 针对畸变烟码训练样本不足导致模型泛化能力弱的问题, 设计非线性局部增强方法(nonlinear local augmentation, NLA), 通过在烟码图像边缘设置可控基准点进行空间变换, 生成有效畸变训练样本以增强模型泛化能力; 针对烟码与背景图案特征相似导致识别精度低的问题, 提出双态非对称网络(dual-state asymmetric network, DSANet), 将CRNN的卷积层划分为训练模式和部署模式, 训练模式通过引入非对称卷积优化特征权重分布, 增强模型关键特征提取能力; 为保证实时性, 部署模式设计BN融合和分支融合方法, 通过计算融合权重并初始化卷积核, 将卷积层等效转换回原始网络结构, 降低用户端推理时间; 最后, 在循环层中引入自注意力机制, 通过动态调整序列特征权重, 进一步加强模型对烟码特征的提取能力. 通过对比实验, 该方法具有更高的识别精度和速度, 其识别精度达到87.34%.
Abstract:Cigarette laser code recognition is an important tool for tobacco inspection. This study proposes a method for recognizing cigarette codes based on a dual-state asymmetric network. Insufficient training on samples of distorted cigarette codes leads to the weak generalization ability of the model. To address this issue, a nonlinear local augmentation (NLA) method is designed, which generates effective training samples with distortion to enhance the generalization ability of the model through spatial transformation using controllable datums at the edges of cigarette codes. To address the problem of low recognition accuracy due to the similarity between cigarette codes and their background patterns, a dual-state asymmetric network (DSANet) is proposed, which divides the convolutional layers of the CRNN into training and deployment modes. The training mode enhances the key feature extraction capability of the model by introducing asymmetric convolution for optimizing feature weight distribution. For real-time performance, the deployment mode designs BN fusion and branch fusion methods. By calculating fusion weights and initializing convolutional kernels, convolutional layers are equivalently converted back to their original structures, which reduces user-side inference time. Finally, a self-attention mechanism is introduced into the loop layer to enhance the extraction capability of the model for cigarette code features by dynamically adjusting the weights of sequence features. Comparative experiments show that this method has higher recognition accuracy and speed, with the recognition accuracy reaching 87.34%.
keywords: cigarette laser code data augmentation text recognition asymmetric convolution attention mechanism
文章编号: 中图分类号: 文献标志码:
基金项目:陕西省烟草公司咸阳公司科技项目(2022610425240008)
引用文本:
梁尚荣,王慧琴,马琦,王可,文钰栋.基于双态非对称网络的卷烟激光码识别.计算机系统应用,,():1-12
LIANG Shang-Rong,WANG Hui-Qin,MA Qi,WANG Ke,WEN Yu-Dong.Cigarette Laser Code Recognition Based on Dual-state Asymmetric Network.COMPUTER SYSTEMS APPLICATIONS,,():1-12
梁尚荣,王慧琴,马琦,王可,文钰栋.基于双态非对称网络的卷烟激光码识别.计算机系统应用,,():1-12
LIANG Shang-Rong,WANG Hui-Qin,MA Qi,WANG Ke,WEN Yu-Dong.Cigarette Laser Code Recognition Based on Dual-state Asymmetric Network.COMPUTER SYSTEMS APPLICATIONS,,():1-12