End-to-end Multi-task Trademark Sub-card Model

doi:10.15888/j.cnki.csa.009210


Volume 32, Issue 8, 2023, pp. 105-115

Author:
ZHANG Zhen-Yan, SU Hai, YU Song-Sen

Affiliation:
School of Software, South China Normal University, Foshan 528225, China


Abstract:

Current trademark sub-card processing first detects text, then classifies the detected regions, and finally splits and recombines the regions to form a trademark sub-card. This step-by-step pipeline is slow, and errors accumulate across stages, which lowers the accuracy of the final result. This study therefore proposes TextCls, a multi-task network model that improves the inference speed and accuracy of the detection and classification modules. TextCls consists of a feature extraction network and two task branches, one for text detection and one for regional classification. The text detection branch uses a segmentation network to learn a pixel classification map, which mainly captures the distinction between text and background pixels, and then applies pixel aggregation to obtain the text boxes. The regional classification branch subdivides regional features into Chinese, English, and graphics, focusing on the characteristics of each type of region. Because the two branches share the feature extraction network, they jointly learn pixel information and regional features, and the precision of both tasks is improved. To make up for the lack of text detection datasets for trademark images and to verify the effectiveness of TextCls, this study collects and labels a text detection dataset, trademark_text (https://github.com/kongbailongtian/trademark_text), which consists of 2000 trademark images. The results show that, compared with the best text detection algorithm tested, the text detection branch of TextCls increases the accuracy rate from 94.44% to 95.16%, and its F1 score (the harmonic mean of precision and recall) reaches 92.12%; the F1 score of the regional classification branch also increases from 97.09% to 98.18%.
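The step from a pixel classification map to text boxes can be illustrated with a minimal sketch. It assumes the map has already been thresholded to 0 (background) and 1 (text), and it uses plain 4-connected component grouping as a simplified stand-in for pixel aggregation; the abstract does not specify the aggregation rule TextCls actually uses. The function name `boxes_from_pixel_map` and the toy input are hypothetical.

```python
from collections import deque


def boxes_from_pixel_map(pixel_map):
    """Group 4-connected text pixels (value 1) into bounding boxes.

    Simplified stand-in for pixel aggregation, not the paper's method.
    Returns (x_min, y_min, x_max, y_max) tuples, one per connected
    region, in the order regions are met in a row-major scan.
    """
    height = len(pixel_map)
    width = len(pixel_map[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if pixel_map[y][x] != 1 or seen[y][x]:
                continue
            # Breadth-first search over one connected text region.
            queue = deque([(x, y)])
            seen[y][x] = True
            x_min = x_max = x
            y_min = y_max = y
            while queue:
                cx, cy = queue.popleft()
                x_min, x_max = min(x_min, cx), max(x_max, cx)
                y_min, y_max = min(y_min, cy), max(y_max, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    inside = 0 <= nx < width and 0 <= ny < height
                    if inside and pixel_map[ny][nx] == 1 and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((x_min, y_min, x_max, y_max))
    return boxes


# Hypothetical 3x5 map containing two separate text regions.
toy_map = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 1, 1],
    [0, 0, 0, 1, 1],
]
print(boxes_from_pixel_map(toy_map))
```

In a multi-task setup like the one described, boxes of this kind would be the regions that a classification branch labels as Chinese, English, or graphics; how TextCls passes regions between its branches is not stated in the abstract.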

Key words: trademark sub-card; end-to-end; text detection; multi-task learning; datasets

Citation:

ZHANG Zhen-Yan, SU Hai, YU Song-Sen. End-to-end Multi-task Trademark Sub-card Model. Computer Systems & Applications, 2023, 32(8): 105-115


History

Received: February 9, 2023
Revised: March 14, 2023
Online: June 9, 2023

Copyright: Institute of Software, Chinese Academy of Sciences