Visible to the public Deeply Multi-channel guided Fusion Mechanism for Natural Scene Text Detection

TitleDeeply Multi-channel guided Fusion Mechanism for Natural Scene Text Detection
Publication TypeConference Paper
Year of Publication2021
AuthorsZhang, Xiangyu, Yang, Jianfeng, Li, Xiumei, Liu, Minghao, Kang, Ruichun, Wang, Runmin
Conference Name2021 7th International Conference on Big Data and Information Analytics (BigDIA)
KeywordsBig Data, composability, convolutional neural network, convolutional neural networks, diversity reception, feature extraction, Fully Convolutional Networks, Human Behavior, Metrics, Natural scene text detection, Neural networks, pubcrawl, Scalability, semantic segmentation, Semantics, text analytics
AbstractScene text detection methods have developed greatly in the past few years. However, due to the limitation of the diversity of the text background of natural scene, the previous methods often failed when detecting more complicated text instances (e.g., super-long text and arbitrarily shaped text). In this paper, a text detection method based on multi -channel bounding box fusion is designed to address the problem. Firstly, the convolutional neural network is used as the basic network for feature extraction, including shallow text feature map and deep semantic text feature map. Secondly, the whole convolutional network is used for upsampling of feature map and fusion of feature map at each layer, so as to obtain pixel-level text and non-text classification results. Then, two independent text detection boxes channels are designed: the boundary box regression channel and get the bounding box directly on the score map channel. Finally, the result is obtained by combining multi-channel boundary box fusion mechanism with the detection box of the two channels. Experiments on ICDAR2013 and ICDAR2015 demonstrate that the proposed method achieves competitive results in scene text detection.
DOI10.1109/BigDIA53151.2021.9619703
Citation Keyzhang_deeply_2021