Docscanner: document location and enhancement based on image segmentation

Title: Docscanner: document location and enhancement based on image segmentation
Publication Type: Conference Paper
Year of Publication: 2022
Authors: Shan, Ziqi; Wang, Yuying; Wei, Shunzhong; Li, Xiangmin; Pang, Haowen; Zhou, Xinmei
Conference Name: 2022 18th International Conference on Computational Intelligence and Security (CIS)
Date Published: December
Keywords: composability, compositionality, Computational Intelligence, Computational modeling, cryptography, document processing, Document Scanner, Entropy, image segmentation, pubcrawl, security, semantic segmentation
Abstract: Document scanning aims to convert captured photographs of documents into scanned document files. However, current methods based on traditional techniques or key point detection suffer from low detection accuracy. In this paper, we are the first to propose a document processing system based on semantic segmentation. Our system uses OCRNet to segment documents. Perspective transformation and other post-processing algorithms are then applied to the segmentation result to obtain well-scanned documents. We also optimized OCRNet's loss function and reached 97.25 MIoU on the test dataset.
DOI: 10.1109/CIS58238.2022.00028
Citation Key: shan_docscanner_2022
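
The abstract reports segmentation quality as MIoU (mean intersection over union). This record does not describe the paper's evaluation protocol, so the following is only a minimal sketch of how the metric is commonly computed, assuming binary masks (0 = background, 1 = document) stored as nested lists. The function name `mean_iou` and the toy masks are hypothetical, and scaling the result to a 0-100 range is an assumption made to match the style of the reported figure.

```python
def mean_iou(pred, target, num_classes=2):
    """Average the per-class intersection-over-union of two label masks.

    pred and target are equally sized 2D lists of integer class labels.
    Classes absent from both masks are skipped (an assumption; evaluation
    toolkits differ on how they treat empty classes).
    """
    ious = []
    for cls in range(num_classes):
        intersection = 0
        union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                in_pred = p == cls
                in_target = t == cls
                intersection += in_pred and in_target
                union += in_pred or in_target
        if union > 0:
            ious.append(intersection / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: the prediction marks one extra pixel as "document".
target = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
pred = [
    [0, 0, 0, 0],
    [0, 1, 1, 1],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]

score = mean_iou(pred, target)
print(f"MIoU: {100 * score:.2f}")  # prints "MIoU: 85.83"
```

In the toy example the document class has IoU 4/5 and the background class has IoU 11/12, so the mean is about 0.858. Averaging over classes rather than pixels keeps a large background region from dominating the score.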