系统仿真学报 ›› 2021, Vol. 33 ›› Issue (6): 1307-1314.doi: 10.16182/j.issn1004731x.joss.20-0095

• 仿真建模理论与方法 • 上一篇    下一篇

基于生成式对抗网络的发票图像超分辨率研究

李新利, 邹昌铭, 杨国田, 刘禾   

  1. 华北电力大学 控制与计算机工程学院,北京 102206
  • 收稿日期:2020-02-28 修回日期:2020-05-02 出版日期:2021-06-18 发布日期:2021-06-23
  • 作者简介:李新利(1973-),女,博士,副教授,研究方向为模式识别、图像处理、燃烧过程检测技术。E-mail:lixinli@ncepu.edu.cn

Research of Super-resolution Processing of Invoice Image Based on Generative Adversarial Network

Li Xinli, Zou Changming, Yang Guotian, Liu He   

  1. School of Control and Computer Engineering, North China Electric Power University, Beijing, 102206, China
  • Received:2020-02-28 Revised:2020-05-02 Online:2021-06-18 Published:2021-06-23

摘要: 发票自动识别可有效提高财务工作效率。为避免低分辨率的发票图像影响自动识别的准确性,提出了一种用于对发票图像进行超分辨率处理的ESRGAN (Encoder Super-resolution Generative Adversarial Network)网络ESRGAN网络是基于带条件的生成式对抗网络,设计了辅助编码器,引导网络生成更加真实的超分辨率图像。基于实际发票图像,将ESRGAN网络与常规图像处理、SRCNN (Super-resolution Convolutional Neural Networks)网络和SRGAN (Super-resolution Generative Adversarial Network)网络进行对比实验,并通过峰值信噪比(Peak Signal to Noise Ratio, PSNR)和结构相似性(Structural Similarity, SSIM)评价指标进行模型评价。实验结果表明基于ESRGAN超分辨率处理的图像在视觉效果和评价指标上均具有良好的效果。

关键词: 发票图像, 超分辨率, 生成式对抗网络, ESRGAN, 评价指标

Abstract: Automatic identification of invoices can effectively improve financial efficiency. But low-resolution invoice image reduces the accuracy of automatic identification, an ESRGAN (Encoder Super-resolution Generative Adversarial Network) network for super-resolution processing of invoice images is proposed. The ESRGAN network is based on a conditional generative adversarial network. An auxiliary encoder is designed to guide the network to generate a more realistic super-resolution image. Based on the actual invoice image, the ESRGAN network and the conventional image processing, SRCNN (Super-resolution Convolutional Neural Networks) network and SRGAN (Super-resolution Generative Adversarial Network) network. The model is evaluated through two evaluation indicators of peak signal-to-noise ratio (PSNR) and structural similarity (SSIM). The experimental results show that the images processed based on ESRGAN super-resolution are better on visual effects and evaluation indicators.

Key words: invoice image, super-resolution, generative adversarial networks, ESRGAN, evaluation indicator

中图分类号: