Bill image layout analysis method and device
A bill image layout analysis technology, applied in image data processing, graphics and image conversion, and image stitching, that addresses problems such as high labor costs, numerous hand-written rules, and complex upgrades and maintenance, with the effect of reducing workload and improving accuracy and operating efficiency.
Examples
Embodiment 1
[0023] This embodiment provides specific implementation steps of a bill image layout analysis method. As an optional configuration, the hardware and software platform used in this embodiment includes a server with a 3.0 GHz central processing unit, an Nvidia 1080 GPU, and 16 GB of memory, together with an end-to-end OCR layout analysis program written in advance in Python. This program can recognize the text boxes in a bill image, along with the position and content of the text.
[0024] As shown in Figure 1, the software used in this embodiment implements a bill image layout analysis method that includes the following steps:
[0025] S1: Prepare the training layout samples for model training and label them manually. The training layout samples can first be analyzed by an end-to-end OCR layout analysis program and then labeled manually, or they can be labeled manually from scratch. Preferably, step S1 further includes: adopting a data augme...
Embodiment 2
[0034] This embodiment provides a specific implementation of a bill image layout analysis device, based on the bill image layout analysis method described in Embodiment 1.
[0035] As shown in Figure 2, the bill image layout analysis device includes:
[0036] The training layout sample labeling module is used to label the training samples. Preferably, the training layout sample labeling module is also used to apply a data augmentation strategy to the training layout samples, wherein the data augmentation strategy includes one or more of the following methods: 1) randomly perturb the coordinate points of the detection frames in the training layout sample; 2) randomly discard one or more detection frames in the training layout sample; 3) randomly cut a detection frame and split the text in the detection frame accordingly; 4) randomly replace the text content in a detection frame;
[0037] The text box feature encoding module i...
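The four data augmentation methods listed for the training layout sample labeling module can be illustrated with a minimal Python sketch. This is not the patent's implementation: the `Box` type, the function names, the shift range, the drop probability, and the even-character-spacing rule used to place the cut are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class Box:
    """Hypothetical detection frame: axis-aligned corners plus recognized text."""
    x1: float
    y1: float
    x2: float
    y2: float
    text: str


def perturb(box: Box, rng: random.Random, max_shift: float = 2.0) -> Box:
    """1) Randomly perturb the coordinate points of a detection frame."""
    def shift() -> float:
        return rng.uniform(-max_shift, max_shift)
    return replace(box, x1=box.x1 + shift(), y1=box.y1 + shift(),
                   x2=box.x2 + shift(), y2=box.y2 + shift())


def drop(boxes: List[Box], rng: random.Random, p: float = 0.1) -> List[Box]:
    """2) Randomly discard frames, keeping at least one if any were given."""
    if not boxes:
        return []
    kept = [b for b in boxes if rng.random() >= p]
    return kept or [rng.choice(boxes)]


def split(box: Box, rng: random.Random) -> List[Box]:
    """3) Cut a frame at a random character position and split its text."""
    if len(box.text) < 2:
        return [box]
    k = rng.randint(1, len(box.text) - 1)
    # Assumption: characters are evenly spaced, so the cut is proportional to k.
    cut = box.x1 + (box.x2 - box.x1) * k / len(box.text)
    return [replace(box, x2=cut, text=box.text[:k]),
            replace(box, x1=cut, text=box.text[k:])]


def replace_text(box: Box, rng: random.Random, vocab: List[str]) -> Box:
    """4) Randomly replace the text content of a frame."""
    return replace(box, text=rng.choice(vocab))


def augment(boxes: List[Box], rng: random.Random, vocab: List[str],
            p: float = 0.5) -> List[Box]:
    """Apply each method independently with probability p (an assumed policy)."""
    out: List[Box] = []
    for b in drop(boxes, rng):
        if rng.random() < p:
            b = perturb(b, rng)
        if rng.random() < p:
            b = replace_text(b, rng, vocab)
        out.extend(split(b, rng) if rng.random() < p else [b])
    return out
```

Passing a seeded `random.Random` instance keeps the augmentation reproducible across training runs, which makes it easier to compare models trained with and without a given method.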
Embodiment 3
[0042] This embodiment provides a specific implementation of an electronic device, based on the bill image layout analysis method described in Embodiment 1.
[0043] As shown in Figure 4, the electronic device includes a processor 401, a communication interface 402, a memory 403, and a communication bus 404, wherein the processor 401, the communication interface 402, and the memory 403 communicate with one another through the communication bus 404. The processor 401 can call a computer program stored in the memory 403 and runnable on the processor 401 to execute the methods provided by the above embodiments, for example: preparing training layout samples for model training and assisting in manual labeling; performing feature encoding on the text boxes in the training layout samples; performing feature splicing on the coordinate features and text features of the text boxes, form...
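The feature splicing step named in the paragraph above can be pictured as concatenating a coordinate vector with a text vector for each text box. The sketch below is illustrative only: the source does not say how either feature is computed, so the page-size normalization, the character-bucket text encoder, and all function names are assumptions.

```python
from typing import List, Sequence


def coord_features(box: Sequence[float], width: float, height: float) -> List[float]:
    """Normalize (x1, y1, x2, y2) pixel corners by the page size (assumed scheme)."""
    x1, y1, x2, y2 = box
    return [x1 / width, y1 / height, x2 / width, y2 / height]


def text_features(text: str, dim: int = 8) -> List[float]:
    """Hypothetical stand-in encoder: a normalized character-bucket histogram."""
    vec = [0.0] * dim
    for ch in text:
        vec[ord(ch) % dim] += 1.0
    total = sum(vec) or 1.0  # avoid division by zero for empty text
    return [v / total for v in vec]


def splice(box: Sequence[float], text: str, width: float, height: float,
           dim: int = 8) -> List[float]:
    """Concatenate coordinate and text features into one vector per text box."""
    return coord_features(box, width, height) + text_features(text, dim)
```

A call such as `splice((10, 20, 110, 40), "TOTAL", 200, 100)` yields a 12-element vector: four normalized coordinates followed by eight text-histogram values. A real system would likely substitute a learned text encoder for the histogram.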