Method and device for extracting official document elements based on OCR character recognition

A technology for text recognition and official documents, applied in character and pattern recognition, instruments, computing, etc., can solve problems such as low efficiency, waste of manpower and time, and achieve the effect of improving speed and efficiency, improving efficiency, and being easy to implement.

Active Publication Date: 2020-12-29
北京中宏立达信创科技股份有限公司 +1
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of the above analysis, the embodiment of the present invention aims to provide a method and device for extracting official document elements based on OCR text recognition, so as to solve the problem of waste of manpower and time and low efficiency of existing methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for extracting official document elements based on OCR character recognition
  • Method and device for extracting official document elements based on OCR character recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] Preferred embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings, wherein the accompanying drawings constitute a part of the application and together with the embodiments of the present invention are used to explain the principle of the present invention and are not intended to limit the scope of the present invention.

[0058] The existing method of artificially extracting official document elements has a large workload, is prone to errors, and has high repetitive work. If the volume of processed documents increases, the cost of manpower and time will be huge, and the method of artificially extracting official document elements is accurate. Poor degree and low efficiency. To this end, this application proposes a method and device for extracting official document elements based on OCR text recognition. By using a scanner to scan paper official documents into electronic official documents, the coordinate area p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method and device for extracting official document elements based on OCR character recognition, belongs to the technical field of intelligent file processing, and solves theproblems that an existing method wastes manpower and time and is low in efficiency. The method comprises the following steps: scanning a paper official document file containing official document element information to obtain an electronic official document file; dynamically generating an algorithm selection box based on the electronic official document file, and obtaining a corresponding algorithm based on the algorithm selection box; acquiring official document element information in the electronic official document file based on an algorithm; and storing the obtained official document element information to a corresponding field position of an official document element form in a business information processing system. The invention is simple and feasible, is easy to implement, achievesthe quick extraction of the official document elements, saves the manpower and cost, and improves the efficiency.

Description

technical field [0001] The invention relates to the technical field of document intelligent processing, in particular to a method and device for extracting official document elements based on OCR text recognition. Background technique [0002] At present, the traditional sending and receiving of documents is mainly to scan the documents into electronic files through a scanner, and then upload them to the relevant information system for document processing. In the process of document processing, it is often necessary to extract the official document elements in the documents, and the current official document elements in office business Recognition is mainly based on manual methods, through artificial identification of official document elements in documents, and manual input of official document content elements into the business information processing system. [0003] Manually identifying official document elements and manually inputting them into the business information p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/34
CPCG06V30/153G06V10/267G06V30/287
Inventor 张朝壹李志芳侯文君邓倩楠李旭明陈毅彬
Owner 北京中宏立达信创科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products