Document auditing method, device and system, equipment and storage medium

A document and plain text technology, applied in the direction of instruments, electrical digital data processing, calculation models, etc., can solve the problems of large differences in document review systems, long development cycles, and poor information extraction functions

Pending Publication Date: 2020-02-28
DATAGRAND TECH INC
View PDF3 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The defect of the existing technology is that information extraction based on traditional technologies such as keyword search, text matching, and regular expressions, often due to reasons such as rich text expressions in actual business forecast samples, leads to poor information extraction functions and directly affects subsequent links.
At the same time, due to the particularity of the field and the difference of document categories, the document review system in the prior art often can only deal with a certain scene of a certain document in a certain field
Between different scenarios, the document review system is quite different and cannot be expanded
New scenarios often need to be developed from scratch, the existing system is less reused, and the development cycle is longer

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document auditing method, device and system, equipment and storage medium
  • Document auditing method, device and system, equipment and storage medium
  • Document auditing method, device and system, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0033] figure 1 It is a flow chart of a document review method provided by Embodiment 1 of the present invention. This embodiment is applicable to the situation of auditing documents, and the method can be executed by the document auditing device provided in the embodiment of the present invention, which can be implemented in the form of software and / or hardware, and can generally be integrated in computer equipment in the document review component.

[0034] like figure 1 As shown, the method of the embodiment of the present invention specifically includes:

[0035] Step 101: Obtain the object numbered notation format document sent by the document parsing component, and obtain the preset machine learning model and document review rules matching the object numbered notation format document according to the business scenario of the object numbered notation format document.

[0036] Wherein, the webpage component obtains the document to be reviewed inputted by the user through...

Embodiment 2

[0058] figure 2 It is a flow chart of a document review method provided by Embodiment 2 of the present invention. The embodiment of the present invention can be combined with each optional solution in the above-mentioned one or more embodiments. In the embodiment of the present invention, it may further include: according to the training sample set corresponding to each business scenario, the training sample set includes The set number of documents; use the training sample set to train the machine learning model, and obtain the preset machine learning model matching each business scenario.

[0059] And, according to the business scenario of the object numbered notation format document, obtaining the preset machine learning model and document review rules matching the object numbered notation format document may include: obtaining the object numbered notation model in the preset machine learning model matching each business scenario A preset machine learning model for busines...

Embodiment 3

[0084] image 3 It is a schematic structural diagram of a document checking device provided by Embodiment 3 of the present invention. like image 3 As shown, the apparatus can be configured in a document review component of a computer device, including: a document acquisition module 301 , a key information extraction module 302 and an audit result generation module 303 .

[0085] Among them, the document acquisition module 301 is used to obtain the object numbered notation format document sent by the document parsing component, and according to the business scenario of the object numbered notation format document, obtain the preset machine learning model and document review rules that match the object numbered notation format document; key information extraction Module 302 is used to extract the key information in the document in numbered musical notation format of the object through the preset machine learning model; the review result generation module 303 is used to generat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a document auditing method, device and system, equipment and a storage medium. The method comprises the following steps: acquiring an object numbered musicalnotation format document sent by a document analysis component, and acquiring a preset machine learning model and a document auditing rule matched with the object numbered musical notation format document according to a business scene of the object numbered musical notation format document; extracting key information in the object numbered musical notation format document through a preset machinelearning model; and generating a document auditing result through a rule engine according to the document auditing rule and the key information, and sending the document auditing result to the webpagecomponent. According to the embodiment of the invention, the matched preset machine learning model and document auditing rule can be obtained according to the business scene. The key information in the object numbered musical notation format document can be accurately extracted through the preset machine learning model, and then document auditing is carried out through the rule engine according to the document auditing rule and the key information, so that the document auditing requirements under different business scenes are met, and the document auditing effect and the document auditing efficiency are improved.

Description

technical field [0001] Embodiments of the present invention relate to document processing technology, and in particular to a document review method, device, system, equipment and storage medium. Background technique [0002] Document review is a common work scenario. Various types of documents have related review standards and procedures, and only compliant documents can enter the next step. At present, most of the document review work is purely manual. Not only is the efficiency relatively low, but also due to the boring and heavy work, some information may be missed, leading to risks. [0003] The existing technology introduces automation solutions in some links of document review, which can perform certain auxiliary work in a certain link, reducing labor costs and pressure. For example, the document review process includes: document parsing, information extraction, and document review. The document review system in the prior art extracts information based on traditiona...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/205G06F40/103G06F40/151G06N20/00
CPCG06N20/00
Inventor 高翔陈运文安怡赵业辉李瀚清纪达麒
Owner DATAGRAND TECH INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products