Document comparative analysis method and device, electronic equipment and storage medium

An analysis method and document technology, applied in the direction of electrical digital data processing, instruments, calculations, etc., can solve the problems of wasting human resources, time and energy, etc.

Pending Publication Date: 2020-04-10
DATAGRAND TECH INC
View PDF3 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this requires manual comprehensive reading of documents of different revision versions, wasting human res

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document comparative analysis method and device, electronic equipment and storage medium
  • Document comparative analysis method and device, electronic equipment and storage medium
  • Document comparative analysis method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0034] Figure 1a It is a flow chart of a document comparison and analysis method provided by Embodiment 1 of the present invention. This embodiment is applicable to the comparison and analysis of multiple documents to find out the differences in the documents. For example, different versions of In the case of comparative analysis of documents, the method can be executed by a document comparison and analysis device, which can be implemented by software and / or hardware, and the device can be integrated in a processor, such as Figure 1a As shown, the method specifically includes:

[0035] Step 110: Parse the first document to be analyzed and the second document to be analyzed respectively to obtain a first element set and a second element set.

[0036] Wherein, the first document to be analyzed can be the latest version of the document, and the second document to be analyzed can be a different version of the same type as the first document to be analyzed, a document issued in a ...

Embodiment 2

[0063] Figure 1e It is a flow chart of a document comparison and analysis method provided in Embodiment 2 of the present invention; this embodiment further refines the above technical solution, as Figure 1e As shown, the method specifically includes:

[0064] In step 310, the first document to be analyzed and the second document to be analyzed are respectively parsed by the element parsing module to obtain a first element set and a second element set.

[0065] Wherein, the function of the element parsing module is to parse according to the element granularity required by the first document to be analyzed and the second document to be analyzed, and the parsing result is an element set. Such as Figure 1b As shown, when the granularity is a paragraph, the parsing result is that the first element set and the second element set are a paragraph set; when the granularity is a sentence, the parsing result is that the first element set and the second element set are a sentence set...

Embodiment 3

[0073] figure 2 is a schematic structural diagram of a document comparison and analysis device provided in Embodiment 3 of the present invention, as shown in figure 2 As shown, the device includes: an element set analysis module 210 , an element matching module 220 , an element determination module 230 , a first identification module 240 , an element pair formation module 250 and a second identification module 260 .

[0074] An element set parsing module 210, configured to parse the first document to be analyzed and the second document to be analyzed respectively to obtain the first element set and the second element set;

[0075] An element matching module 220, configured to calculate the degree of matching between each element in the first element set and each element in the second element set;

[0076] An element determination module 230, configured to respectively determine the element with the highest matching degree between the second element set and each element in t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiments of the invention disclose a document comparative analysis method and device, electronic equipment and a storage medium. The method comprises the steps of analyzing a first to-be-analyzed document and a second to-be-analyzed document to obtain a first element set and a second element set; calculating the matching degree of each element in the first element set and each element in the second element set; determining an element, having the highest matching degree with each element in the first element set, in the second element set; if the matching degree of the target element inthe first element set and the element with the highest matching degree in the second element set is smaller than a set threshold value, identifying the target element; if the matching degree of the target element in the first element set and the element with the highest matching degree in the second element set is greater than the set threshold, forming an element pair; and analyzing characters inthe element pairs through an element comparison module, and identifying differences in the element pairs in the first to-be-analyzed document and the second to-be-analyzed document respectively. Thus, the difference between the documents can be quickly and accurately found.

Description

technical field [0001] Embodiments of the present invention relate to natural language processing technologies, and in particular to a document comparison and analysis method, device, electronic equipment, and storage medium. Background technique [0002] In today's era, information updates are iterated rapidly. For management decision makers, every slight change in information may become an important factor affecting decision-making. Often, only those who have more comprehensive, timely and effective information can become the winners. However, information changes frequently, and the number of documents that need to be read within a limited time is increasing. How to quickly obtain valuable information in documents in a short period of time has become a challenge for management decision makers. In particular, in various official documents such as government work reports, laws and regulations, standard documents, contracts, official documents, research reports, etc., there a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/194
Inventor 王文广贺梦洁陈运文
Owner DATAGRAND TECH INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products