Text paragraph identification comparison method and system based on longest common subsequence

Longest common subsequence technology, applied in the field of text processing, solves the problem of comparing documents for which paragraph information is unavailable.

Active Publication Date: 2018-11-02
DATAGRAND TECH INC

AI Technical Summary

Problems solved by technology

[0006] The main purpose of this application is to provide a text paragraph identification and comparison method that solves the problem that existing document comparison tools cannot compare documents whose paragraph information cannot be obtained.


Examples


Embodiment Construction

[0036] To enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiments of the application is described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the application, not all of them. All other embodiments obtained by persons of ordinary skill in the art from the embodiments in this application without creative effort shall fall within the scope of protection of this application.

[0037] It should be noted that the terms "first" and "second" in the description and claims of the present application and in the above drawings are used to distinguish similar objects, not necessarily to describe a specific order or sequence. It should be understood that the data so used may be interchanged under appropriate circumstances for...


Abstract

The invention discloses a text paragraph identification and comparison method and system based on the longest common subsequence. The method comprises the steps of: acquiring a first text character string and a second text character string; performing paragraph identification on the first and second text character strings; performing paragraph order adjustment on the first and second text character strings; and comparing the first and second text character strings after paragraph order adjustment to obtain the difference items. The system comprises a front end, a conversion module, a paragraph identification module and a comparison module. The method and system solve two problems of existing text comparison tools: texts whose paragraph information cannot be acquired cannot be compared, and reordered paragraphs are not handled well.
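The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the source does not say how paragraphs are identified or how the order adjustment is scored, so this sketch assumes that paragraphs are already available as lists of strings, uses a greedy best-match on an LCS-based similarity ratio for reordering, and reports differences per character. The names `lcs_table`, `lcs_ratio`, `reorder` and `diff_items` are hypothetical.

```python
from typing import List, Tuple


def lcs_table(a: str, b: str) -> List[List[int]]:
    """Dynamic-programming table: t[i][j] is the LCS length of a[:i] and b[:j]."""
    t = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, ca in enumerate(a, 1):
        for j, cb in enumerate(b, 1):
            if ca == cb:
                t[i][j] = t[i - 1][j - 1] + 1
            else:
                t[i][j] = max(t[i - 1][j], t[i][j - 1])
    return t


def lcs_ratio(a: str, b: str) -> float:
    """Similarity in [0, 1]; an assumed scoring rule, not taken from the source."""
    if not a and not b:
        return 1.0
    return 2 * lcs_table(a, b)[len(a)][len(b)] / (len(a) + len(b))


def reorder(first: List[str], second: List[str]) -> List[str]:
    """Greedily reorder `second` so each paragraph lines up with its best match in `first`."""
    remaining = list(second)
    ordered = []
    for p in first:
        if not remaining:
            break
        best = max(remaining, key=lambda q: lcs_ratio(p, q))
        remaining.remove(best)
        ordered.append(best)
    return ordered + remaining  # unmatched paragraphs go at the end


def diff_items(a: str, b: str) -> List[Tuple[str, str]]:
    """Backtrace the LCS table: ('-', ch) is only in a, ('+', ch) is only in b."""
    t = lcs_table(a, b)
    i, j = len(a), len(b)
    out = []
    while i > 0 or j > 0:
        if i > 0 and j > 0 and a[i - 1] == b[j - 1]:
            i, j = i - 1, j - 1  # common character, not a difference
        elif j > 0 and (i == 0 or t[i][j - 1] >= t[i - 1][j]):
            out.append(("+", b[j - 1]))
            j -= 1
        else:
            out.append(("-", a[i - 1]))
            i -= 1
    out.reverse()
    return out


if __name__ == "__main__":
    first = ["the cat sat", "a dog ran"]
    second = ["a dog runs", "the cat sat"]  # same content, paragraphs swapped
    aligned = reorder(first, second)
    for p, q in zip(first, aligned):
        print(repr(p), repr(q), diff_items(p, q))
```

The table-based LCS costs time and memory proportional to the product of the two string lengths, so a practical system would likely compare at paragraph or token granularity first; that choice is left open here.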

Description

Technical field

[0001] The present application relates to the field of text processing, and in particular to a method and system for identifying and comparing text paragraphs based on the longest common subsequence.

Background

[0002] Many companies today hold a large number of internal documents, such as contracts, instructions and tenders. These documents are often highly similar, differing in only a few places, and there is frequently a need to compare them. For example, comparing two contracts and finding the differences between them makes it possible to quickly locate the key points and risks of the contract. Document comparison therefore has great practical value for many enterprises. In the past, comparison was often done manually, which was inefficient and error-prone, and so document comparison tools were created.

[0003] The current document comparison tools, such as Word's built-in comparison function, take the entire document...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/00, G06K9/34
CPC: G06V30/416, G06V10/267, G06V30/153
Inventor: 李瀚清, 高翔, 纪达麒, 陈运文
Owner: DATAGRAND TECH INC