Method and system for text paragraph recognition and comparison based on longest common subsequence

A longest common, subsequence technology, applied in the field of text processing, can solve problems such as document comparison without paragraph information

Active Publication Date: 2022-08-09
DATAGRAND TECH INC
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The main purpose of this application is to provide a text paragraph identification and comparison method to solve the problem that existing document comparison tools cannot compare documents that cannot obtain paragraph information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for text paragraph recognition and comparison based on longest common subsequence
  • Method and system for text paragraph recognition and comparison based on longest common subsequence
  • Method and system for text paragraph recognition and comparison based on longest common subsequence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make those skilled in the art better understand the solutions of the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only The embodiments are part of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the scope of protection of the present application.

[0037] It should be noted that the terms "first", "second", etc. in the description and claims of the present application and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used are interchangeable un...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present application discloses a method and system for identifying and comparing text paragraphs based on the longest common subsequence. The text paragraph identification and comparison method includes: obtaining a first text string and a second text string; performing paragraph identification on the first text string and the second text string; The second text string is adjusted in paragraph order; the first text string after the paragraph order is adjusted and the second text string are compared to obtain a difference item. The text paragraph recognition and comparison system includes: a front end, a conversion module, a paragraph recognition module, and a comparison module. The present application solves the problems that existing document comparison tools cannot compare documents for which paragraph information cannot be obtained, and cannot properly handle the situation of paragraph reversal.

Description

technical field [0001] The present application relates to the field of text processing, and in particular, to a method and system for identifying and comparing text paragraphs based on the longest common subsequence. Background technique [0002] In contemporary society, many companies have a large number of documents, such as contracts, instructions, tenders, etc., and the similarity between such documents is relatively high, with only a few differences. There is often a need to compare the documents. For example, by comparing two contracts to find out the differences between them, the key points and risks of the contracts can be quickly found. It can be said that document comparison has great practical value for many enterprises. In the past, manual comparison was often used, which was inefficient and prone to errors, resulting in document comparison tools. [0003] The current document comparison tool, such as the comparison function that comes with Word, takes the entir...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/10G06V30/418G06V30/148G06V30/19
CPCG06V30/416G06V10/267G06V30/153
Inventor 李瀚清高翔纪达麒陈运文
Owner DATAGRAND TECH INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products