
Knowledge graph-based text comparison method, apparatus and device, and storage medium

A knowledge graph-based text comparison technology, applied in the field of big data, that addresses the loss of useful text information and the resulting deviation in comparison results.

Pending Publication Date: 2020-11-06
PING AN TECH (SHENZHEN) CO LTD
0 Cites, 8 Cited by

AI Technical Summary

Problems solved by technology

Existing summary-based comparison methods extract the semantics of the text content, which helps refine the text and improves the efficiency of comparing text elements. However, the refining process inevitably discards some useful text information, which causes deviations in the comparison results.



Examples


Specific embodiment

[0111] Figure 4 shows a specific implementation after step S24, including:

[0112] S25: Judging whether two or more initial entities form a compound word by measuring the degree of cohesion of the initial entities in the short sentences of the text, and obtaining a judgment result.

[0113] Specifically, the degree of cohesion of the initial entities in the text sentences is measured through tf-idf and co-occurrence analysis.

[0114] Here, tf-idf is a statistical method for evaluating how important a word is to a document in a document set or corpus. The importance of a word increases in proportion to the number of times it appears in the document, but decreases in inverse proportion to its frequency across the corpus. Co-word analysis uses the co-occurrence of words and noun phrases in a collection of documents to determine the relationships between topics in the disciplines that the collection represents. It is generally believed that the more time...
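The statistics named in [0113] and [0114] can be sketched as follows. This is only an illustration under stated assumptions: the excerpt does not give the exact cohesion formula, so pointwise mutual information (PMI) over sentence-level co-occurrence is swapped in as a stand-in, and the function names, sample sentences and the 0.5 threshold are all hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: tf-idf score} dict per tokenized document."""
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def cohesion(sentences, a, b):
    """PMI of entities a and b co-occurring in the same sentence.

    Assumption: PMI stands in for the 'degree of cohesion'; the source
    does not specify the actual formula.
    """
    n = len(sentences)
    count_a = sum(1 for s in sentences if a in s)
    count_b = sum(1 for s in sentences if b in s)
    count_ab = sum(1 for s in sentences if a in s and b in s)
    if count_ab == 0:
        return float("-inf")  # never seen together
    return math.log((count_ab * n) / (count_a * count_b))


def is_compound(sentences, a, b, threshold=0.5):
    """Hypothetical decision rule: compound word if cohesion exceeds threshold."""
    return cohesion(sentences, a, b) > threshold


# Hypothetical tokenized short sentences.
sentences = [
    ["ping", "an", "insurance"],
    ["ping", "an", "bank"],
    ["bank", "loan"],
    ["insurance", "claim"],
]
scores = tf_idf(sentences)
print(is_compound(sentences, "ping", "an"))    # always co-occur
print(is_compound(sentences, "ping", "bank"))  # co-occur only by chance
```

With this sample, "ping" and "an" always appear together and pass the threshold, while "ping" and "bank" co-occur no more often than chance and do not. A real system would likely combine the tf-idf weights with the co-occurrence score; how the patent does so is not visible in this excerpt.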


Abstract

The invention relates to big data technology and discloses a knowledge graph-based text comparison method, which comprises the following steps: obtaining a training text, identifying target entities and target relationships in the training text, generating a graph with the target entities as nodes and the target relationships as edges, and taking this graph as the initial graph; labeling the target entities and target relationships of the initial graph, and clustering the nodes of the initial graph according to the labeled target entities and target relationships to obtain a target graph; obtaining to-be-compared texts, inputting the to-be-compared texts into the target graph, and counting the coverage rate of the entities and relationships extracted from each to-be-compared text over the core information in the target graph; and, if the coverage rate exceeds a preset threshold, judging that the to-be-compared text is a similar text. The invention also relates to blockchain technology, and the to-be-compared text is stored in a blockchain. By comparing against a graph, the invention improves the accuracy and efficiency of text comparison.
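The coverage check at the end of the abstract can be sketched as below. This is a minimal illustration, not the patented implementation: it assumes entities are plain strings, relationships are (head, relation, tail) triples, and coverage is the share of core entities and relationships found in the text. The function names, sample data and thresholds are hypothetical.

```python
def coverage_rate(core_entities, core_relations, text_entities, text_relations):
    """Fraction of the target graph's core items that the text also contains."""
    core_e, core_r = set(core_entities), set(core_relations)
    total = len(core_e) + len(core_r)
    if total == 0:
        return 0.0
    covered = len(core_e & set(text_entities)) + len(core_r & set(text_relations))
    return covered / total


def is_similar(rate, threshold):
    """The text counts as similar when coverage exceeds the preset threshold."""
    return rate > threshold


# Hypothetical core information of a target graph.
core_entities = {"policyholder", "policy", "insurer"}
core_relations = {
    ("policyholder", "holds", "policy"),
    ("insurer", "issues", "policy"),
}

# Hypothetical entities and relationships extracted from a to-be-compared text.
text_entities = {"policyholder", "policy", "claim"}
text_relations = {("policyholder", "holds", "policy")}

rate = coverage_rate(core_entities, core_relations, text_entities, text_relations)
print(rate)                   # 3 of 5 core items covered -> 0.6
print(is_similar(rate, 0.5))  # True
print(is_similar(rate, 0.8))  # False
```

Counting entities and relationships with equal weight is a simplification; the abstract does not say how the two are weighted or how the threshold is chosen.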

Description

Technical field

[0001] The present application relates to the field of big data technology, and in particular to a knowledge graph-based text comparison method, apparatus, device and storage medium.

Background

[0002] Text content comparison technology is widely used in both vertical and general domains. For example, in financial text processing scenarios such as insurance, banking and investment that involve intake review or risk monitoring, multiple documents must be compared to check whether the information they provide is contradictory, in order to serve the audit purpose.

[0003] Existing text comparison technology uses automatic summarization: it splits the text, generates a summary for each split paragraph, and finally compares the summaries of the two articles to judge whether their main content expresses a consistent meaning, and then judge whether the two articles...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/36, G06F40/211, G06F40/295, G06F40/30, G06F40/194, G06K9/62, G06N3/04, G06N3/08
CPC: G06F40/211, G06F40/295, G06F40/30, G06F16/367, G06F40/194, G06N3/08, G06N3/045, G06F18/231, G06F18/22, G06F18/24
Inventor: 朱昱锦, 徐国强
Owner: PING AN TECH (SHENZHEN) CO LTD