Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Fine-granularity dependence relationship-based method for calculating Chinese long sentence similarity

A technology of similarity calculation and dependency relationship, applied in the field of Chinese long sentence similarity calculation based on fine-grained dependency relationship, which can solve the problem that the importance of roles is not fully considered.

Inactive Publication Date: 2016-09-28
ANYANG NORMAL UNIV
View PDF2 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a Chinese long sentence similarity calculation method based on fine-grained dependencies, aiming to solve the problem that the current sentence similarity method based on dependency syntax does not fully consider the role importance of each element inside the dependency relationship, but only The problem of labeling dependencies as a whole

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fine-granularity dependence relationship-based method for calculating Chinese long sentence similarity
  • Fine-granularity dependence relationship-based method for calculating Chinese long sentence similarity
  • Fine-granularity dependence relationship-based method for calculating Chinese long sentence similarity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0032] The application principle of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0033] Such as figure 1 As shown, the Chinese long sentence similarity calculation method based on the fine-grained dependency relationship in the embodiment of the present invention comprises the following steps:

[0034] S101: Based on the dependency relationship pair, match the dependency relationship pair sets of the two sentences, and select the corresponding relationship that maximizes the sum of the relationship pair similarities;

[0035] S102: On the basis of the correspondi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a fine-granularity dependence relationship-based method for calculating Chinese long sentence similarity. The method comprises the steps of calculating a dependence relationship pair set of two sentences based on dependence relationship pairs; selecting a corresponding relationship which enables a relationship pair similarity sum to be maximal; and based on the corresponding relationship, calculating an average value of relationship pair similarity sums as syntactic structure similarity of the two sentences. According to the method, five characteristic variables such as concepts, word properties, relationship types and the like of two vocabularies in a dependence relationship are compared and a syntactic structure similarity relationship of the sentences are measured; an experimental result already prominently shows the advantages of semantic analysis in sentence similarity calculation; compared with similarity calculation in the prior art, the correct rate is increased to a certain extent, key information reflected around a non central word is considered, and two layers of root node-predicate head and a direct domination part in a dependence syntactic tree are mainly considered in the prior art; and the correct rate is exchanged with the time in the method.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and in particular relates to a method for calculating the similarity of long Chinese sentences based on fine-grained dependencies. Background technique [0002] In the research of Chinese information processing, the calculation of sentence similarity is an important part, and it has a wide range of applications in automatic summarization, machine translation, information retrieval, automatic question answering system, etc. At present, the research on sentence similarity calculation can be roughly divided into word-based similarity, word meaning-based similarity, syntactic structure-based similarity, edit distance-based, dynamic programming-based similarity, etc. These methods can be classified into two categories: one does not need or only needs shallow grammatical structure relations, and the other requires deep grammatical structure and semantic analysis. In case-based mach...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/211
Inventor 熊晶郭涛翟雪孙华王继鹏高峰袁冬
Owner ANYANG NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products