Automatic classification method of text syntax structure and semantic information fused text entity relationship

A technology of grammatical structure and semantic information, applied in text database clustering/classification, semantic analysis, unstructured text data retrieval, etc. It is fully utilized and other problems to achieve the effect of alleviating the difficulty of semantic feature extraction.

Pending Publication Date: 2020-05-19
SHANGHAI UNIV +1
View PDF9 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Traditional entity-relationship classification methods using machine learning, such as support vector machines, need to construct a large number of lexical, syntactic and semantic features, and these features cannot be reused across domains, which greatly limits the scope of use of the method
Traditional methods of using neural networks to classify entity relationships, such as bidirectional recurrent memory neural networks and convolutional neural networks, simply use the direct mapping between words and vectors in the text, ignoring the impact of entities and relationships on the words in sentences , the effective information of entities and relationships cannot be fully utilized, so the classification accuracy is not high; some neural network methods for entity relationship classification using sentence grammatical structure, such as the neural network method based on the shortest dependency path, ignore other components in the sentence The impact on the entity relationship itself causes the loss of lexical information other than the entity, and it is impossible to effectively obtain the positive impact of environmental words other than the entity on the entity relationship

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic classification method of text syntax structure and semantic information fused text entity relationship
  • Automatic classification method of text syntax structure and semantic information fused text entity relationship
  • Automatic classification method of text syntax structure and semantic information fused text entity relationship

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The implementation method of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0037] Such as figure 1 As shown, an automatic text entity relationship classification method that integrates text grammatical structure and semantic information, the specific steps are as follows:

[0038] Step (1): Obtain the text training set information published on the Internet and perform preprocessing to obtain the initial vector of the sentence.

[0039] Obtain the Internet public ACL data set SemEval2010_task8 The data set features: contains 19 types of relationships, of which the main relationships are divided into {Message-Topic(e1,e2), Product-Producer(e1,e2), Instrument-Agency(e1,e2), Entity-Destination(e1,e2), Cause-Effect(e1,e2), Component-Whole(e1,e2), Entity-Origin(e1,e2), Member-Collection(e1,e2), Content-Container(e1 ,e2)} These 9 types of relationships, in which there is an order relationship between e1 and e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an automatic classification method of a text syntax structure and semantic information fused text entity relationship. The method comprises the following specific implementation steps: (1) preprocessing a text set to obtain a sentence initial vector; (2) extracting relevancy between words in sentences and entities and relations, and updating sentence vectors; (3) inputtingthe sentence vector in step (2) into a bidirectional gate loop unit neural network to obtain a sentence vector fused with semantic information; (4) inputting the sentence vector in step (2) into a graph attention neural network to obtain a sentence vector fused with grammatical structure information; (5) splicing the output of step (3) and the output of step (4), and updating sentence vectors; and(6) inputting the sentence vector in step (5) into a fully connected neural network layer to obtain a feature vector of the sentence, and outputting an entity relationship classification vector through softmax transformation. The method can effectively relieve the problems that semantic features and grammatical features are difficult to express and artificial feature selection errors are large inentity relationship classification.

Description

technical field [0001] The present invention relates to the fields of text mining and deep learning, and more specifically, relates to a text entity relationship automatic classification method that integrates text grammatical structure and semantic information. Background technique [0002] Traditional entity-relationship classification methods using machine learning, such as support vector machines, need to construct a large number of lexical, syntactic and semantic features, and these features cannot be reused across domains, which greatly limits the scope of use of the method . Traditional methods of using neural networks to classify entity relationships, such as bidirectional recurrent memory neural networks and convolutional neural networks, simply use the direct mapping between words and vectors in the text, ignoring the impact of entities and relationships on the words in sentences , the effective information of entities and relationships cannot be fully utilized, s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F40/211G06F40/295G06F40/30G06N3/02
CPCG06F16/35G06N3/02
Inventor 陈雪陈光勇骆祥峰黄敬王鹏
Owner SHANGHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products