Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text feature extraction method and knowledge graph construction method

A feature extraction, text technology, applied in neural learning methods, text database query, text database clustering/classification, etc., can solve problems such as difficult to deal with referential problems, feature loss, and reduce the accuracy of relational classification tasks, and improve Accuracy, the effect of improving accuracy

Active Publication Date: 2021-11-16
广州天宸健康科技有限公司
View PDF12 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing one-step relationship extraction algorithm mainly designs the feature extraction model around the entity-relationship pair, and rarely focuses on the feature extraction design with the entity as the core, so the final accuracy of the relationship extraction is not high. , reducing the accuracy of subsequent relation classification tasks
[0007] In addition, the existing methods are difficult to deal with the reference problem of entities in the text, resulting in the loss of features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text feature extraction method and knowledge graph construction method
  • Text feature extraction method and knowledge graph construction method
  • Text feature extraction method and knowledge graph construction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0063] See figure 2 A preferred embodiment of the extraction of entity characteristics, the steps are as follows:

[0064] First, the BERT model is used to perform feature extraction, and the features of each word, of course, other embodiments of the change may be used, or other models or similar Bert models can be used.

[0065] Shielding the features of non-entity keywords, and the feature E1 containing the physical word, the specific implementation method can be implemented using the MASK mechanism in the BERT model.

[0066] The length of the entity is represented by a vector, and the length feature E2 is obtained.

[0067] The feature E1 and feature E2 are spliced ​​to obtain feature E3 as entity features, and the feature vectors of the entity feature represent the specific vector of the entity that is embedded, plus the entity character. Features vector.

[0068] Further, after the feature E3 is obtained, the first neural network and a classifier C1 (i.e., the physical posit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text feature extraction method and a knowledge graph construction method. The text feature extraction method comprises the following steps: constructing negative samples on the basis of marked entities and relationships as positive samples: entity negative samples, relationship negative samples and anaphora disambiguation negative samples; performing mapping characterization on the positive sample and the negative sample to obtain a vector set formed by entity characterization, entity pair characterization and anaphora disambiguation entity pair characterization; analyzing the vector set to obtain entity features, anaphora disambiguation features and relation features; classifying the entity features, the anaphora disambiguation features and the relation features; setting loss function, and evaluating classification result. According to the method, the entity is taken as the center, anaphora disambiguation is assisted, the problem of entity loss caused by anaphora non-entity is solved, the accuracy of the whole model is improved, the relation characteristics are fused, and the accuracy of relation identification is further improved.

Description

Technical field [0001] The present invention relates to the field of natural language processing, and more particularly to a textual characteristic extraction method and a knowledge map construction method. Background technique [0002] Knowledge map is a semantic network that represents information and relationships in the form of a map data structure, thereby being used to further excavate the relationship between information and information hidden in the information. [0003] The graph in the knowledge map, consists of nodes and edges, where the nodes are used to represent concepts and entities, while indicating the relationship and attributes of things. How to extract the input or existing information (including entity extraction, relationship extraction, and attribute extraction), resulting in knowledge representation, is the basis and premise of the next construction of knowledge maps. [0004] In the prior art, there are two main methods, the first is two steps, which is t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/35G06F16/36G06F40/166G06F40/30G06N3/04G06N3/08
CPCG06F16/3344G06F16/35G06F16/367G06F40/166G06F40/30G06N3/08G06N3/045Y02D10/00
Inventor 曾祥云朱姬渊
Owner 广州天宸健康科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products