Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

A knowledge base construction method for science and technology information analysis

A construction method and knowledge base technology, applied in the field of computer knowledge base construction, can solve problems such as inability to simulate labels

Inactive Publication Date: 2019-01-11
HARBIN ENG UNIV
View PDF2 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] This improvement aims at the problem that a single network cannot simulate label dependence in entity extraction and the noise problem of remote supervision labeling in entity relationship extraction. The CWATT-BiLSTM-LSTMd model is proposed for entity extraction, and the RL-TreeLSTM model is used for entity relationship extraction.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A knowledge base construction method for science and technology information analysis
  • A knowledge base construction method for science and technology information analysis
  • A knowledge base construction method for science and technology information analysis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The specific embodiment of the present invention will be further described below in conjunction with accompanying drawing:

[0057] A knowledge base construction method for science and technology information analysis, using the CWATT-BiLSTM-LSTMd model proposed by the present invention to carry out entity extraction, mainly through the following steps:

[0058] Step 1: Divide the data set into training set, verification set and test set in a ratio of 6:2:2. The training set is used to build the model, set the corresponding classifier parameters, and train the classification model. After using the training set to train multiple models, in order to find the model with the best effect, each model is used to determine the network structure and the parameters that control the complexity of the model using the data in the verification set. After obtaining the optimal model, the test set can evaluate the performance of the model.

[0059] Step 2: Run the word2vec software to...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a knowledge base construction method for scientific and technological information analysis, belonging to the field of computer knowledge base construction. A CWATT-BiLSTM-LSTMdmodel is provided for entity extraction, and an RL-TreeLSTM model is used for entity relation extraction. An encoding-decoding mode is adopted for entity extraction, a BiLSTM (Bidirectional Long-Short Memory Network) is used for coding, LSTMd (Long-Short Memory Network) is used for decoding, and the embedding layer and decoding layer are improved. Then the model is used to extract entities fromthe corpus in the field of science and technology information. On the basis of deep reinforcement learning, the RL-TreeLSTM model is provided to extract the relationship between entities. The RL-TreeLSTM model is divided into two parts: a selector and a classifier. The selector selects effective sentences to the classifier in order to reduce the noise caused by the remote monitoring method. The classifier extracts the entity relation from the effective sentences, which improves the accuracy of relation extraction.

Description

technical field [0001] The invention belongs to the field of computer knowledge base construction, and in particular relates to a knowledge base construction method for scientific and technological information analysis. Background technique [0002] As a collection of knowledge, knowledge base plays a very important role in intelligent information processing. One of the core technologies of building a knowledge base is the extraction of entities and their relationships. The main goal of entity extraction is to extract entities that appear in a given sentence, usually using machine learning models and deep learning models. For example, the CRF (Conditional Random Field) model of machine learning needs to define a feature function, which focuses on the linear weighted combination of local features in the entire sentence. The quality of the feature template has a direct impact on the result of entity extraction; another example is the LSTM of deep learning ( Long-term short-t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/28G06F17/27
CPCG06F40/289
Inventor 王红滨秦帅谢晓东白云鹏李秀明王念滨周连科赵昱杰侯莎韦正现
Owner HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products