Entity attribute value extraction method based on bidirectional long-short-term memory network

A technology of long and short-term memory and entity attributes, applied in neural learning methods, biological neural network models, instruments, etc., can solve the problem of open irregularities in a large number of fields, difficult to distinguish the relationship between entities, attribute names and attribute values, and entity attributes. The problem of complex value composition, etc., to achieve the effect of reducing dependence

Active Publication Date: 2020-04-17
INST OF ELECTRONICS & INFORMATION ENG OF UESTC IN GUANGDONG
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the massiveness, heterogeneity, openness, and non-standardization of text data on the Internet lead to various categories and complex composition of entity attribute values, which brings new research challenges to entity a...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity attribute value extraction method based on bidirectional long-short-term memory network
  • Entity attribute value extraction method based on bidirectional long-short-term memory network
  • Entity attribute value extraction method based on bidirectional long-short-term memory network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Certain terms are used, for example, in the description and claims to refer to particular components. Those skilled in the art should understand that hardware manufacturers may use different terms to refer to the same component. The specification and claims do not use the difference in name as a way to distinguish components, but use the difference in function of components as a criterion for distinguishing. As mentioned throughout the specification and claims, "comprising" is an open term, so it should be interpreted as "including but not limited to". "Approximately" means that within an acceptable error range, those skilled in the art can solve technical problems within a certain error range and basically achieve technical effects.

[0036] In the description of the present invention, it should be understood that the orientation or positional relationship indicated by the terms "upper", "lower", "front", "rear", "left", "right", horizontal" etc. are based on the draw...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of network text data processing, and particularly relates to an entity attribute value extraction method based on a bidirectional long-short-term memory network, which comprises the following steps of: 1, preprocessing a document set; step 2, adopting category mapping to identify attribute values from statements containing entities; 3, performing deep syntactic analysis on the sentences of the entities and the attribute values, and extracting related sentence components as training corpora; and 4, performing vector conversion on the training corpusby adopting a word vector model, training BLSTM model parameters in combination with syntactic features, and classifying the entities and the attribute values into a given attribute name category. According to the method, a bidirectional long-short-term memory network is adopted, so that the relationship among entities, attribute names and attribute values can be accurately judged.

Description

technical field [0001] The invention belongs to the technical field of network text data processing, and in particular relates to a method for extracting entity attribute values ​​based on a two-way long-short-term memory network. Background technique [0002] With the vigorous development of various online media, unstructured text data on the Internet has shown explosive growth, such as news, Weibo, blogs, chat records, emails, etc. These data contain a lot of valuable information, such as entities. Entity is the most basic unit of information in text data. With the release of a large number of data, the ambiguity and diversity of entity names are becoming more and more common. Only identifying entity names cannot meet people's needs for deep semantic information in text. Therefore, in order to describe the essence of an entity, more and more researchers have begun to pay attention to the attribute information of the entity, such as the age, place of origin, and date of bi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/211G06F40/284G06F40/295G06N3/04G06N3/08
CPCG06N3/08G06N3/045G06N3/044Y02D10/00
Inventor 韩伟红徐菁陈雷霆陈育梅赵朗
Owner INST OF ELECTRONICS & INFORMATION ENG OF UESTC IN GUANGDONG
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products