Event relationship extraction method for massive data sets

A mass data and event relationship technology, applied in database models, relational databases, structured data retrieval, etc., can solve the problems of massive training samples, difficult to reach 60%, slow calculation, etc., to solve low precision and solve extraction speed problem effect

Active Publication Date: 2020-03-17
成都迪普曼林信息技术有限公司
View PDF9 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] At present, in the development of the knowledge graph system, for the extraction of event relationships, the mainstream algorithms are based on remote supervision algorithms. This algorithm is more practical for small data sets. Once the number of entities in the data set reaches tens of millions, Faced with the shortcomings of slow calculation, low accuracy of event relationship extraction, and the need for a large number of manually labeled training samples
It is difficult for traditional remote supervision training algorithms to reach 60% on large and massive data sets, and the number of manually labeled samples required by traditional methods is basically at the level of 100,000 or even higher

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Event relationship extraction method for massive data sets
  • Event relationship extraction method for massive data sets

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] In order to enable those skilled in the art to better understand the technical solutions of the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0028] The present invention proposes a method for extracting event relations of massive data sets, comprising the following steps:

[0029] S1: Establish association relationship and association strength between triplets according to association rules to form an undirected network;

[0030] S2: Connect the former term vector, the latter term vector and the entity type in the triplet as the feature of the node in the undirected network;

[0031] S3: Classify and process each node in the undirected network, and extract the entity relationship in the event.

[0032] Preferably, the association rules are established based on the FP-tree frequency set algorithm.

[0033] The schematic diagram of undirected network is as fol...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a mass data set event relationship extraction method, which comprises the steps of S1, establishing an association relationship and association strength among triples accordingto an association rule to form an undirected network; S2, connecting the preceding word vector, the posterior word vector and the entity type in the triad to serve as features of nodes in the undirected network; and S3, performing classification processing on each node in the undirected network, and extracting an entity relationship in the event. According to the method, the problems of extraction precision and speed of event relationships in a big data set or a mass data set are solved, and parallel operation can be carried out by using a convolution network, so that the problem of extraction speed is solved; and meanwhile, the convolutional network is used for extracting the features on the graph data structure, so that the defect of low precision caused by weak extracted features in the traditional method is overcome by utilizing the advantage of strong feature extraction of the convolutional network.

Description

technical field [0001] The invention relates to the field of event relationship extraction, in particular to a method for extracting event relationship from massive data sets. Background technique [0002] At present, in the development of the knowledge graph system, for the extraction of event relationships, the mainstream algorithms are based on remote supervision algorithms. This algorithm is more practical for small data sets. Once the number of entities in the data set reaches tens of millions, It faces shortcomings such as slow calculation, low accuracy of event relationship extraction, and the need for a large number of manually labeled training samples. It is difficult for the traditional remote supervision training algorithm to reach 60% on large and massive data sets, and the number of manually labeled samples required by the traditional method is basically 100,000 or even higher. Contents of the invention [0003] In order to solve the above problems, the prese...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/36G06F16/33G06F16/35G06F16/28
CPCG06F16/285G06F16/288G06F16/3347G06F16/35G06F16/367
Inventor 陈泽勇张治同张莉姚松
Owner 成都迪普曼林信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products