Header-column entity-relationship matching method based on deep learning multi-head selection model

A technology for selecting models and deep learning, applied in neural learning methods, biological neural network models, instruments, etc., can solve the problems of ignoring header semantics, high program startup cost, etc., and achieve the effect of low startup cost

Active Publication Date: 2021-11-16
南京烽火星空通信发展有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] The header column entity relationship matching technology is to determine the corresponding relationship between the two columns of the table. This technology plays an important role in table information mining. The existing technical solution is to use data item collision for judgment, ignoring the semantic information of the table header. This method needs to maintain a benchmark database by itself before data collision can be performed. The startup cost of the scheme is relatively high, and the header semantics are not used for auxiliary judgment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Header-column entity-relationship matching method based on deep learning multi-head selection model
  • Header-column entity-relationship matching method based on deep learning multi-head selection model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0027] Example: such as Figure 1-2 As shown, the present invention is based on the table header column entity relationship matching method of the deep learning multi-head selection model, including the following steps:

[0028] Step 1: Define data entity attribute categories for the data items in the table, including time, name and company name, and build a regular recognition method;

[0029] Step 2: Construct the artificial features of the combination of any two columns in the table header. The construction method of the artificial features can be selected according to the needs of the actual scene, and record the relationship matching problem of the entities in any two columns of the table header;

[0030] Step 3: After passing the header character sequence and the data attribute sequence corresponding to the header through the respective embedding layers, the merged vector is used as the input of the next encoding layer;

[0031] Step 4: The coding layer adopts the bi-ls...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a table header column entity relationship matching method based on a deep learning multi-head selection model, including: defining data entity attribute categories for the data items of the table, such as time, name, company name, etc., and constructing a regular recognition method; constructing a table header The artificial features of any two-column combination; after the header character sequence and the data attribute sequence corresponding to the header pass through their respective embedding layers; the encoding layer adopts the bi-lstm model structure; the context encoding information is based on the multi-head selection mechanism. Splicing two combinations; calculating the binary loss value of any two positions in the header sequence for each relationship category; converging the loss value to the best model and retaining it as the model for prediction. The header column entity relationship matching method based on the deep learning multi-head selection model makes this method more convenient through the header and model.

Description

technical field [0001] The present invention relates to the technical field of header column entities, in particular to a header column entity relationship matching method based on a deep learning multi-head selection model. Background technique [0002] The header column entity relationship matching technology is to determine the corresponding relationship between the two columns of the table. This technology plays an important role in table information mining. The existing technical solution is to use data item collision for judgment, ignoring the semantic information of the table header. This method needs to maintain a benchmark database by itself before data collision can be performed. The startup cost of the scheme is relatively high, and the header semantics are not used for auxiliary judgment. Therefore, we improve this and propose a method for matching the entity relationship of header columns based on the deep learning multi-head selection model. Contents of the i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/903G06F16/906G06F40/126G06F40/18G06N3/08
CPCG06F16/90344G06F16/906G06F40/126G06F40/18G06N3/084
Inventor 高永伟李曙光宋万军姜广栋杨万刚李峰蔡晨陈玉冰皮乾东黄昌彬杜俊杰张鑫涛
Owner 南京烽火星空通信发展有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products