Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-source data fusion method and system based on word vector matrix decomposition technology

A technology of multi-source data and matrix decomposition, which is applied in electronic digital data processing, special data processing applications, natural language data processing, etc. The effect of improving quality

Pending Publication Date: 2022-03-11
武汉东湖大数据交易中心股份有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of this, the present invention proposes a multi-source data fusion method based on word vector matrix decomposition technology, which is used to solve the problem that the implicit meaning of multi-source data and the relationship between data cannot be effectively extracted

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-source data fusion method and system based on word vector matrix decomposition technology
  • Multi-source data fusion method and system based on word vector matrix decomposition technology
  • Multi-source data fusion method and system based on word vector matrix decomposition technology

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0050] The working process of a multi-source data fusion method based on the word vector matrix decomposition technology of the present invention is shown in figure 1 , the processing steps are as follows:

[0051] The first step is to obtain multi-source data samples, the multi-source data samples include multi-modal data of text, voice, image, and video, and obtain the corresponding implicit semantic knowledge base. Go to the second step.

[0052] It should be understood that the implicit semantic knowledge base includes descriptive information and synonymous information corresponding to the implicit association of sample data; if the multimodal data is in the form of text, it is symbolic; if the multimodal data is audio or The visual form is represented as a signal, and the signal is converted into a corresponding text format, and the visual form includes pictures and videos.

[0053] For example, the contents of multi-modal data sets of multi-source data samples include:...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a multi-source data fusion method and system based on a word vector matrix factorization technology, and the method comprises the steps: obtaining multi-source data samples, each multi-source data sample comprises multi-modal data of texts, voices, images and videos, and obtaining a corresponding implicit semantic knowledge base; jointly projecting the multi-source data samples and corresponding semantic information extracted from the implicit semantic knowledge base into a shared semantic subspace to generate a word vector matrix; decomposing the word vector matrix to obtain low-dimensional features of the multi-source data sample; training a classifier by taking the low-dimensional features of the multi-source data samples as input and taking corresponding semantic information as labels; multi-modal data of a to-be-mined target task is subjected to same word vector matrix generation and decomposition matrix processing and then input to the trained classifier, semantic information of the target task is obtained, and hidden semantic mining of the target task is completed. According to the method, the implicit meaning of the multi-source data and the relationship between the data are effectively extracted.

Description

technical field [0001] The present invention relates to the technical field of multi-source data fusion, in particular to a multi-source data fusion method and system based on word vector matrix decomposition technology. Background technique [0002] New tools for data processing free data scientists from the tedious work of data preparation, but how to customize and integrate multi-source data to form an effective analysis data set according to each data analysis project is still a more difficult problem that data scientists must face. Challenging bottleneck. [0003] With the development of computer technology and communication technology, and the continuous emergence of new theories and methods, multi-source data fusion technology will become more mature, from theoretical research to practical and wider application, and eventually to the direction of intelligence and real-time develop. Traditional word vector learning methods often rely on a large number of unlabeled te...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/284G06F40/30G06F16/35
CPCG06F40/284G06F40/30G06F16/35
Inventor 杜登斌杜小军杜乐
Owner 武汉东湖大数据交易中心股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products