
Complex-valued word vector construction method based on position and semantics

A method for constructing complex-valued word vectors based on position and semantics. It addresses the underuse of complex-valued word vectors in existing approaches and mitigates the shortage of text classification corpora.

Pending Publication Date: 2020-02-28
TIANJIN UNIV

AI Technical Summary

Problems solved by technology

However, current methods use complex vectors without fully exploiting complex-valued word vectors to mine the relative position information of words in sentences.
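
One way to see why a complex-valued representation can carry relative position is sketched below. It assumes, purely for illustration, that each dimension of a word vector has the amplitude-phase form r * exp(i * (w * pos + theta)), with the amplitude r standing in for semantics and the phase for position. This form, the parameter values, and the names `complex_embedding` and `hermitian_inner` are assumptions and are not taken from the patent text. Under that assumption, the Hermitian inner product of the same word at two positions depends only on the distance between the positions.

```python
import cmath


def complex_embedding(amplitudes, frequencies, position, phases=None):
    """Return one complex number per dimension: r * exp(i * (w * pos + theta))."""
    if phases is None:
        phases = [0.0] * len(amplitudes)
    return [
        r * cmath.exp(1j * (w * position + theta))
        for r, w, theta in zip(amplitudes, frequencies, phases)
    ]


def hermitian_inner(u, v):
    """Hermitian inner product <u, v> = sum(u_k * conj(v_k))."""
    return sum(a * b.conjugate() for a, b in zip(u, v))


# Hypothetical 3-dimensional parameters for a single word (illustrative values only).
amplitudes = [0.5, 1.0, 0.2]
frequencies = [0.1, 0.7, 1.3]

# The same word at positions (2, 5) and at positions (10, 13):
# both pairs are three tokens apart.
near = hermitian_inner(
    complex_embedding(amplitudes, frequencies, 2),
    complex_embedding(amplitudes, frequencies, 5),
)
far = hermitian_inner(
    complex_embedding(amplitudes, frequencies, 10),
    complex_embedding(amplitudes, frequencies, 13),
)

# The two products agree up to floating-point error, because the absolute
# positions cancel and only the offset remains in the phase.
print(abs(near - far) < 1e-9)
```

In this sketch the absolute positions cancel inside the product, so a model that compares words through such products sees their offset rather than where the pair sits in the sentence.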




Embodiment Construction

[0052] The technical solution of the present invention is described in further detail below with reference to the accompanying drawings, but the scope of protection of the present invention is not limited to the following description. Figure 1 shows the flow of the proposed neural network classification method based on complex-valued vector representations of position and semantics; Figure 2 shows a possible distribution of three-dimensional complex word vectors; Figure 3 compares the running times of classification models that use different word vectors; Figure 4 compares the similarity of different words in the same sentence under different word vectors.

[0053] Traditionally, the extraction, sorting, and processing of webpage text containing keywords has been done entirely by hand, which is time-consuming and laborious. With the explosive growth of webpage data, manual methods have become inefficient. In this case, this...



Abstract

The invention discloses a method for constructing complex-valued word vectors based on position and semantics, comprising the following steps: collecting a text classification corpus and dividing it into a training set, a validation set, and a test set; preprocessing the text in the corpus (removing stop words); constructing sentence representations from relative position information and global semantic information; feeding the word vectors of the training set into a complex-valued neural network to train a semantic classification model; feeding the word vectors of the validation set text into the complex-valued neural network model to calculate the prediction probability of each sample; and testing the model obtained on the validation set against the test set. The method mitigates the relative shortage of text classification corpora, extracts the feature information (position information) of the text more fully, fuses the position information and the global semantic information of the text, and applies the complex-valued word vectors to a complex-valued neural network, giving the neural network model relatively strong discriminative ability.
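
The first two steps of the abstract (splitting the corpus and removing stop words) can be sketched as follows. This is a minimal illustration and not the patent's implementation: the stop-word list, the 80/10/10 split ratio, the whitespace tokenizer, and the function names `preprocess` and `split_corpus` are all assumptions chosen for the example.

```python
import random

# Hypothetical stop-word list; a real system would use a much larger one.
STOP_WORDS = {"the", "a", "is", "of"}


def preprocess(text):
    """Lowercase the text, split on whitespace, and drop stop words."""
    return [tok for tok in text.lower().split() if tok not in STOP_WORDS]


def split_corpus(samples, train=0.8, val=0.1, seed=0):
    """Shuffle a copy of the samples and split it into
    training, validation, and test sets (ratios are assumed)."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_train = int(len(shuffled) * train)
    n_val = int(len(shuffled) * val)
    return (
        shuffled[:n_train],
        shuffled[n_train:n_train + n_val],
        shuffled[n_train + n_val:],
    )


# Toy labeled corpus of (text, label) pairs, invented for illustration.
corpus = [("The movie is a triumph of style", 1)] * 5 + [("The plot is a mess", 0)] * 5

train_set, val_set, test_set = split_corpus(corpus)
train_tokens = [(preprocess(text), label) for text, label in train_set]
print(len(train_set), len(val_set), len(test_set))
```

Fixing the shuffle seed keeps the three sets reproducible, so a model chosen on the validation set is always evaluated on the same held-out test samples.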

Description

Technical field

[0001] The invention relates to the technical field of text classification, and in particular to a method for constructing complex-valued vectors based on position and semantics.

Background

[0002] In the past few years, with the rapid development of science and technology, and especially of the Internet and social networks, all kinds of information have flooded the Internet, including comments and personal opinions that users publish on social platforms. The Internet has also become one of the main sources of information for users in their daily lives. People can obtain a large amount of information through the Internet, but how to manage it reasonably and effectively has become an increasingly pressing concern. Classification is a very common way to manage large amounts of information, which is why text classification has considerable social value. The present invention...


Application Information

IPC(8): G06F16/35, G06K9/62, G06N3/04, G06N3/08, G06F16/383, G06F16/387, G06F40/289
CPC: G06F16/35, G06N3/084, G06F16/383, G06F16/387, G06N3/044, G06N3/045, G06F18/214
Inventor: 赵东浩, 张鹏
Owner: TIANJIN UNIV