Method and device for generating word feature vector, server, electronic equipment and storage medium

A technique for generating word feature vectors, applied in the field of natural language processing. It addresses the low training and prediction efficiency and the high memory and computing resource consumption of existing methods, improving training and prediction efficiency while preserving training and prediction quality.

Pending Publication Date: 2022-06-28
刘凡平

AI Technical Summary

Problems solved by technology

[0004] The embodiments of the present invention provide a technical concept of training word feature vectors based on errors, in which word feature vectors are derived quickly through error calculation. This addresses the problems that mainstream word vector generation methods in the prior art have relatively low training and prediction efficiency and consume a large amount of memory and computing resources.

Embodiment Construction

[0035] To make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0036] It should be noted that the embodiments in the present application and the features of the embodiments may be combined with each other in the case of no conflict.

[0037] The invention may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generall...

Abstract

The invention discloses a method for generating a word feature vector. The method comprises: obtaining a minimum training unit and performing word segmentation on it to obtain a target word set; obtaining an initial feature vector for each target word in the target word set; training the target words in the target word set according to the difference between the sum of the initial feature vectors of all the target words in the set and the number of target words in the set; and determining the word feature vector of each target word according to the training result. The invention further discloses a device and a server for generating word feature vectors. With the device and method provided by the invention, word feature vectors can be derived quickly, which addresses the problems that mainstream word vector generation methods in the prior art have relatively low training and prediction efficiency and consume a large amount of memory and computing resources.
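The abstract does not spell out how a sum of vectors is compared with a word count, so the following is only a minimal sketch under stated assumptions, not the patented procedure. It assumes that the error is taken per dimension as the number of target words minus the corresponding component of the summed vectors, and that this error is spread evenly over the words. The names `segment`, `train_unit`, `dim`, `lr` and `epochs` are hypothetical, and whitespace splitting stands in for real word segmentation.

```python
import random


def segment(text):
    """Hypothetical stand-in for word segmentation: split on whitespace
    and drop duplicates so the result behaves like a target word set."""
    return list(dict.fromkeys(text.split()))


def train_unit(text, vectors, dim=4, lr=0.1, epochs=100, seed=0):
    """Train the vectors of the words in one minimum training unit.

    Assumption: the per-dimension error is (number of target words)
    minus (sum of the word vectors in that dimension).
    """
    rng = random.Random(seed)
    words = segment(text)
    n = len(words)

    # Give each unseen target word a small random initial feature vector.
    for w in words:
        if w not in vectors:
            vectors[w] = [rng.uniform(-0.5, 0.5) for _ in range(dim)]

    for _ in range(epochs):
        # Component-wise sum of the vectors of all target words.
        total = [sum(vectors[w][k] for w in words) for k in range(dim)]
        # Error between the word count and the summed vectors.
        err = [n - t for t in total]
        # Spread a fraction of the error evenly across the target words.
        for w in words:
            for k in range(dim):
                vectors[w][k] += lr * err[k] / n
    return vectors


if __name__ == "__main__":
    trained = train_unit("error based word vectors", {})
    for word, vec in trained.items():
        print(word, [round(x, 3) for x in vec])
```

Under these assumptions each pass shrinks the error by a factor of `1 - lr`, so the update needs no gradient computation, which is in the spirit of the efficiency claim but is not taken from the source.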

Description

Technical field

[0001] The present invention relates to the technical field of natural language processing, and in particular, to a method for generating a word feature vector, an apparatus for generating a word feature vector, a server, an electronic device and a storage medium.

Background technique

[0002] Word feature vector (abbreviation: word vector) is the technology of choice for text vectorization in various natural language processing tasks, such as part-of-speech tagging, named entity recognition, text classification, document clustering, sentiment analysis, document generation, question answering systems, etc. A word vector maps words to a vector of fixed dimension. Mathematically, a word vector can be expressed as a mapping f: x→y, where x is a word or paragraph text, and y is the fixed-dimension vector after the mapping.

[0003] The current mainstream word vector generation methods include Word2vec, GloVe, and BERT models. However, the Word2vec model ha...
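The mapping f: x→y from paragraph [0002] can be illustrated with a small lookup table. This is a sketch only: the table contents, the dimension `DIM`, the zero vector for unknown words, and the choice to average word vectors for multi-word text are all assumptions made for illustration and are not taken from the source.

```python
from typing import Dict, List

DIM = 3  # assumed fixed output dimension

# Hypothetical toy table of already trained word vectors.
TABLE: Dict[str, List[float]] = {
    "word": [1.0, 0.0, 2.0],
    "vector": [3.0, 2.0, 0.0],
}


def f(x: str) -> List[float]:
    """Map a word or a whitespace-separated text x to a DIM-dimensional y.

    Assumption: text is represented by the average of its word vectors,
    and unknown words contribute a zero vector.
    """
    words = x.split()
    if not words:
        return [0.0] * DIM
    rows = [TABLE.get(w, [0.0] * DIM) for w in words]
    return [sum(row[k] for row in rows) / len(rows) for k in range(DIM)]


if __name__ == "__main__":
    print(f("word"))
    print(f("word vector"))
```

Whatever the length of the input text, the output always has `DIM` components, which is the property the fixed-dimension mapping describes.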

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/289, G06F40/30, G06F40/126
CPC: G06F40/289, G06F40/30, G06F40/126
Inventor: 刘凡平
Owner: 刘凡平