Word vector training method and apparatus

A training method and word vector technology, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve the problem of low word vector training efficiency, and achieve the effect of improving training efficiency

Active Publication Date: 2017-06-27
BEIHANG UNIV
View PDF2 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method makes the training efficiency of word vectors low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word vector training method and apparatus
  • Word vector training method and apparatus
  • Word vector training method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In order to make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of the embodiments of the present invention, not all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0045] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and the above-mentioned drawings are used to distinguish similar objects, without having to use To describe a specific order or sequence. It should be understood that the data used in this way can be interchange...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a word vector training method and apparatus, and belongs to the technical field of machine learning. The word vector training method comprises the steps of obtaining a newly added vocabulary library, wherein vocabularies in the newly added vocabulary library and vocabularies in an old vocabulary library form a new vocabulary library, and the vocabularies in the old vocabulary library correspond to old word vectors; performing initialization processing on the vocabularies in the new vocabulary library, thereby enabling the word vectors of the vocabularies, belonging to the old vocabulary library, in the new vocabulary library to be the old word vectors, and enabling the word vectors of the vocabularies, belonging to the newly added vocabulary library, in the new vocabulary library to be random word vectors; and updating the word vectors of the vocabularies in the new vocabulary library according to a first Huffman tree corresponding to the new vocabulary library and a second Huffman tree corresponding to the old vocabulary library respectively. According to the word vector training method and apparatus provided by the invention, the training efficiency of the word vectors is improved.

Description

Technical field [0001] The invention relates to the technical field of machine learning, in particular to a word vector training method and device. Background technique [0002] In machine learning technology, in order to make the machine understand the meaning of human language, the word representation tool of the neural network language model converts each vocabulary in the human language into the form of word vector, so that the computer can learn the human language through the word vector The meaning of each word. [0003] With the existing technology, when a new vocabulary is added to the vocabulary, it is usually necessary to re-learn all the vocabulary in the new vocabulary to obtain a new word vector for each vocabulary. However, using this method makes the training efficiency of word vectors lower. Summary of the invention [0004] The invention provides a word vector training method and device, which improves the training efficiency of the word vector. [0005] The embodi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/2365G06F40/242G06F40/284
Inventor 李建欣刘垚鹏彭浩张日崇陈汉腾
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products