Word vector learning model training method and server

A technology of learning models and training methods, applied in the field of machine learning, which can solve the problems that have nothing to do with the smallest semantic unit of words—sememe, and cannot fully express the semantic information of words

Inactive Publication Date: 2017-10-10
TSINGHUA UNIV
View PDF3 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the word vector in the above word vector learning model is not related to the smallest semantic unit of the word - sememe, therefore, it cannot fully express the semantic information of the word

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word vector learning model training method and server
  • Word vector learning model training method and server
  • Word vector learning model training method and server

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0031] figure 1 It is a schematic flow chart of the training method of the word vector learning model in the embodiment of the present invention, as figure 1 As shown, a method for training a word vector learning model provided in an embodiment of the present invention includes the following steps:

[0032] S1: Obtain the word sense vector and s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Embodiments of the invention provide a word vector learning model training method and a server. The method comprises the following steps of: obtaining a word meaning vector and a primitive vector corresponding to a word in a training sample text; expressing a word vector in the training sample text according to the word meaning vector or the primitive vector; and substituting the word vector expressed by the word meaning vector or the primitive vector into an existing word vector learning model and carrying out model training. The server is used for executing above method. According to the word vector learning model training method and the server provided by the invention, the word vector is expressed through the primitive vector or the word meaning vector, and the word vector learning model corresponding to the word vector expressed by the word meaning vector or the primitive vector is trained, so that the semantic information of the word can be sufficiently expressed.

Description

technical field [0001] The embodiment of the present invention relates to the technical field of machine learning, and in particular to a training method and server for a word vector learning model. Background technique [0002] A sememe is the smallest semantic unit of a word, and a limited number of sememes can be used to describe the semantics of a large number of Chinese words. For each word, the sememe possessed by the word is not intuitively displayed. In order to conduct corresponding research, some researchers have manually marked each word, and assigned a word to each word through linguistic knowledge. or multiple sememes, and thus form a knowledge base HowNet (HowNet). [0003] Word vector refers to the vectorization of words. Word vectors include different dimensions. Each dimension describes a feature of a word. Converting words into word vectors can help computers understand natural language. The existing technology uses word The vector learning model is train...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/35G06F40/284G06F40/30
Inventor 谢若冰牛艺霖刘知远孙茂松
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products