Word vector training method, system, device and medium based on multi-task learning
A technology of multi-task learning and training methods, which is applied in the fields of instruments, computing, and electrical digital data processing, etc., and can solve problems such as inability to express word similarity, word replacement interference, and high computational complexity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0071] This embodiment provides a word vector training method based on multi-task learning, which is realized by using a word vector dictionary, a language model module and a named entity recognition module. The specific descriptions of the word vector dictionary, language model module and named entity recognition module are as follows :
[0072] 1) The input of the word vector dictionary is the one-hot vector of the word to be queried, and the output is the word vector representation of the word. The word vector dictionary is actually a dictionary matrix. For the input one-hot vector, the index value is 1 Perform a query to get the word vector representation of the word. The principle of the word vector dictionary is as follows figure 1 shown.
[0073] 2) The language model module is the first external practical task, that is, to establish a language model. The language model refers to the probability of occurrence of a word sequence. For example, the probability of occurren...
Embodiment 2
[0105] Such as Figure 9 As shown, the present embodiment provides a word vector training system based on multi-task learning, the system includes an acquisition unit 901, a construction unit 902 and a training unit 903, and the specific functions of each unit are as follows:
[0106] The acquiring unit 901 is configured to acquire a training set; wherein, the training set includes paired data of text word sequence-named entity tag sequence.
[0107] The building unit 902 is configured to build a language model module and a named entity recognition module, and use the language model module and the named entity recognition module as external modules.
[0108] The training unit 903 is used to alternately train the word vector dictionary and the external module; wherein, the word vector dictionary uses the text word sequence and the output of the external module for training, and the language model module uses the word vector dictionary training output of the word vector The nam...
Embodiment 3
[0111] This embodiment provides a computer device, which may be a server, a computer, etc., such as Figure 10 As shown, it includes a processor 1002 connected through a system bus 1001, a memory, an input device 1003, a display 1004 and a network interface 1005. The processor is used to provide calculation and control capabilities. The memory includes a non-volatile storage medium 1006 and internal Memory 1007, the non-volatile storage medium 1006 stores an operating system, computer programs and databases, the internal memory 1007 provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium, and the processor 1002 executes memory storage During computer program, realize the word vector training method of above-mentioned embodiment 1, as follows:
[0112] Obtain a training set; Wherein, the training set includes paired data of text word sequence-named entity tag sequence;
[0113] Build the language model module a...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


