A Scalable Neural Network Based Sequence Labeling Method

A neural network and sequence labeling technology, applied in the field of natural language processing, to improve the effect, reduce the risk of model overfitting, and reduce the training time.
CN107894971BActive Publication Date: 2019-11-26PEKING UNIV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
PEKING UNIV
Publication Date
2019-11-26

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention discloses an extensible sequence labeling method based on a neural network. The method includes the steps of creating a laminated nth-order model, and utilizing the laminated nth-order model to carry out label prediction on a set sequence to obtain a label sequence. The training process of the laminated nth-order model includes the steps of firstly, generating n label sets accordingto labels in each labeling unit in a training corpus, wherein the n label sets comprise the first-order label set, the second-order label set... and the nth-order label set; combining the label of a labeling unit i with the label of an adjacent n-1 labeling unit to form an nth-order label of the labeling unit i, wherein the nth-order label set is a label set composed of nth-order labels of each labeling unit; then, utilizing all the obtained label sets different in order to train the neural network separately to obtain n models, wherein the n models include the first-order neural network model, the second-order neural network mode ... and the nth-order neural network model. The extensible sequence labeling method can obviously reduce the overfitting risks of models and improve the effect of a sequence labeling task.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention belongs to the field of natural language processing and relates to sequence labeling, in particular to a sequence labeling method for combined decoding of different order model information. Background technique

[0002] When the neural network deals with the problem of sequence labeling, in the training phase, its corresponding label is predicted for each. The cost function is the cross entropy between the predicted output of the neural network and the standard label, and the training process minimizes the objective function. In the decoding stage, the label of the current word is directly predicted by the neural network.

[0003] When the existing neural network deals with the sequence labeling problem, the label predicted for the current word (word) does not involve the surrounding word (word) labels, that is, the predicted label of each word (word) is independent of other words (words) , and then perform gradient descent on the basis ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More