Sequence labeling method and device

A technology of sequence labeling and labeling data, applied in the computer field, can solve problems such as word segmentation and target mismatch, and achieve the effect of alleviating the mismatch

Active Publication Date: 2020-09-15
PEKING UNIV
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a sequence tagging method and device to solve, or at least partly solve, the defect that the word segmentation does not match the target in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sequence labeling method and device
  • Sequence labeling method and device
  • Sequence labeling method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0035] In order to overcome the above-mentioned problems in the prior art, an embodiment of the present invention provides a sequence tagging method and device. The inventive idea is to implicitly learn the word segmentation information of the text through reinforcement learning technology, so as to alleviate the mismatch between the word segmentat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a sequence labeling method and device. The method comprises the steps of obtaining a sentence to be analyzed; inputting a sentence to be analyzed into the sequence labeling model, and outputting a word segmentation result of the sentence and a labeling result of each word; wherein the sequence annotation model is obtained by performing reinforcement learning-based training based on sample sentences and segmented words and annotation data corresponding to the sample sentences. According to the sequence labeling method and device provided by the embodiment of the invention, word segmentation information of the text is implicitly learned through a reinforcement learning technology, and mismatching between segmented words in Chinese sequence labeling and a target can be relieved.

Description

technical field [0001] The embodiments of the present invention relate to the field of computer technology, and more specifically, to a sequence labeling method and device. Background technique [0002] Sequence tagging technology is an important technology in the field of natural language processing. Basic tasks such as part-of-speech tagging, chunk analysis, and named entity recognition can all be solved using sequence tagging technology. Traditional methods use Hidden Markov Models, Maximum Entropy Markov Models, and Conditional Random Fields to solve the problem. In order to enhance the effect of the model, many manual features are added to the model, such as part of speech, gazetteer and other information are integrated into the task of named entity recognition. [0003] In the Chinese sequence tagging task, it is usually necessary to segment the text first, and then the sequence tagging pipeline will easily lead to the problem that the word segmentation does not match...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/117G06F40/289G06F40/205G06N3/04G06N3/08
CPCG06F40/117G06F40/289G06F40/205G06N3/08G06N3/045
Inventor 叶蔚张世琨盛中昊刘学洋胡文蕙
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products