A sequence marking system and method

A sequence tagging and sequence technology, applied in the field of natural language processing, can solve problems such as unbalanced distribution, time-consuming and labor-intensive models, and models that cannot process text correctly, so as to improve production efficiency and reuse rate

Pending Publication Date: 2019-03-29
成都数联铭品科技有限公司
View PDF11 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the large differences between the new data and the training data, such as the appearance of some proper nouns (such as the name "Abraham", which does not exist in the training data), or the incomplete coverage and uneven distribution of the training data, it will lead to training A good model cannot process some texts correctly, and re-labeling data training is time-consuming and laborious

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A sequence marking system and method
  • A sequence marking system and method
  • A sequence marking system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. The components of the embodiments of the invention generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations. Accordingly, the following detailed description of the embodiments of the invention provided in the accompanying drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without making creative efforts belong to the protection scope of the present invention.

[0031] In order to understand the sequence tagging s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a sequence marking system, which comprises a model marking module, an adjustment module, and a strategy base; an output end of the model marking module and an input end of theadjustment module. The model annotation module is used to annotate the input text data serially. One or more policies are stored in the policy library, and the adjustment module is used for retrieving the policies from the policy library, and adjusting the annotation results output by the model annotation module according to the policies and the input text data. As that system or the method of the invention label the text sequence, the accuracy and the applicability of the original model labeling can be enhance.

Description

technical field [0001] The invention relates to the technical field of natural language processing, in particular to a sequence labeling system and method. Background technique [0002] Most of the knowledge and information of human society are recorded in the form of text. These knowledge and information are described in the form of human language, which cannot be directly recognized by machines. Natural language processing is an algorithmic technology for processing human natural language texts, among which Words Segmentation, POS Tagging and Named Entity Recognition are the basic tasks. Word segmentation is to divide a sentence from a sequence of words into a sequence of words; part-of-speech tagging is to assign a part of speech to each word, such as nouns, verbs, adjectives, etc.; named entity recognition is to extract specific types of nouns in the text, such as " Xiao Ming" (type: person name), "this morning" (type: time). Word segmentation, part-of-speech tagging,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/21
CPCG06F40/117
Inventor 纪大胜崔诚煜刘世林丁国栋曾途吴桐
Owner 成都数联铭品科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products