Part-of-speech tagging model training device and part-of-speech tagging system and method thereof

A part-of-speech tagging and model training technology, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses the problems of low recall, the lack of an effective solution for automatic part-of-speech tagging of unregistered words, and limited accuracy. The stated effects are good stability, an improved recall rate, and reduced dependence.

Status: Inactive; Publication Date: 2013-01-23
NEC (CHINA) CO LTD

AI Technical Summary

Problems solved by technology

[0009] Current technology does not effectively solve the problem of automatic part-of-speech tagging of unregistered words. The patent CN1369877 cannot give a reasonable part-of-speech judgment for unregistered words with zero character separation probability. In addition, the accuracy of the part-of-speech tagging depends on the selected dictionary, while the recall (20%) of the method in the literature [Lu, X.F. Hybrid Methods for POS Guessing of Chinese Unknown Words. Proceedings of the ACL Student Research Workshop, pages 1-6] is relatively low.

Examples

Embodiment Construction

[0036] Hereinafter, preferred embodiments of the present invention will be described with reference to the accompanying drawings. In the drawings, the same elements will be denoted by the same reference symbols or numerals. Also, in the following description of the present invention, detailed descriptions of known functions and configurations will be omitted to avoid making the subject matter of the present invention unclear.

[0037] Fig. 1a is a schematic diagram showing the first embodiment of the part-of-speech tagging system of the present invention. The left dashed box shows the part-of-speech tagging model training device 10, and the right dashed box shows the part-of-speech tagging device 20. The part-of-speech tagging model training device 10 includes a dictionary 1, a dictionary semantic extension device 2, a part-of-speech tagging model training device 3, and a part-of-speech tagging model 4; the part-of-speech tagging device 20 includes an input device 6, a mode...

Abstract

The invention relates to a part-of-speech tagging model training device, which comprises a direct constituent analysis unit for performing direct constituent analysis of words to obtain their direct constituents, the attributes of those constituents, and their positional relations; a converting unit for converting the results of the direct constituent analysis into training data; and a machine learning unit for performing machine learning on the converted training data to generate a part-of-speech tagging model. The invention also relates to a part-of-speech tagging model training method and to a part-of-speech tagging system and method. The system comprises the part-of-speech tagging model training device, which performs direct constituent analysis of the words in a dictionary to generate the part-of-speech tagging model, and a part-of-speech tagging device, which uses that model to tag unlisted words. According to the system of the invention, the part of speech of unlisted words can be tagged accurately on the basis of prior text information, and the efficiency of text information processing is improved.
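The three units named in the abstract (direct constituent analysis, conversion to training data, machine learning) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the English toy lexicon, the binary split rule, the function names, and the majority-vote counter that stands in for the unspecified machine learning method.

```python
from collections import Counter, defaultdict

# Hypothetical toy data (not from the patent): known constituents with their
# attributes, and a small dictionary of listed words with known parts of speech.
CONSTITUENT_POS = {"black": "ADJ", "board": "NOUN", "bird": "NOUN",
                   "sun": "NOUN", "light": "NOUN", "over": "ADP",
                   "look": "VERB", "take": "VERB"}
DICTIONARY = {"blackboard": "NOUN", "sunlight": "NOUN", "overlook": "VERB"}


def analyze_constituents(word):
    """Stand-in for direct constituent analysis: return (constituent,
    attribute, position) triples for the first binary split whose two
    halves are both known constituents, or None if no such split exists."""
    for i in range(1, len(word)):
        left, right = word[:i], word[i:]
        if left in CONSTITUENT_POS and right in CONSTITUENT_POS:
            return [(left, CONSTITUENT_POS[left], 0),
                    (right, CONSTITUENT_POS[right], 1)]
    return None


def to_features(constituents):
    """Stand-in for the converting unit: keep the constituent attributes,
    ordered by position, as one hashable feature tuple."""
    ordered = sorted(constituents, key=lambda c: c[2])
    return tuple(attr for _, attr, _ in ordered)


def train(dictionary):
    """Stand-in for the machine learning unit: for each feature tuple seen
    in the dictionary, remember the most frequent part of speech."""
    counts = defaultdict(Counter)
    for word, pos in dictionary.items():
        analysis = analyze_constituents(word)
        if analysis is not None:
            counts[to_features(analysis)][pos] += 1
    return {feat: c.most_common(1)[0][0] for feat, c in counts.items()}


def tag(word, model):
    """Tag an unlisted word, or return None if it cannot be analyzed
    or its feature tuple was never seen in training."""
    analysis = analyze_constituents(word)
    if analysis is None:
        return None
    return model.get(to_features(analysis))


model = train(DICTIONARY)
print(tag("blackbird", model))  # unlisted word with ADJ + NOUN constituents
print(tag("overtake", model))   # unlisted word with ADP + VERB constituents
```

In this sketch the model is trained only on words already in the dictionary, and an unlisted word is tagged from the attributes and order of its constituents, which mirrors the division of labor between the training device and the tagging device described above. A real system would likely use richer features and a proper statistical learner.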

Description

Technical field

[0001] The present invention relates to the field of text information processing, in particular to a part-of-speech tagging model training device and method thereof, and a part-of-speech tagging system and method thereof.

Background technique

[0002] With the widespread popularization of the Internet and the increasing informatization of society, there is more and more text information, and the corresponding social demand for text information processing is increasing. People are increasingly eager to use natural language to communicate with computers, and hope to use automated means to process massive amounts of text information. To process text information better, people need to accumulate a large amount of language data resources, including dictionaries. Dictionaries, an important tool for processing text, are often compiled by humans. The main components of a dictionary include words and their attributes. The currently developed unregistered words (ma...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/27
Inventor: 胡长建, 赵凯, 邱立坤
Owner: NEC (CHINA) CO LTD