High-precision Chinese predicate identification method

A recognition method and high-precision technology, applied in special data processing applications, instruments, electrical digital data processing and other directions, can solve the problems of no recognition accuracy rate, seldom used, less research, etc., to achieve good application value and The effect of promotion value, small calculation consumption, and high recognition rate

Inactive Publication Date: 2013-06-12
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF2 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] (2) Predicate recognition for scientific and technological papers: This method is a predicate recognition proposed for the syntactic analysis research of a specific style of scientific and technological papers. It only recognizes the case where the verb is the central predicate (limited to the predicate of the first layer of the sentence). The specific recognition accuracy of the experiment is not given
[0018] Summarizing the above several predicate recognition methods, it can be concluded that: (1) the accuracy of predicate recognition obtained by various methods generally does not exceed 90%, and there is still a lot of room for improvement in accuracy; (2) most of the features used for predicate recognition are only Lexical features are used, and other more advanced features are rarely used; (3) Most methods only recognize verbal predicates, and there are few studies on other parts of speech such as adjectives and idioms as predicates

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-precision Chinese predicate identification method
  • High-precision Chinese predicate identification method
  • High-precision Chinese predicate identification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In order to better illustrate the purpose and advantages of the present invention, the implementation of the method of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples.

[0055] In order to identify predicates with high efficiency and high accuracy, a predicate recognition experiment is designed and deployed. In order to use a small number of features to achieve better recognition results, remove mutual constraints and features that will reduce accuracy, and obtain the optimal feature combination, first of all, perform feature screening experiments; in order to obtain the best recognition results under the same features and algorithms , it is necessary to optimize the parameters of the algorithm, so the parameter selection experiment is also carried out.

[0056] The experimental data comes from the BFS-CTC Chinese Tag Corpus (Beijing Forest Studio-Chinese Tag Corpus). Compared with the CPB corpu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a predicate identification method based on a combination of rules and statistics, and belongs to the field of natural language processing and machine learning. The identification method aims at achieving high-precision and high-efficiency predicate identification. The stepped identification method identifies predicates from morphology and syntax labeled sentences, and comprises the steps of conducting morphological analysis on the sentences to be detected, obtaining suspicious predicates and a number thereof, preliminarily identifying the predicates by using preliminary identification judgment conditions, extracting relevant morphological and syntactic characteristics of the suspicious predicates dissatisfying the preliminary identification judgment conditions, judging the predicates with a decision-making tree judgment model obtained from C4.5 training, and finally summarizing identification results in the two steps to present the predicates of the sentences to be detected. The identification method has the characteristics of high accuracy rate, identification speed and identification rate for the non-verbal predicates, and the like, is applicable to the field requiring high-precision Chinese predicate identification, greatly promotes development of sentence meaning analysis, and has high application and popularization values.

Description

technical field [0001] The invention relates to a Chinese predicate recognition method based on the combination of rules and statistics, belonging to the fields of natural language processing and machine learning. Background technique [0002] Significant progress has been made in the research of lexical and syntactic research in natural language processing technology. In comparison, the research on semantics, pragmatics and contextual knowledge has always been a bottleneck that is difficult to overcome. Semantic analysis is the only way for computers to truly understand natural language. Predicate recognition is the basis for further semantic analysis and plays a key role in the follow-up work of semantic analysis. Therefore, a high-accuracy and high-efficiency predicate recognition method is particularly important. [0003] Chinese predicate recognition needs to solve two basic problems: 1. How to extract representative and highly differentiated rules or feature combinati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/28
Inventor 罗森林白建敏潘丽敏韩磊魏超
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products