Chinese syntax parsing method with merged semantic information

A technology of syntactic analysis and semantic information, which is applied in the fields of instruments, computing, and electronic digital data processing, etc., can solve problems such as whether the description of language phenomena is accurate, data is sparse, etc., and achieve the effects of performance improvement, efficiency and accuracy improvement, and performance improvement

Inactive Publication Date: 2009-09-02
PEKING UNIV
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these two methods also have their own shortcomings: the introduction of lexical information in the lexicalization method has brought about certain data sp

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese syntax parsing method with merged semantic information
  • Chinese syntax parsing method with merged semantic information
  • Chinese syntax parsing method with merged semantic information

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0031] Describe the specific embodiment of the present invention in detail below in conjunction with accompanying drawing, the method flowchart of the present invention is as follows image 3 shown.

[0032] 1. Build a word-semantic index

[0033] According to the hyponym relationship between sememes defined in HowNet, the semantic classes of different layers from coarse to fine are extracted, and correspond to each word, so as to construct the index from word to semantic class. The words here are accompanied by part-of-speech information.

[0034] 2. Add semantic class information to the original tree bank

[0035] For the original tree bank, the semantic class information is obtained by using word and part of speech as the key value, and then the semantic class information is attached to the part of speech (pre-terminal) level to realize the refinement of the part of speech layer tag. Such parts of speech contain semantic information.

[0036] Some words may have multipl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese syntax parsing method with merged semantic information, belonging to the technical field of natural language processing. The method comprises the following steps: step 1), extracting different hierarchical semantic classes of words according to the hyponymy of the knowledge network to obtain indexes from the words to the semantic classes; step 2), using a word in a syntactic tree as a key assignment and query the knowledge network to obtain a semantic class of the word and add the semantic class to a certain layer of the syntactic tree; step 3), using the syntactic tree after being processed in the step 2) as training data to train grammar so as to obtain a grammar model; step 4), utilizing the grammar model after being trained in the step 3) to decode a sentence to be analyzed. Compared with the prior art, the invention adopts the semantic information to disambiguate parsing so that the parsing effect is remarkably improved.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and specifically relates to a Chinese syntax analysis method incorporating semantic information, which introduces semantic knowledge into the syntax analysis to help improve the performance of syntax analysis. Background technique [0002] Syntactic analysis is a very important technology in natural language processing. It analyzes how words are combined to form meaningful phrases and sentences to reveal deep language laws. The results of syntactic analysis will directly affect the understanding of natural language. In actual natural language processing applications, a high-performance syntactic analyzer is conducive to improving the performance of high-level application systems such as information extraction, information retrieval, machine translation, and automatic question answering. [0003] The process of syntactic analysis is to deduce the grammatical structure of a sen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27
Inventor 吴玺宏迟惠生罗定生林小俊樊杨
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products