Chinese syntax parsing method with merged semantic information

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of syntactic analysis and semantic information, which is applied in the fields of instruments, computing, and electronic digital data processing, etc., can solve problems such as whether the description of language phenomena is accurate, data is sparse, etc., and achieve the effects of performance improvement, efficiency and accuracy improvement, and performance improvement

Inactive Publication Date: 2009-09-02

PEKING UNIV

View PDF0 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, these two methods also have their own shortcomings: the introduction of lexical information in the lexicalization method has brought about certain data sparse problems, and the automatic refinement of tags in the non-lexicalization method has problems such as whether the description of language phenomena is accurate.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0031] Describe the specific embodiment of the present invention in detail below in conjunction with accompanying drawing, the method flowchart of the present invention is as follows image 3 shown.

[0032] 1. Build a word-semantic index

[0033] According to the hyponym relationship between sememes defined in HowNet, the semantic classes of different layers from coarse to fine are extracted, and correspond to each word, so as to construct the index from word to semantic class. The words here are accompanied by part-of-speech information.

[0034] 2. Add semantic class information to the original tree bank

[0035] For the original tree bank, the semantic class information is obtained by using word and part of speech as the key value, and then the semantic class information is attached to the part of speech (pre-terminal) level to realize the refinement of the part of speech layer tag. Such parts of speech contain semantic information.

[0036] Some words may have multipl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a Chinese syntax parsing method with merged semantic information, belonging to the technical field of natural language processing. The method comprises the following steps: step 1), extracting different hierarchical semantic classes of words according to the hyponymy of the knowledge network to obtain indexes from the words to the semantic classes; step 2), using a word in a syntactic tree as a key assignment and query the knowledge network to obtain a semantic class of the word and add the semantic class to a certain layer of the syntactic tree; step 3), using the syntactic tree after being processed in the step 2) as training data to train grammar so as to obtain a grammar model; step 4), utilizing the grammar model after being trained in the step 3) to decode a sentence to be analyzed. Compared with the prior art, the invention adopts the semantic information to disambiguate parsing so that the parsing effect is remarkably improved.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and specifically relates to a Chinese syntax analysis method incorporating semantic information, which introduces semantic knowledge into the syntax analysis to help improve the performance of syntax analysis. Background technique [0002] Syntactic analysis is a very important technology in natural language processing. It analyzes how words are combined to form meaningful phrases and sentences to reveal deep language laws. The results of syntactic analysis will directly affect the understanding of natural language. In actual natural language processing applications, a high-performance syntactic analyzer is conducive to improving the performance of high-level application systems such as information extraction, information retrieval, machine translation, and automatic question answering. [0003] The process of syntactic analysis is to deduce the grammatical structure of a sen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F17/27

Inventor 吴玺宏迟惠生罗定生林小俊樊杨

Owner PEKING UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Chinese syntax parsing method with merged semantic information

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology