Syntax analysis method and device for layering Chinese long sentences based on punctuation treatment

A hierarchical syntactic analysis technology, applied in the fields of electrical digital data processing, special data processing applications, and instruments, that addresses problems such as sentence patterns being unknown before analysis.

Inactive Publication Date: 2007-03-14
INST OF AUTOMATION CHINESE ACAD OF SCI
Cites: 0 · Cited by: 12

AI Technical Summary

Problems solved by technology

However, research on the structural model of long sentences and on the relationship between sentence patterns and ideographic patterns is still in its infancy, and sentence patterns are usually unknown before syntactic analysis, so the problem has to be approached from another starting point.

Embodiment Construction

[0070] The specific steps of the hierarchical Chinese long sentence syntax analysis technology based on punctuation processing are as follows:

[0071] Step 1: Segment complex long sentences containing segmentation punctuation;

[0072] Step 2: Perform first-level syntactic analysis independently for each clause unit;

[0073] Step 3: Detect and merge phrases in parallel relation;

[0074] Step 4: Carry out the second-level analysis on the basis of the first-level analysis results, and finally obtain the complete syntax analysis tree of the entire sentence.

[0075] Figure 1: The overall structure of the system, which consists of four devices connected in sequence: a long sentence segmentation device, a first-level analysis device, a parallel phrase detection and merging device, and a second-level analysis device.
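The four steps and the sequential device layout can be illustrated with a minimal sketch. Everything named here is hypothetical: the set `SPLIT_PUNCTUATION` is only a stand-in for the "split" punctuation conditions that the source defines in Figure 2, and the three parser callables are placeholders for the first-level analysis, parallel phrase merging, and second-level analysis, whose internals are not described in this excerpt.

```python
from typing import Callable, List, Tuple

# Assumption: a stand-in set of "split" punctuation marks. The source defines
# the actual conditions in Figure 2, which this sketch does not reproduce.
SPLIT_PUNCTUATION = {"，", "；", "："}


def split_long_sentence(tokens: List[str]) -> Tuple[List[List[str]], List[str]]:
    """Step 1: cut a token sequence into clause units at split punctuation.

    Returns the clause units and the punctuation marks that separated them.
    """
    clauses: List[List[str]] = []
    separators: List[str] = []
    current: List[str] = []
    for tok in tokens:
        if tok in SPLIT_PUNCTUATION:
            if current:
                clauses.append(current)
                separators.append(tok)
            current = []
        else:
            current.append(tok)
    if current:
        clauses.append(current)
    return clauses, separators


def parse_hierarchically(
    tokens: List[str],
    parse_clause: Callable[[List[str]], object],
    merge_parallel: Callable[[List[object], List[str]], List[object]],
    parse_top: Callable[[List[object]], object],
) -> object:
    """Run the four stages in sequence, mirroring the four connected devices."""
    clauses, separators = split_long_sentence(tokens)   # Step 1
    subtrees = [parse_clause(c) for c in clauses]       # Step 2, per clause
    units = merge_parallel(subtrees, separators)        # Step 3
    return parse_top(units)                             # Step 4


# Toy placeholders so the pipeline can be exercised end to end.
demo_tokens = ["我", "去", "，", "他", "来", "。"]
demo_clauses, demo_separators = split_long_sentence(demo_tokens)
demo_tree = parse_hierarchically(
    demo_tokens,
    parse_clause=lambda clause: ("CLAUSE", clause),
    merge_parallel=lambda subtrees, seps: subtrees,
    parse_top=lambda units: ("S", units),
)
```

Because each clause unit is parsed independently in Step 2, the first-level parser only ever sees short inputs, which is one plausible reading of why the source reports reduced time consumption.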

[0076] Figure 2: The conditions that a punctuation mark must satisfy to be defined as a "split" punctuation. Among them, the punctuati...

Abstract

Unlike traditional methods, the new hierarchical syntactic analysis method for Chinese long sentences comprises: 1. using the special functions of certain punctuation marks to divide a complex long sentence into a sequence of sub-sentences; 2. extracting grammar rules and the corresponding probability distribution information from a large-scale database to analyze sentences and eliminate ambiguity. Experiments show that the invention reduces time consumption and improves the precision and recall of the analysis by about 7%.
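As an illustration of the second point, the sketch below estimates rule probabilities by relative frequency over observed rule applications. This is an assumption: the abstract says only that grammar rules and their probability distributions are extracted from a large-scale database, and does not name the estimator. The `Rule` type, the function name, and the toy rule list are all hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

# A rule is (left-hand side, right-hand side symbols), e.g. ("NP", ("a", "n")).
Rule = Tuple[str, Tuple[str, ...]]


def estimate_rule_probabilities(rules: Iterable[Rule]) -> Dict[Rule, float]:
    """Relative-frequency estimate: P(lhs -> rhs) = count(rule) / count(lhs)."""
    rule_counts = Counter(rules)
    lhs_counts: Counter = Counter()
    for (lhs, _rhs), n in rule_counts.items():
        lhs_counts[lhs] += n
    return {rule: n / lhs_counts[rule[0]] for rule, n in rule_counts.items()}


# Toy rule applications, as if read off a handful of annotated trees.
observed = [
    ("NP", ("n",)),
    ("NP", ("a", "n")),
    ("NP", ("n",)),
    ("VP", ("v", "NP")),
]
probs = estimate_rule_probabilities(observed)
```

With probabilities attached to rules, a parser can score competing analyses of an ambiguous clause and keep the most probable one, which is the disambiguation role the abstract assigns to this information.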

Description

Technical field

[0001] The invention relates to the technical field of natural language processing, in particular to a method and device for syntactic analysis of hierarchical Chinese long sentences based on punctuation processing.

Background technique

[0002] Syntactic parsing is one of the key technologies in natural language processing research, and the results of syntactic parsing directly affect the understanding of natural language sentences. Natural language understanding is the basis of many language processing technologies such as machine translation, information extraction, information retrieval, and automatic corpus processing. Under current conditions, syntactic analysis still plays a pivotal role in language information processing systems. At the same time, the technology used in syntactic analysis can also be used to solve problems similar to syntactic analysis in the field of biological information recognition, such as RNA molecular structure detection. The...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27
Inventor: 宗成庆, 李幸
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI