Supercharge Your Innovation With Domain-Expert AI Agents!

Punctuation mark processing based Chinese syntax analysis method and apparatus

A technology of punctuation marks and syntactic analysis, which is applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as difficult to deal with the nesting of punctuation marks, the level is not clear enough, and the processing of punctuation marks is not comprehensive enough, etc., to achieve It is beneficial to segment sentences, improve accuracy and recall, and solve nested usage problems

Active Publication Date: 2016-01-13
饶志刚
View PDF3 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method is not comprehensive enough to deal with punctuation marks, ignoring some grammatical and semantic information of punctuation marks, and the level is not clear enough to deal with complex nested use of punctuation marks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Punctuation mark processing based Chinese syntax analysis method and apparatus
  • Punctuation mark processing based Chinese syntax analysis method and apparatus
  • Punctuation mark processing based Chinese syntax analysis method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] Exemplary embodiments of the present invention will be described below with reference to the accompanying drawings. In the interest of clarity and conciseness, not all features of an actual implementation are described in this specification. It should be appreciated, however, that in developing any such practical implementation, many implementation-specific decisions must be made in order to achieve the developer's specific goals, such as compliance with those constraints associated with the syntax analysis method, and the Restrictions may vary from implementation to implementation. Moreover, it should also be understood that development work, while potentially complex and time-consuming, would at least be a routine undertaking for those skilled in the art having the benefit of this disclosure.

[0020] Here, it should also be noted that, in order to avoid obscuring the present invention due to unnecessary details, only the device structure and / or processing steps clos...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to the technical field of natural language processing, and particularly to a punctuation mark processing based Chinese syntax analysis method and apparatus. The method comprises: studying Chinese punctuation marks and syntax and formal features of clauses, and hierarchically layering the Chinese punctuation marks and the syntax and the formal features of the clauses; according to the features, preparing a resolution rule table; according to hierarchical level relationships of the punctuation marks and the clauses, transforming an original input sentence into a series of tags, and generating a tag table; and according to hierarchies and levels of the tags in the tag table, performing syntax analysis and subtree splicing, and calling a resolution rule in the analysis and splicing process. Compared with a one-pass analysis method without consideration into the punctuation marks and an analysis method of decomposing a long sentence by using the punctuation marks, the method makes the hierarchies of the syntactic analysis more distinct, and makes full use of grammatical and semantic information of the punctuation marks, so that efficiency and accuracy of syntactic analysis are improved, and the problem of nestedly using the punctuation marks can be very simply, conveniently and distinctly solved, without exhausting all punctuation mark combination usages.

Description

technical field [0001] The invention relates to the technical field of natural language processing, in particular to a Chinese syntax analysis method and device based on punctuation processing. Background technique [0002] Syntactic analysis is one of the core technologies of natural language processing and the basis for in-depth understanding of language, while natural language understanding is the basis of many language processing technologies such as machine translation, information extraction, and semantic retrieval. Therefore, syntactic analysis is an indispensable core and basic technology for Chinese information processing. [0003] Punctuation marks are an important part of written Chinese, and most of the existing automatic syntax analysis systems ignore their role. The paper "Hierarchical Chinese Long Sentence Syntax Analysis Method Introducing Punctuation Processing" (Li Xing, Zong Chengqing, Chinese Journal of Information, Volume 20, Issue 4) pointed out that i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28
Inventor 饶志刚
Owner 饶志刚
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More