Computer-based method and device for analyzing natural language syntactic structures

A natural language syntactic structure analysis technology, applied in the field of computer data processing, that addresses the high complexity of existing methods.

Active Publication Date: 2014-11-19
秦一男

AI Technical Summary

Problems solved by technology

[0004] However, the complexity of this method is high, and the analytical accuracy of compound sentence structures needs to be further improved.



Examples


Embodiment approach

[0313] The ordinal sequence 1, 2, ..., k of the original sentence can be regarded as obtained in two steps: first, equivalent substitution of the syntactic vectors that have a definite position in the possible matrix solution; second, a finite number of overall interpolations of the syntactic vectors that have no definite position in the possible matrix solution. That is, the initial syntactic vector corresponding to the original sentence is obtained by substituting the syntactic vectors whose positions can be determined and then interpolating, as a whole and a finite number of times, the syntactic vectors whose positions cannot be determined. The various interpolation cases are essentially permutations and combinations in the sense of combinatorics.
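The remark that interpolation cases reduce to permutations and combinations can be illustrated with a toy enumeration. The Python sketch below is not the patent's procedure. It assumes a simplified reading in which the items with a known position (`placed`) keep their order and the remaining items (`unplaced`) are inserted among them, either keeping their own relative order or not. Both function names are hypothetical.

```python
from itertools import combinations, permutations
from math import comb, factorial


def order_preserving_insertions(placed, unplaced):
    """Yield every merge of the two lists that keeps each list's internal order.

    Assumption for illustration only: this models an "order-preserving"
    insertion as choosing which output slots the unplaced items occupy.
    """
    total = len(placed) + len(unplaced)
    for slots in combinations(range(total), len(unplaced)):
        slot_set = set(slots)
        p, u = iter(placed), iter(unplaced)
        # Fill each output slot from the unplaced items if chosen, else from placed.
        yield [next(u) if i in slot_set else next(p) for i in range(total)]


def free_insertions(placed, unplaced):
    """Yield every merge in which the unplaced items may also be reordered."""
    for perm in permutations(unplaced):
        yield from order_preserving_insertions(placed, list(perm))


placed = [1, 2, 3]        # items whose position is already determined
unplaced = ["a", "b"]     # items still to be inserted

preserving = list(order_preserving_insertions(placed, unplaced))
free = list(free_insertions(placed, unplaced))

# The counts follow the usual combinatorial formulas.
assert len(preserving) == comb(5, 2)
assert len(free) == comb(5, 2) * factorial(2)
```

Under this reading, the order-preserving case is counted by a binomial coefficient and the unrestricted case multiplies it by the number of permutations of the inserted items, so the search space grows quickly with the number of unplaced items.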

[0314] The above two methods both meet the requirements of the above principles, and the final results of the two methods are completely consistent. It c...

Example 1

[0590] Example 1: Through preprocessing, impurities in the sentence can be removed, and the word units in the sentence can be identified and labeled with their types and numbers. For example, for the English sentence S="I can completely understand what what you just said really meant", removing impurities yields S="I can understand what what you said meant". After word unit recognition, type labeling, and numbering are carried out on this sentence, a data structure matching the table below is obtained.

[0591]-[0592]

Word unit         Word unit type         Number
I                 noun pronoun unit      1
can understand    predicate verb unit    2
what A            subordinating unit     3
what B            subordinating unit     4
you               noun pronoun unit      5
said              predicate verb unit    6
meant             predicate verb unit    7
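The table above can be held in a small data structure. The following is a minimal Python sketch, not the patent's implementation: the `WordUnit` class, the `IMPURITIES` set, and the hand-written `SEGMENTS` list are all hypothetical, chosen only so that the example sentence reduces as described in [0590]. How the method itself recognizes impurities and word units is not shown in this excerpt.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WordUnit:
    number: int      # 1-based position after preprocessing
    text: str        # surface form of the word unit
    unit_type: str   # type label, as in the table


# Hypothetical impurity list: just the words dropped in this one example.
IMPURITIES = {"completely", "just", "really"}


def remove_impurities(sentence: str) -> str:
    """Drop the words listed in IMPURITIES and keep the rest in order."""
    return " ".join(w for w in sentence.split() if w.lower() not in IMPURITIES)


# Hand-written segmentation that mirrors the table; a real system would
# have to derive this from the sentence.
SEGMENTS = [
    ("I", "noun pronoun unit"),
    ("can understand", "predicate verb unit"),
    ("what", "subordinating unit"),   # "what A" in the table
    ("what", "subordinating unit"),   # "what B" in the table
    ("you", "noun pronoun unit"),
    ("said", "predicate verb unit"),
    ("meant", "predicate verb unit"),
]


def number_units(segments):
    """Attach sequence numbers 1..k to the segmented word units."""
    return [WordUnit(i, text, utype) for i, (text, utype) in enumerate(segments, start=1)]


original = "I can completely understand what what you just said really meant"
cleaned = remove_impurities(original)
units = number_units(SEGMENTS)

for unit in units:
    print(unit.number, unit.text, unit.unit_type, sep="\t")
```

Joining the `text` fields of `units` with spaces reproduces the cleaned sentence, which is a quick consistency check between the segmentation and the preprocessing step.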

[0593] The present invention is based on syntactic analysis of the preprocessed sen...

Example 2

[0722] Example 2: As another example, the following describes how the method of this embodiment parses a sentence with a complex structure, such as: "That the men who were appointed didn't bother the liberals wasn't remarked upon by the press."

[0723] After the above sentence is preprocessed to remove impurities and its word units are numbered, the word sequence list is:

[0724]-[0725]

Original sentence phrase    Phrase type            Sequence number
that                        subordinating unit     1
the men                     noun pronoun unit      2
who                         subordinating unit     3
were appointed              predicate verb unit    4
didn't bother               predicate verb unit    5
the liberals                noun pronoun unit      6
wasn't remarked             predicate verb unit    7

[0726] There are three predicate verb units in this sentence, which are denoted as $r_1$, $r_2$ and $r_3$.
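Picking out the predicate verb units from the numbered list can be sketched in a few lines of Python. This is an illustration only: the `UNITS` list is transcribed from the table in Example 2, the function name `label_predicates` is hypothetical, and numbering the predicates in sentence order is an assumption (the excerpt confirms only that the first one is "were appointed").

```python
# Word units of Example 2 as (sequence number, phrase, phrase type).
UNITS = [
    (1, "that", "subordinating unit"),
    (2, "the men", "noun pronoun unit"),
    (3, "who", "subordinating unit"),
    (4, "were appointed", "predicate verb unit"),
    (5, "didn't bother", "predicate verb unit"),
    (6, "the liberals", "noun pronoun unit"),
    (7, "wasn't remarked", "predicate verb unit"),
]


def label_predicates(units):
    """Return the predicate verb units keyed r_1, r_2, ... in sentence order."""
    predicates = [text for _, text, utype in units if utype == "predicate verb unit"]
    return {f"r_{i}": text for i, text in enumerate(predicates, start=1)}


predicates = label_predicates(UNITS)
for name, text in predicates.items():
    print(name, "=", text)
```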

[0727] For $r_1$: $\{r_1\} = \{\text{were appointed}\}$

[0728] $\{x_1\} = \{\text{That}, \text{who}, e\}$ (...


Abstract

The invention discloses a computer-based method and device for analyzing natural language syntactic structures. Drawing on mathematical principles from abstract algebra, set theory, combinatorics, computability theory, computational linguistics and related subjects, together with the corresponding computer technologies, the method and device analyze natural language syntactic structures by building a matrix model and a linear model and by constructing a recursive function in the manner of a composite function; methods such as mathematical induction are also applied to prove important conclusions. The method and device establish a new set of mathematical models for natural language sentences, an approach fundamentally different from that of conventional methods. Two overall plug-in methods are proposed, the single-side same-direction order-preserving method and the single-side same-direction non-order-preserving method, and a parallel method for generating and processing syntactic constituents based on a family of sets is applied. The method makes full use of the rules of mathematics and computer science; it is high in accuracy, large in operation amount, and of a certain technical difficulty.

Description

Technical field

[0001] The invention relates to the field of computer data processing, in particular to a computer-based method and device for analyzing the syntactic structure of natural language.

Background technique

[0002] Natural language processing is an important direction in the field of computer science and artificial intelligence. It studies various theories and methods that can realize effective communication between humans and computers using natural language.

[0003] Syntactic structure parsing is an important aspect of natural language processing, which uses computers to automatically divide the sentence components of natural language sentences to assist in the further processing of sentences. In the existing syntactic structure analysis technology, the Probabilistic Context Free Grammars (PCFG) algorithm is usually used, which calculates the rule matching probability between the sentence and the syntactic structure analysis result based on the complex nesti...


Application Information

IPC(8): G06F17/27, G06F40/20
CPC: G06F40/20
Inventor: 秦一男
Owner: 秦一男