
A computer-based method and device for analyzing the syntactic structure of natural language

A technology for analyzing the syntactic structure of natural language, applied in the field of computer data processing, that addresses the problem of high complexity

Active Publication Date: 2017-10-31
秦一男

AI Technical Summary

Problems solved by technology

[0004] However, the complexity of this method is high, and the analytical accuracy of compound sentence structures needs to be further improved.



Examples


Embodiment approach

[0313] The sequence of ordinal values 1, 2, ..., k of the original sentence can be regarded as the result of two steps: first, equivalent substitution of the syntax vectors that have a definite position in the possible matrix solution; then, a finite number of overall interpolations among the syntax vectors that have no definite position in the possible matrix solution. In other words, the initial syntactic vector corresponding to the original sentence can be regarded as obtained by these same two steps. The various interpolation cases are essentially permutations and combinations in the combinatorial sense.

[0314] The above two methods both meet the requirements of the above principles, and the final results of the two methods are completely consistent. It c...

Example 1

[0590] Example 1: Preprocessing removes impurities from the sentence and identifies, labels and numbers its word units. For example, for the English sentence S = "I can completely understand what what you just said really meant", removing impurities yields S = "I can understand what what you said meant". After word unit identification, word unit type labeling and numbering, a data structure matching the table below is obtained.

[0591]-[0592]

Sentence          Word unit type          Number
I                 noun pronoun unit       1
can understand    predicate verb unit     2
what_a            subordinating unit      3
what_b            subordinating unit      4
you               noun pronoun unit       5
said              predicate verb unit     6
meant             predicate verb unit     7
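The table for Example 1 can be held in a small record type. The following is only a sketch of such a data structure, not the patent's implementation: the names `WordUnit`, `ROWS` and `number_units` are hypothetical, and the splitting and type labeling are entered by hand rather than computed, since the source does not describe how preprocessing derives them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WordUnit:
    text: str       # surface form of the word unit
    unit_type: str  # type label copied from the table
    number: int     # 1-based position in the preprocessed sentence


# Rows copied from the Example 1 table (hand-entered, not computed).
ROWS = [
    ("I", "noun pronoun unit"),
    ("can understand", "predicate verb unit"),
    ("what", "subordinating unit"),
    ("what", "subordinating unit"),
    ("you", "noun pronoun unit"),
    ("said", "predicate verb unit"),
    ("meant", "predicate verb unit"),
]


def number_units(rows):
    """Attach consecutive numbers 1..k to the (text, type) rows."""
    return [WordUnit(text, unit_type, i)
            for i, (text, unit_type) in enumerate(rows, start=1)]


units = number_units(ROWS)
for u in units:
    print(f"{u.number}  {u.text:<15} {u.unit_type}")
```

Keeping the number as an explicit field, rather than relying on list position, mirrors the table, where the number is a column of its own.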

[0593] The present invention is based on syntactic analysis of the preprocessed sentence represented by t...

Example 2

[0722] Example 2: As another example, the following describes how the method of this embodiment parses a sentence with a complex structure, such as: "That the men who were appointed didn't bother the liberals wasn't remarked upon by the press."

[0723] After the above sentence is preprocessed to remove impurities and its word units are numbered, the word sequence list is:

[0724]-[0725]

Original sentence phrase    Phrase type             Sequence number
that                        subordinating unit      1
the men                     noun pronoun unit       2
who                         subordinating unit      3
were appointed              predicate verb unit     4
didn't bother               predicate verb unit     5
the liberals                noun pronoun unit       6
wasn't remarked             predicate verb unit     7

[0726] There are three predicate verb units in this sentence, which are denoted r_1, r_2 and r_3.
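The count in [0726] can be checked directly against the word sequence list. The sketch below is illustrative only: `UNITS` and `predicate_units` are hypothetical names, and assigning the labels r_1, r_2, r_3 in sentence order is an assumption (the source confirms only that r_1 is "were appointed").

```python
# Word units of Example 2 as (text, type) pairs, in sentence order.
UNITS = [
    ("that", "subordinating unit"),
    ("the men", "noun pronoun unit"),
    ("who", "subordinating unit"),
    ("were appointed", "predicate verb unit"),
    ("didn't bother", "predicate verb unit"),
    ("the liberals", "noun pronoun unit"),
    ("wasn't remarked", "predicate verb unit"),
]


def predicate_units(units):
    """Map r_1, r_2, ... to (sequence number, text) for each predicate verb unit.

    Labels are assigned in sentence order, which is an assumption.
    """
    found = [(i, text)
             for i, (text, kind) in enumerate(units, start=1)
             if kind == "predicate verb unit"]
    return {f"r_{k}": item for k, item in enumerate(found, start=1)}


predicates = predicate_units(UNITS)
for label, (number, text) in predicates.items():
    print(label, number, text)
```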

[0727] For r_1, {r_1} = {were appointed}

[0728] {x_1} = {That, who, e} (e is the em...


Abstract

Disclosed are a computer-based method and device for analyzing the syntactic structure of natural language. The invention draws on the mathematical principles of abstract algebra, set theory, combinatorics, computability theory, computational linguistics and other disciplines, together with the corresponding computer technology. Using the mathematical idea of composite functions, it analyzes the syntactic structure of natural language by establishing matrix models and linear models and by constructing recursive functions; methods such as mathematical induction are used to prove important conclusions. The invention establishes a new set of mathematical models for natural-language sentences and differs fundamentally in approach from conventional methods. It proposes two overall interpolation methods, one-sided same-direction-preserving and one-sided same-direction non-preserving, and uses the generation of set families to process parallel syntactic components. The invention makes full use of the laws of mathematics and computer science; the method has high accuracy, a very large amount of computation, and a certain technical difficulty.

Description

Technical field

[0001] The invention relates to the field of computer data processing, in particular to a computer-based method and device for analyzing the syntactic structure of natural language.

Background art

[0002] Natural language processing is an important direction in the field of computer science and artificial intelligence. It studies theories and methods that enable effective communication between humans and computers using natural language.

[0003] Syntactic structure parsing is an important aspect of natural language processing: it uses computers to automatically divide natural language sentences into their sentence components, to assist in further processing of the sentences. Existing syntactic structure analysis technology usually uses the Probabilistic Context-Free Grammar (PCFG) algorithm, which calculates the rule matching probability between the sentence and the syntactic structure analysis result based on the complex nesti...
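As background on the PCFG approach named above (this is the prior art, not the invention's method): in a standard PCFG, a parse tree is scored by multiplying the probabilities of the rules it uses. The following minimal sketch assumes a toy grammar; `RULES`, its probabilities and `tree_probability` are all hypothetical and chosen only for illustration.

```python
from math import prod

# Hypothetical toy grammar: (left-hand side, right-hand side) -> probability.
# For each left-hand side the probabilities sum to 1.
RULES = {
    ("S", ("NP", "VP")): 1.0,
    ("NP", ("I",)): 0.4,
    ("NP", ("you",)): 0.6,
    ("VP", ("V", "NP")): 0.7,
    ("VP", ("V",)): 0.3,
    ("V", ("understand",)): 1.0,
}


def tree_probability(tree):
    """Probability of a parse tree: product of the probabilities of its rules.

    A tree is (label, children); each child is a subtree or a terminal string.
    """
    label, children = tree
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    rule_p = RULES[(label, rhs)]
    return rule_p * prod(tree_probability(c)
                         for c in children if not isinstance(c, str))


# Parse of "I understand you" under the toy grammar.
tree = ("S", [("NP", ["I"]),
              ("VP", [("V", ["understand"]), ("NP", ["you"])])])
p = tree_probability(tree)
print(p)
```

A full PCFG parser would also search over all candidate trees for the sentence and keep the most probable one; that search is omitted here.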


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/27, G06F40/20
CPC: G06F40/20
Inventor: 秦一男
Owner: 秦一男