Unlock instant, AI-driven research and patent intelligence for your innovation.

Short sentence analytic model establishing method and system

A technology for analyzing models and establishing methods, which is applied in special data processing applications, instruments, and electrical digital data processing, etc., can solve problems such as difficult to simulate language constraint relationships, and the accuracy of recognition and analysis of short sentences is not high enough, so as to achieve the goal of improving accuracy Effect

Active Publication Date: 2014-12-10
CTRIP TRAVEL NETWORK TECH SHANGHAI0
View PDF8 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to overcome the fact that the natural language analysis method in the prior art is difficult to optimize according to actual data, and it is difficult to simulate the local constraint relationship in the language, resulting in insufficient accuracy of recognition and analysis of short sentences defects, and propose a method and system for establishing a short sentence parsing model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Short sentence analytic model establishing method and system
  • Short sentence analytic model establishing method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0045] Such as figure 1 As shown, the short sentence parsing model building method of the present embodiment comprises the following steps:

[0046] S 1 , get the original sentence;

[0047] S 2 , Segment the original sentence into word sequences;

[0048] S 3 , assigning a part of speech to each word in the word sequence according to the pre-stored part of speech rule;

[0049] S 4 , Identify named entities according to each word and its part of speech. Named entities include person names, place names, and institution names;

[0050] S 5 1. Identify the grammatical components of each word in the original sentence according to each word, part of speech and named entity;

[0051] S 6, Analyzing the dependencies among the various grammatical components;

[0052] S 7 , According to the dependency relationship between each grammatical component, extract the grammatical component as a feature;

[0053] S 8 1. Construct the extracted features into a feature vector, and ...

Embodiment 2

[0061] refer to figure 2 As shown, the short sentence parsing model building system of the present embodiment includes a sentence segmentation module 1, a part-of-speech assignment module 2, a named entity recognition module 3, a grammatical component recognition module 4, a dependency analysis module 5, a feature Combine module 6 and a storage module 7 .

[0062] The sentence segmentation module is used to obtain the original sentence and segment the original sentence into word sequences. The part-of-speech assigning module is used to assign a part-of-speech to each word in the word sequence according to the pre-stored part-of-speech rules. The named entity recognition module is used to recognize named entities according to each word and its part of speech, and named entities include person names, place names, and organization names. The grammatical component identification module is used to identify the grammatical components of each word in the original sentence accordin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a short sentence analytic model establishing method and system. The short sentence analytic model establishing method includes the following steps that original sentences are acquired; the original sentences are segmented into word sequences; each word in the word sequences is attached to a word class; named entities are recognized according to all the words and the word classes of the words; the grammatical items represented by all the words in the original sentences are recognized according to all the words, the word classes and the named entities; the dependence relationship between all the grammatical items is analyzed; according to the dependence relationship between all the grammatical items, the grammatical items are extracted as features; feature vectors are established with the extracted features, and every two feature vectors are combined into a feature combination of binary classification; the feature vectors and the feature combinations of the binary classification are stored in a model. Through the short sentence analytic model establishing method and system, optimization can be performed according to actual data, the local constraint relationship in a natural language can be simulated to a certain degree, and thus the accuracy for recognizing and analyzing the short sentences of the natural language is greatly improved.

Description

technical field [0001] The invention relates to a method and system for establishing a short sentence analysis model. Background technique [0002] Today, with the rapid development of various technologies such as speech signal processing, speech recognition, speech synthesis and natural language understanding, speech query has high research value, and its application will certainly bring good social and economic benefits. In voice query, natural language understanding and parsing of short sentences is the key to affect voice query results. How to improve the accuracy of natural language understanding and parsing to improve the accuracy of voice query system is an important issue. [0003] Traditional natural language parsing methods for short sentences are usually rule-based methods, and their core idea is to use grammar to describe and analyze language. Firstly, it is determined whether the sentence conforms to the pre-set norms, and then it is a search process to find a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
Inventor 刘新
Owner CTRIP TRAVEL NETWORK TECH SHANGHAI0