Domain question classification method combining syntactic structure relations and domain features

A question classification technology based on syntactic structure, applied in special data processing applications, instruments, electrical digital data processing, etc., which addresses the failure of existing methods to consider domain characteristics

Status: Inactive
Publication Date: 2008-12-10
KUNMING UNIV OF SCI & TECH
Cites: 0, Cited by: 52

AI Technical Summary

Problems solved by technology

Existing question classification methods mainly target open-domain questions and do not take the characteristics of a specific domain into account.

Examples

Embodiment

[0050] As shown in Figure 1, the domain question classification method combining syntactic structure relations and domain features provided by the present invention is implemented through the following steps:

[0051] (1) Step S1, collecting domain terms: 2012 domain terms were manually collected and organized, mainly from descriptions of Yunnan tourist attractions, local customs, tourism transportation, hotels, etc., forming a tourism domain terminology database;

[0052] (2) Step S2, defining the question classification system for the tourism domain: the present invention defines a question classification system of 5 major categories and 23 subcategories for the tourism domain; Table 1 shows the system defined:

[0053] Table 1: Question classification system for the tourism domain

[0054]

Category | Subcategories
Attractions | Attraction Introduction, Attraction Location, Attraction Price, Attraction Transporta...
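
Steps S1 and S2 amount to building two lookup structures: a term database and a category hierarchy. The sketch below is one possible representation, not the patent's own implementation. The sample terms, the helper names `find_domain_terms` and `is_valid_label`, and the substring matching strategy are all assumptions made for illustration. Only the entries legible in Table 1 are included in the hierarchy.

```python
# Hypothetical sample entries; the source describes a database of 2012
# manually collected tourism terms, none of which are listed in the text.
DOMAIN_TERMS = {"Stone Forest", "Lijiang Old Town", "ticket", "hotel"}

# Only the category and subcategories legible in Table 1 appear here.
# The remaining categories and subcategories are elided in the source
# and are deliberately left out rather than guessed.
CLASSIFICATION_SYSTEM = {
    "Attractions": [
        "Attraction Introduction",
        "Attraction Location",
        "Attraction Price",
    ],
}


def find_domain_terms(question, terms):
    """Return the domain terms that occur in the question, longest first.

    Matching is a case-insensitive substring test (an assumption; the
    source does not say how terms are matched).
    """
    lowered = question.lower()
    hits = [term for term in terms if term.lower() in lowered]
    return sorted(hits, key=lambda term: (-len(term), term))


def is_valid_label(category, subcategory, system):
    """Check that a (category, subcategory) pair exists in the hierarchy."""
    return subcategory in system.get(category, [])


matched = find_domain_terms("How much is a ticket to the Stone Forest?", DOMAIN_TERMS)
```

Sorting matches longest first is a design choice that lets a multi-word term take priority over a shorter term contained in it when the matches are later turned into features.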

Abstract

The invention relates to a method for classifying domain questions that combines syntactic structure relations with domain features. The method comprises the following steps: collecting domain terms; defining a domain-specific question classification system; analyzing the syntactic structure of each question; extracting the sentence trunk; taking the sentence trunk and the domain vocabulary as question classification features; building a question classification model with an improved Bayesian classification algorithm; constructing a domain-specific question classification training corpus and test corpus; and building a domain-specific question classifier. Question classification is a very important step in a question answering system and a key factor in formulating answer extraction strategies and locating answers. Building on syntactic structure analysis and domain features, the method selects the sentence trunk and the domain vocabulary as classification features, builds the question classification model with the improved Bayesian classification algorithm, and is tested on question classification in the Yunnan tourism domain. The results show that the method is effective and improves domain question classification accuracy, so that consultation services can be provided to users efficiently, quickly, and accurately.
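
The feature and classifier steps in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions: it uses a standard multinomial naive Bayes with add-one smoothing, not the improved Bayesian algorithm, whose details the text here does not give. Syntactic parsing is assumed to have already produced the sentence-trunk tokens. The names `extract_features` and `NaiveBayesQuestionClassifier`, the sample terms, and the toy training questions are all hypothetical.

```python
import math
from collections import Counter, defaultdict

# Hypothetical sample terms standing in for the tourism term database.
DOMAIN_TERMS = {"ticket", "hotel"}


def extract_features(trunk_tokens, question_tokens, domain_terms):
    """Combine sentence-trunk words with domain terms found in the question."""
    features = [f"trunk={token}" for token in trunk_tokens]
    features += [f"term={token}" for token in question_tokens if token in domain_terms]
    return features


class NaiveBayesQuestionClassifier:
    """Standard multinomial naive Bayes with add-one (Laplace) smoothing."""

    def __init__(self):
        self.class_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocabulary = set()

    def fit(self, samples):
        """Count classes and features from (features, label) pairs."""
        for features, label in samples:
            self.class_counts[label] += 1
            self.feature_counts[label].update(features)
            self.vocabulary.update(features)
        return self

    def predict(self, features):
        """Return the label with the highest log posterior score."""
        total = sum(self.class_counts.values())
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, float("-inf")
        for label, count in self.class_counts.items():
            score = math.log(count / total)
            denominator = sum(self.feature_counts[label].values()) + vocab_size
            for feature in features:
                score += math.log((self.feature_counts[label][feature] + 1) / denominator)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training corpus: trunk tokens are written by hand here, standing in
# for the output of a syntactic parser.
training = [
    (extract_features(["ticket", "cost"], ["how", "much", "ticket", "cost"], DOMAIN_TERMS),
     "Attraction Price"),
    (extract_features(["attraction", "located"], ["where", "attraction", "located"], DOMAIN_TERMS),
     "Attraction Location"),
]
clf = NaiveBayesQuestionClassifier().fit(training)
prediction = clf.predict(
    extract_features(["ticket", "price"], ["what", "ticket", "price"], DOMAIN_TERMS)
)
```

Prefixing features with `trunk=` and `term=` keeps the two feature sources distinct, so a word that is both a trunk word and a domain term contributes evidence twice.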

Description

Technical field

[0001] The invention relates to a method for classifying domain questions, in particular to a question classification method for restricted domains that combines syntactic structure relations and domain features.

Background art

[0002] Question classification is a very important step in a question answering system. Its accuracy has a great impact on the system's information retrieval and answer extraction: correct answer extraction is possible only on the basis of correct question classification.

[0003] At present, research on question classification focuses mainly on two approaches. One is the rule-based method, which extracts characteristic rules from the combinations of interrogative words and other related words for each question type, and determines the type of a question through these rules. The classification performance of this method depends on the composition rules of various...
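
The rule-based approach described in the background can be illustrated with a short sketch. The rules, the patterns, and the function name `classify_by_rules` are hypothetical examples written for this illustration; they are not rules taken from the patent or from any cited system.

```python
import re

# Each rule pairs an interrogative-word pattern (optionally combined with
# related words) with a question type. Rules are tried in order.
RULES = [
    (re.compile(r"\bhow much\b.*\b(ticket|price|cost)\b"), "Attraction Price"),
    (re.compile(r"\bwhere\b"), "Attraction Location"),
]


def classify_by_rules(question, rules=RULES, default=None):
    """Return the type of the first matching rule, or the default."""
    text = question.lower()
    for pattern, label in rules:
        if pattern.search(text):
            return label
    return default


location = classify_by_rules("Where is the Stone Forest?")
price = classify_by_rules("How much does a ticket cost?")
```

A question that matches no rule falls through to the default, which shows why the quality of this approach depends on how completely the rules cover each question type.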

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
Inventor: 余正涛, 孟祥燕, 韩露, 毛存礼, 车文刚
Owner: KUNMING UNIV OF SCI & TECH