Content-based problem automatic classifying method and system

An automatic classification and question technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as unpleasantness, unsafe question and answer environment, inaccurate classification, etc.

Active Publication Date: 2008-08-27
广东东华发思特软件有限公司
View PDF1 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is to provide a content-based automatic question classification method and its system, which are used to

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Content-based problem automatic classifying method and system
  • Content-based problem automatic classifying method and system
  • Content-based problem automatic classifying method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0112] The technical solution of the present invention will be further described in more detail below in conjunction with the drawings and specific embodiments.

[0113] Such as figure 1 Shown is the structure diagram of the automatic problem classification system of the present invention. The system 100 includes: a feature space construction module 10, a question keyword acquisition module 20, and a semantic mapping module 30.

[0114] First, when a user asks a new question, the question is sent to the question keyword acquisition module 20 for processing. The question keyword obtaining module 20 obtains the question keywords of the new question according to the keyword tags and / or fillable content tags in the template, and sets a higher weight for the obtained question keywords, and at the same time obtains the question keywords according to the weight of the word The question vector of the new question. The feature space construction module 10 generates a feature vector of eac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a content-based question automatic classification method and a content-based question automatic classification system. The system comprises a question key word acquisition module, a characteristic space construction module and a semantic mapping module, wherein, the question key word acquisition module is used for acquiring a question key word of a novel question according to a key word label and/or a fillable content label in a template, setting a weight for the question key word and obtaining a question vector of the novel question; the characteristic space construction module is used for acquiring a characteristic vector of each class according to questions of all the prior classes and weights and constructing a characteristic space; the semantic mapping module is connected with the characteristic space construction module and the question key word acquisition module and used for mapping the question vector of the novel problem to the characteristic space, calculating the similarity of the novel problem and each class according to the question vector of the novel question after mapping and the characteristic vector of each class, and returning classes which are most related to the novel problem according to the similarity. The content-based question automatic classification method and the content-based question automatic classification system realize automatic classification of the novel problem which is put forward by a user and return most probable results to the user for selection.

Description

Technical field [0001] The present invention relates to classification technology, in particular to a method and system for automatically classifying problems based on content. Background technique [0002] The traditional text similarity calculation method uses its word frequency vector (or term-frequency vector) to express the text, and then calculates the distance between the word frequency vectors to obtain the text similarity. However, most of the existing methods for calculating text similarity are only suitable for long texts. The traditional method for calculating the similarity of long texts is effective because similar long texts usually contain a certain number of the same vocabulary. But for short texts such as questions, similar questions do not necessarily have the same words. The flexibility of natural language allows people to express the same meaning through different words. Therefore, the existing similarity calculation methods are not effective. . In addition, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
Inventor 刘文印
Owner 广东东华发思特软件有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products