Clustering method for question sentences in question-and-answer platform and system thereof

A clustering algorithm and question sentence technology, applied in the field of Internet search, can solve problems such as inability to achieve

Inactive Publication Date: 2010-01-20
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 50 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] If the semantic features of questions can be accurately identified, users can be provided with higher service qualit

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering method for question sentences in question-and-answer platform and system thereof
  • Clustering method for question sentences in question-and-answer platform and system thereof
  • Clustering method for question sentences in question-and-answer platform and system thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The embodiment of the present invention proposes a clustering method and system for question sentences in the question answering platform, aiming at the clustering method and system not specifically applied in the question answering platform in the prior art. Considering the characteristics and semantic features of questions comprehensively, fast and accurate clustering results can be obtained.

[0047] For the existing similarity measurement method, for the unbalanced sentence pattern of the text length of the question sentence, the measurement result will be seriously affected. Therefore, before the similarity measurement is performed, the actual keyword number in the question sentence is less than Semantic expansion is performed on questions with a predetermined number of reference keywords. On the contrary, de-redundancy processing is performed on questions whose actual number of keywords is greater than the number of predetermined reference keywords, so as to ensure...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a clustering method for question sentences in a question-and-answer platform and a system thereof. The technical scheme is as follows: the question sentences in the question-and-answer platform is analyzed according to the semantic feature of the question sentences to obtain analysis results; the semantic feature comprises the question type and the comparison feature of the question sentences and thesaurus correlative to the content of the question sentences; and aiming at the question sentences which is analyzed by the semantic feature, a clustering algorithm for evaluating the semantic similarity of the question sentences is adopted to obtain clustering results of the question sentences in the question-and-answer platform. The system comprises a question sentences analysis module and a clustering algorithm module. Aiming at the problem that the clustering method for the question sentences in the question-and-answer platform and the system thereof are not existed in the prior art, the technical scheme of the invention fills the gap, thereby not only realizing fast and exact clustering method and system in the question-and-answer platform, but also improving user experience.

Description

technical field [0001] The invention relates to the technical field of Internet search, in particular to a method and system for clustering questions in a question-and-answer platform. Background technique [0002] With the rapid development of Internet technology, the amount of network information is also increasing rapidly. The existing question-and-answer platform already contains a large number of questions. There are more sentences. In view of this situation, when the question answering platform receives the user's search request, it needs to have the ability to quickly find the information corresponding to the search request among these massive question sentences and question answer pairs, and provide it to the user. However, the existing question and answer The platform cannot be realized yet, so fast and accurate clustering methods and systems are very necessary for the existing question answering platform. [0003] Because the question-answer platform contains a l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F17/30
Inventor 姜中博刘怀军方高林
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products