An answer extraction method for a community question answering system

A community question answering and answer extraction technology, applied in fields such as instruments, electronic digital data processing, and text database querying.

Active Publication Date: 2019-06-28
网经科技(苏州)有限公司

AI Technical Summary

Problems solved by technology

However, most text similarity methods mainly target declarative sentences. When a sentence is long and its components are complete, the results are acceptable, but...



Embodiment Construction

[0081] In order to have a clearer understanding of the technical features, purposes and effects of the present invention, specific implementations are now described in detail.

[0082] The answer extraction method of the community question answering system of the present invention, as shown in Figure 1, includes the following steps:

[0083] S101: Perform word segmentation and stop-word removal preprocessing on the question-answer dataset;

[0084] The raw question-and-answer text usually has no delimiters between words and contains function words, symbols, and other tokens that contribute little to semantic expression, so preprocessing is required first;

[0085] Specifically, a word segmentation method or tool is used to preprocess the question-answer dataset. The word segmentation method may be a dictionary-based maximum matching method, a full-segmentation path selection method, a word sequence tagging method, or a transition-based word segmentation method. The word segmentation tool is...
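As an illustration of step S101, the following is a minimal sketch of dictionary-based forward maximum matching followed by stop-word removal. It is not the patent's implementation: the function names `forward_max_match` and `preprocess`, the toy dictionary, and the stop-word set are all assumptions chosen for the example, and a real system would load a full lexicon and stop-word list.

```python
def forward_max_match(text, dictionary, max_len=None):
    """Greedy forward maximum matching: at each position, take the
    longest substring found in the dictionary (hypothetical helper)."""
    if max_len is None:
        max_len = max((len(w) for w in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest window first and shrink until a word matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted as a fallback token.
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                i += size
                break
    return tokens


def preprocess(text, dictionary, stop_words):
    """Segment the text, then drop stop words and whitespace tokens."""
    tokens = forward_max_match(text, dictionary)
    return [t for t in tokens if t not in stop_words and not t.isspace()]


if __name__ == "__main__":
    # Toy lexicon and stop-word set, assumed for illustration only.
    dictionary = {"社区", "问答", "社区问答", "系统", "答案", "抽取"}
    stop_words = {"的"}
    print(preprocess("社区问答系统的答案抽取", dictionary, stop_words))
    # prints ['社区问答', '系统', '答案', '抽取']
```

Forward maximum matching is the simplest of the listed methods; it is greedy, so an ambiguous string is always resolved in favor of the longest word starting at the current position. Symbols and function words can be dropped by adding them to the stop-word set.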


Abstract

The invention relates to an answer extraction method for a community question answering system. The method comprises the following steps: first, performing word segmentation and stop-word removal preprocessing on the question-answer dataset; selecting a word-sense similarity scheme and setting a threshold to normalize synonymous words; calculating the smooth inverse frequency (SIF) similarity based on question classification and common component removal, ranking by that similarity, and selecting the k candidate questions that are semantically closest; calculating the similarity of the k candidate questions using character-level features and dependency pyramid features, that is, ranking the candidate questions by character-vector similarity and by comprehensive dependency similarity; and finally, selecting the optimal answer by weighing both the rankings and the numerical values. A question classification strategy narrows the scope of subsequent computation; common components of the dataset are removed at the sentence level, and the SIF similarity serves as the ranking reference for accurately screening the top k candidate questions. As a result, the semantically closest questions in community question-answer data are identified more efficiently and accurately, improving the efficiency and accuracy of answer extraction.
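The candidate-selection step can be sketched as follows. This excerpt does not give the patent's exact formulas, so the sketch assumes the commonly used smooth inverse frequency formulation: each word vector is weighted by a / (a + p(w)), the weighted vectors are averaged, and the projection onto the first principal direction of the sentence vectors (the common component) is subtracted. The question classification step is omitted. All names (`sif_vector`, `first_component`, `top_k_candidates`), the parameter value a = 1e-3, and the toy word vectors and word probabilities are assumptions for illustration.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def sif_vector(tokens, word_vecs, word_prob, a=1e-3):
    """Average of word vectors, each weighted by a / (a + p(w))."""
    dim = len(next(iter(word_vecs.values())))
    acc = [0.0] * dim
    known = [t for t in tokens if t in word_vecs]
    for t in known:
        weight = a / (a + word_prob.get(t, 0.0))
        for j, x in enumerate(word_vecs[t]):
            acc[j] += weight * x
    n = max(len(known), 1)
    return [x / n for x in acc]


def first_component(vectors, iters=100):
    """Power iteration for the dominant direction of the sentence
    vectors (top right singular vector of the sentence matrix)."""
    dim = len(vectors[0])
    u = [1.0 / math.sqrt(dim)] * dim
    for _ in range(iters):
        proj = [dot(v, u) for v in vectors]
        nxt = [sum(p * v[j] for p, v in zip(proj, vectors)) for j in range(dim)]
        norm = math.sqrt(dot(nxt, nxt))
        if norm == 0.0:
            break
        u = [x / norm for x in nxt]
    return u


def remove_component(v, u):
    """Subtract the projection of v onto the unit vector u."""
    d = dot(v, u)
    return [x - d * y for x, y in zip(v, u)]


def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return 0.0 if na == 0.0 or nb == 0.0 else dot(a, b) / (na * nb)


def top_k_candidates(query, questions, word_vecs, word_prob, k=3):
    """Return (index, similarity) pairs for the k closest questions."""
    raw = [sif_vector(q, word_vecs, word_prob) for q in questions]
    u = first_component(raw)
    cleaned = [remove_component(v, u) for v in raw]
    qv = remove_component(sif_vector(query, word_vecs, word_prob), u)
    scored = sorted(((cosine(qv, c), i) for i, c in enumerate(cleaned)), reverse=True)
    return [(i, s) for s, i in scored[:k]]


if __name__ == "__main__":
    # Toy embeddings and unigram probabilities, assumed for illustration.
    word_vecs = {
        "password": [1.0, 0.2, 0.5], "reset": [0.8, 0.1, 0.6],
        "invoice": [0.1, 1.0, 0.5], "download": [0.2, 0.9, 0.4],
        "account": [0.5, 0.5, 1.0], "delete": [0.3, 0.4, 0.9],
    }
    word_prob = {w: 0.01 for w in word_vecs}
    questions = [["password", "reset"], ["invoice", "download"], ["account", "delete"]]
    for idx, score in top_k_candidates(["reset", "password"], questions, word_vecs, word_prob, k=2):
        print(idx, round(score, 3))
```

The sketch estimates the common component from the candidate questions themselves and applies the same removal to the query; whether the patent estimates it per question class or over the whole dataset is not stated in this excerpt. The k questions returned here would then be re-ranked by the character-level and dependency-based similarities described above.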

Description

Technical field

[0001] The invention relates to an answer extraction method for a community question answering system, and belongs to the technical field of automatic question answering.

Background

[0002] Automatic question answering is the task of using computers to automatically answer questions raised by users in order to meet their knowledge needs. According to the target data source, it can be divided into three categories: retrieval-based question answering, community question answering, and knowledge base question answering. When answering a user question, an automatic community question answering system needs to correctly understand the natural language question raised by the user, extract its key semantic information, obtain the answer through retrieval, matching, and reasoning over an existing corpus, knowledge base, or question-answer base, and return it to the user.

[0003] For community Q&A, the core problem is to find historical questions that are seman...


Application Information

IPC(8): G06F17/27, G06F16/332, G06F16/33, G06F16/35, G06Q50/00
Inventor 刘继明孟亚磊陈浮刘松金宁
Owner 网经科技(苏州)有限公司