Question routing method based on word vectors

A word-vector technique, applied to question routing, that addresses problems such as reduced accuracy.

Active Publication Date: 2015-05-20
DALIAN UNIV OF TECH
5 Cites, 28 Cited by

AI Technical Summary

Problems solved by technology

Such methods improve retrieval recall to a certain extent, but often reduce accuracy because they introduce a large amount of noise.

Examples

Embodiment Construction

[0046] The present invention will be described below with reference to the accompanying drawings.

[0047] As shown in Figure 1, a question routing method based on word vectors includes the following steps:

[0048] Step 1. Construction of user profiles: build a profile for each user from that user's answer history. Every user who has answered questions in the community Q&A site is a candidate answerer for a new question. The construction of each user profile includes the following sub-steps:

[0049] Step (a): download all data published by the Stackoverflow website from July 2008 to March 2014. The data is in XML format; read the XML files to extract all questions, including each question's tag field, title field, and content body field;

[0050] Step (b): collect, from the questions the user has answered, those in which the user's answer was selected as the best answer, to form the user's profile;

[0051] Step (c), ignoring the users who are selected as the best answer in a relatively sm...
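
Sub-steps (a) and (b) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patent's implementation: it assumes the public Stack Overflow dump layout, in which each post is a `row` element with `PostTypeId` 1 for questions and 2 for answers, and the attributes `Title`, `Body`, `Tags`, `AcceptedAnswerId` and `OwnerUserId`. The sample rows and the helper names `load_posts` and `build_profiles` are hypothetical. Sub-step (c) is not sketched because its threshold is not given in the text above.

```python
import io
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical stand-in for a Posts.xml dump file (made-up rows).
SAMPLE_XML = """<posts>
  <row Id="1" PostTypeId="1" AcceptedAnswerId="3" Title="How to sort a list?"
       Body="&lt;p&gt;I have a list.&lt;/p&gt;" Tags="&lt;python&gt;&lt;sorting&gt;" />
  <row Id="2" PostTypeId="2" ParentId="1" OwnerUserId="10" Body="Use a loop." />
  <row Id="3" PostTypeId="2" ParentId="1" OwnerUserId="11" Body="Use sorted()." />
  <row Id="4" PostTypeId="1" Title="What is a mutex?"
       Body="&lt;p&gt;Threads.&lt;/p&gt;" Tags="&lt;c&gt;" />
</posts>"""


def load_posts(source):
    """Stream the dump and return (questions, answer_owner).

    questions:    question id -> tags, title, body, accepted answer id
    answer_owner: answer id   -> id of the user who wrote the answer
    """
    questions, answer_owner = {}, {}
    for _, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != "row":
            continue
        post_type = elem.get("PostTypeId")
        if post_type == "1":  # question
            questions[elem.get("Id")] = {
                # Tags are assumed to be stored as "<tag1><tag2>".
                "tags": re.findall(r"<([^>]+)>", elem.get("Tags", "")),
                "title": elem.get("Title", ""),
                "body": elem.get("Body", ""),
                "accepted": elem.get("AcceptedAnswerId"),
            }
        elif post_type == "2" and elem.get("OwnerUserId"):  # answer
            answer_owner[elem.get("Id")] = elem.get("OwnerUserId")
        elem.clear()  # free memory; real dumps are very large
    return questions, answer_owner


def build_profiles(questions, answer_owner):
    """Map each user to the questions where that user's answer was accepted."""
    profiles = defaultdict(list)
    for question_id, question in questions.items():
        user = answer_owner.get(question["accepted"])
        if user is not None:
            profiles[user].append(question_id)
    return dict(profiles)


questions, answer_owner = load_posts(io.BytesIO(SAMPLE_XML.encode("utf-8")))
profiles = build_profiles(questions, answer_owner)
print(questions["1"]["tags"], profiles)
```

Streaming with `iterparse` and clearing each element keeps memory use flat, which matters for a multi-year dump; the accepted answer is used here as the stand-in for "best answer".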

Abstract

The invention relates to a question routing method, and in particular to a question routing method based on word vectors. The method includes the following steps: first, user profiles are constructed; second, the data are preprocessed; third, word vectors are trained; fourth, documents are represented as vectors; fifth, user authority is computed; sixth, user activity is computed; seventh, candidate answerers are ranked. The word vectors are trained on the data with word2vec, which is efficient, and they have the superposition property, which overcomes the defect that documents sharing no co-occurring words have a similarity of 0. In addition, subject words are extracted from each document and the document vector is expressed through their word vectors; authority, activity, and the similarity between document vectors are then combined, so that semantic information between documents is taken into account and noise is reduced. In comparison experiments against the classic TF_IDF method and the classic Language Model, the S@N of the proposed method is higher than that of both.
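
The zero-similarity defect mentioned in the abstract can be illustrated with a small sketch. It is a toy example under loud assumptions: the 3-dimensional word vectors are made up (real ones would come from word2vec training), the document vector is taken to be the plain sum of its word vectors, and cosine similarity is used for comparison. None of these choices is confirmed by the abstract, and the function names are hypothetical.

```python
import math

# Made-up word vectors for illustration only (not trained with word2vec).
WORD_VECTORS = {
    "sort":   [0.9, 0.1, 0.0],
    "order":  [0.8, 0.2, 0.1],
    "list":   [0.7, 0.0, 0.3],
    "array":  [0.6, 0.1, 0.4],
    "mutex":  [0.0, 0.9, 0.2],
    "thread": [0.1, 0.8, 0.3],
}


def document_vector(words, vectors):
    """Sum the vectors of the known words (assumed use of superposition)."""
    dim = len(next(iter(vectors.values())))
    total = [0.0] * dim
    for word in words:
        if word in vectors:
            total = [t + v for t, v in zip(total, vectors[word])]
    return total


def cosine(a, b):
    """Cosine similarity of two dense vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def overlap_cosine(words_a, words_b):
    """Cosine over binary bag-of-words vectors; 0.0 when no word is shared."""
    a, b = set(words_a), set(words_b)
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))


question = ["sort", "list"]
profile_a = ["order", "array"]   # related topic, but no shared words
profile_b = ["mutex", "thread"]  # unrelated topic, no shared words

bag_sim = overlap_cosine(question, profile_a)
q_vec = document_vector(question, WORD_VECTORS)
sim_a = cosine(q_vec, document_vector(profile_a, WORD_VECTORS))
sim_b = cosine(q_vec, document_vector(profile_b, WORD_VECTORS))
print(bag_sim, sim_a, sim_b)
```

With word overlap alone, the question and the related profile score 0 because they share no words; with summed word vectors, the related profile scores well above the unrelated one.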

Description

Technical field

[0001] The present invention relates to a question routing method, and more particularly to a question routing method based on word vectors.

Background art

[0002] In recent years, Q&A communities such as Yahoo! Answers, Baidu Knows, and Stackoverflow have become increasingly popular. The public shares knowledge in these communities: every day a large number of users ask questions, and the answers posted in the community give users candidate solutions. Community Q&A websites generally divide questions by category. When asking a question, a user selects a suitable category, that is, the question tag, and waits for other users to answer. The asker must wait until other users browse the community and read the question before an answer is provided, and many users may answer the question before the best answer is obtained. This process generally takes hours or days. The answer may no ...

Application Information

IPC(8): G06F17/30
CPC: G06F16/24578, G06F16/9535, G06Q50/01
Inventors: 王健, 董华磊, 林鸿飞
Owner: DALIAN UNIV OF TECH