Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Associated question aggregation model generation method and device, question-and-answer mode aggregation method and device as well as equipment

A question-and-answer and question-based technology, applied in the field of data processing, can solve the problems affecting the answer satisfaction rate of the question-and-answer community, the quality of the answer is not high, and there is no answer, so as to reduce the cost of manual labeling, reduce the number of manual labeling samples, and optimize the answer satisfaction rate. Effect

Active Publication Date: 2018-11-20
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF5 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the process of realizing the present invention, the inventor found that the existing technology can only check the problems with consistent problem expressions, or only individual stop words are different, and the semantic generalization is very poor; and for the problems that already exist in the library, no Check whether there are the same questions. Some of the questions in the library have low-quality answers or no answers, which will affect the overall answer satisfaction rate of the Q&A community.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Associated question aggregation model generation method and device, question-and-answer mode aggregation method and device as well as equipment
  • Associated question aggregation model generation method and device, question-and-answer mode aggregation method and device as well as equipment
  • Associated question aggregation model generation method and device, question-and-answer mode aggregation method and device as well as equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0043] figure 1 It is a flowchart of a method for generating a related problem aggregation model provided by Embodiment 1 of the present invention. This embodiment is applicable to the case of generating a related problem aggregation model for related problem aggregation. This method can be provided by the related The problem aggregation model generation device is implemented, and the device can be implemented in the form of software and / or hardware, and generally can be integrated in the generation device of the associated problem aggregation model. The device for generating the aggregation model of the associated problem includes but is not limited to a computer and the like. like figure 1 As shown, the method of this embodiment specifically includes:

[0044] S101. Obtain a first number of basic training samples according to network behavior data of at least two users, and use the basic training samples to train a first machine learning model to obtain a basic semantic ma...

Embodiment 2

[0056] Figure 2a It is a flow chart of a method for generating an aggregation model of related problems provided by Embodiment 2 of the present invention. This embodiment is embodied on the basis of the foregoing embodiments.

[0057] Correspondingly, such as Figure 2a As shown, the method of the present embodiment includes:

[0058] S201. Obtain at least two click behavior logs of the user. The click behavior logs include: a search type, a URL set recalled based on the search type, and a target URL selected by the user based on the URL set.

[0059] Wherein, the click behavior log includes: a search type, a URL set recalled based on the search type, and a target URL selected by the user based on the URL set. When a user enters a search formula in the search engine, the search engine will return multiple URLs to the user, that is, a collection of URLs recalled based on the search formula. The user will click on a part of the URL, which is the target URL selected by the us...

Embodiment 3

[0086] image 3 It is a flowchart of a method for generating an aggregation model of related problems provided by Embodiment 3 of the present invention. This embodiment is embodied on the basis of the above-mentioned embodiments.

[0087] Correspondingly, such as image 3 As shown, the method of the present embodiment includes:

[0088] S301. Obtain a first number of basic training samples according to network behavior data of at least two users, and use the basic training samples to train a first machine learning model to obtain a basic semantic matching model.

[0089] S302. Migrate the semantic representation layer in the basic semantic matching model to the second machine learning model, and divide the second number of associated question pairs into a training sample set and a testing sample set.

[0090] Wherein, the associated question pairs according to the pre-marked second quantity are divided into a training sample set and a test sample set. The training sample se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an associated question aggregation model generation method and device, a question-and-answer mode aggregation method and device as well as equipment. The methods include the following steps: obtaining a first quantity of basic training samples according to network behavior data of at least two users, and training a first machine learning model by using the basic training samples to obtain a basic semantic matching model; migrating a semantic representation layer in the basic semantic matching model to a second machine learning model, and training the second machine learning model according to a second quantity of pre-labeled associated question pairs to obtain an associated question aggregation model. According to the embodiment of the invention, the associated question aggregation model that aggregates answers of questions with consistent meanings can be obtained, the basic semantic matching model trained by the network behavior data of the users is adopted toperform migration learning to further generate the associated question aggregation model, so that the number of manually labeled samples and the manual labeling costs can be greatly reduced, and the answer satisfaction rate of the questions in a question-and-answer community can be optimized.

Description

technical field [0001] Embodiments of the present invention relate to data processing technologies, and in particular, relate to a method, device and equipment for generating an aggregation model of associated questions and a question-and-answer aggregation. Background technique [0002] At present, Baidu knows, Zhihu and other question-and-answer communities have a lot of the same problems. Some questions had no answers or were of low quality. When users search for these questions, they cannot find answers to their needs. However, there are other problems with the same meaning but different expressions on the website. (For example, "How to force an Apple phone to shut down?" and "Is there a way to force an IPhone to shut down?" are a pair of questions with the same meaning but different expressions.) These questions with the same meaning may have answers that meet the needs of users. [0003] In the prior art, when a user asks a question, some question-and-answer communi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 王硕寰孙宇
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products