Model training method and device, dialogue system evaluation method and device, equipment and storage medium

A dialogue system and model training technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as poor reply quality, low coincidence, labor cost and time cost increase, and achieve improved model performance, The effect of improving reliability

Active Publication Date: 2019-08-30
TENCENT TECH (SHENZHEN) CO LTD
View PDF16 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Among them, non-task-based chatbots interact with users through non-task-based dialogue systems. There are usually two commonly used evaluation methods for the evaluation of the response quality of non-task-based dialogue systems. One is the manual evaluation method, which requires a lot of manpower. Manually evaluate the quality of system responses. Manual evaluation is highly reliable, but the efficiency of manual evaluation is high. Especially with the explosive growth of data volume, the labor cost and time cost of system evaluation will also increase, which is difficult to meet the needs of actual business. need
[0004] The other is to use BELU (Bilingual Evaluation understanding) value and other technical indicators for automatic evaluation, and the BELU value is calculated by comparing the coincidence degree between the system reply and the existing (manual) reply, specifically comparing the two types of reply respectively. The coincidence degree of any unit fragment (n-gram), the larger the value, the higher the coincidence degree of the two types of replies and the better the reply quality, and the smaller the value, the lower the coincidence degree of the two types of replies and the better the reply quality. The worse it is, the BELU value alone can only literally evaluate the similarity between the system reply and the existing (manual) reply, that is, it only has the ability to express the literal correlation of the sentence, which leads to the inaccurate evaluation of the reply quality of the system by using the current technical indicators

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method and device, dialogue system evaluation method and device, equipment and storage medium
  • Model training method and device, dialogue system evaluation method and device, equipment and storage medium
  • Model training method and device, dialogue system evaluation method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiment of the application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiment of the application. Obviously, the described embodiment is only It is a part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0044] The terms "first", "second", "third", "fourth", etc. (if any) in the specification and claims of the present application and the above drawings are used to distinguish similar objects, and not necessarily Used to describe a specific sequence or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances such th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a model training method, device and equipment, and the method comprises the steps: obtaining a pre-trained dialogue generation model which comprises an encoder and a decoder; constructing a dialogue system evaluation model, wherein the dialogue system evaluation model takes questions and replies as input and takes scores corresponding to the replies as output; initializing parameters of an encoder in the dialogue system evaluation model according to the parameters of the encoder in the dialogue generation model; and according to the first training sample set, training the initialized dialogue system evaluation model to obtain a dialogue system evaluation model meeting a training ending condition, each training sample in the first training sample set comprising a question, a reply and a label score corresponding to the reply. The dialogue system evaluation model obtained by training through the method can evaluate the reply quality of the dialogue system from the perspective of semantic correlation, and the reliability of dialogue reply evaluation is improved.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, and in particular to a model training method, a dialogue system evaluation method, a device, a device, and a storage medium. Background technique [0002] As an important application of artificial intelligence technology, chatbots have been widely used in various specific scenarios, such as customer service, personal assistants, knowledge quizzes, and accompanying chats. At present, chatbots can be divided into task-type and non-task-type chatbots according to their purposes, and the realization technologies of chatbots for different purposes are not the same. [0003] Among them, non-task-based chatbots interact with users through non-task-based dialogue systems. There are usually two commonly used evaluation methods for the evaluation of the response quality of non-task-based dialogue systems. One is the manual evaluation method, which requires a lot of manpower. Manu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/22G06F17/27
CPCG06F40/12G06F40/30
Inventor 涂润
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products