Dialogue strategy-optimized cold start system and method

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A cold start and strategy technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as poor system performance, improve performance and avoid user loss

Active Publication Date: 2017-11-10

AISPEECH CO LTD

View PDF4 Cites 25 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The present invention aims at the defect that the system performance is very poor at the initial stage of training in the prior art, and a large amount of dialogue data is required for training to improve the performance, and proposes a cold start system and method for dialogue strategy optimization, which can significantly improve dialogue strategy in reinforcement learning. Performance in the early stages of online training; improve the learning speed of the dialogue policy, that is, reduce the number of dialogues it uses to achieve a certain performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0027] This embodiment relates to a cold start system optimized for dialog strategies, including:

[0028] User input module 101: for receiving user input, which may be voice, image or text.

[0029] Dialogue state tracking module 102: for analyzing the semantics of the current user input, and at the same time tracking the dialogue state according to the dialogue context, that is, understanding the user's intention.

[0030] Teacher decision-making module 103: According to the designed rule-based dialogue strategy, the decision is made in the current state s t Reply action a tea .

[0031] Student decision-making module 104: decide the current state s according to the policy network (Q-network) t response action a stu , while estimating the certainty of the current decision.

[0032] Action selection module 105: select a final reply action a from the reply actions generated by the above two decision-making modules according to a random function t .

[0033] Output modul...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a dialogue strategy-optimized cold start system and a method. The system comprises a user input module, a dialogue state tracking module, a teacher decision-making module, a student decision-making module, an action selection module, an output module, a strategy training module and a reward function module. The action selection module randomly selects one final reply action from all reply actions generated by the teacher decision-making module and the student decision-making module. The output module converts the final reply action into a more natural expression and displays the more natural expression to a user. The strategy training module stores the dialogue experience (transition) in an empirical pool, samples a fixed number of experiences, and updates network parameters according to a depth Q network (DQN) algorithm. The reward function module calculates the reward of the dialogue at each round of the dialogue, and outputs the reward to the strategy training module. According to the invention, the performance of the dialogue strategy during the strengthened learning on-line training initial stage can be remarkably improved. The learning speed of the dialogue strategy is increased, and the number of dialogues used for achieving certain performances is reduced.

Description

technical field [0001] The invention relates to a technology in the field of intelligent man-machine dialogue, in particular to a cold start system and method for dialogue strategy optimization. Background technique [0002] The intelligent human-computer dialogue system is an intelligent system that can communicate with users. Among them, the dialog strategy is the module in the whole system that decides how to reply to the user. The earliest design method of dialogue strategy is that designers design different logic rules according to different user inputs. The disadvantage of this method is that the dialogue strategy cannot be continuously optimized with the user's feedback to enhance the adaptive ability to the user and the environment. [0003] In recent years, deep reinforcement learning methods have gradually been used in the optimization of dialogue policies. In this method, the dialogue strategy is represented by a neural network, and the reward signal (reward) i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/06G10L15/183G10L15/16

CPCG10L15/063G10L15/16G10L15/183

Inventor俞凯陈露周翔常成杨闰哲

OwnerAISPEECH CO LTD

Dialogue strategy-optimized cold start system and method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology