Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cold start system and method for dialog strategy optimization

A cold start and strategy technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as poor system performance, improve performance and avoid user loss

Active Publication Date: 2020-05-05
AISPEECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention aims at the defect that the system performance is very poor at the initial stage of training in the prior art, and a large amount of dialogue data is required for training to improve the performance, and proposes a cold start system and method for dialogue strategy optimization, which can significantly improve dialogue strategy in reinforcement learning. Performance in the early stages of online training; improve the learning speed of the dialogue policy, that is, reduce the number of dialogues it uses to achieve a certain performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cold start system and method for dialog strategy optimization
  • Cold start system and method for dialog strategy optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] This embodiment relates to a cold start system optimized for dialog strategies, including:

[0028] User input module 101: for receiving user input, which may be voice, image or text.

[0029] Dialogue state tracking module 102: for analyzing the semantics of the current user input, and at the same time tracking the dialogue state according to the dialogue context, that is, understanding the user's intention.

[0030] Teacher decision-making module 103: According to the designed rule-based dialogue strategy, the decision is made in the current state s t Reply action a tea .

[0031] Student decision-making module 104: decide the current state s according to the policy network (Q-network) t response action a stu , while estimating the certainty of the current decision.

[0032] Action selection module 105: select a final reply action a from the reply actions generated by the above two decision-making modules according to a random function t .

[0033] Output modul...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a dialogue strategy-optimized cold start system and a method. The system comprises a user input module, a dialogue state tracking module, a teacher decision-making module, a student decision-making module, an action selection module, an output module, a strategy training module and a reward function module. The action selection module randomly selects one final reply action from all reply actions generated by the teacher decision-making module and the student decision-making module. The output module converts the final reply action into a more natural expression and displays the more natural expression to a user. The strategy training module stores the dialogue experience (transition) in an empirical pool, samples a fixed number of experiences, and updates network parameters according to a depth Q network (DQN) algorithm. The reward function module calculates the reward of the dialogue at each round of the dialogue, and outputs the reward to the strategy training module. According to the invention, the performance of the dialogue strategy during the strengthened learning on-line training initial stage can be remarkably improved. The learning speed of the dialogue strategy is increased, and the number of dialogues used for achieving certain performances is reduced.

Description

technical field [0001] The invention relates to a technology in the field of intelligent man-machine dialogue, in particular to a cold start system and method for dialogue strategy optimization. Background technique [0002] The intelligent human-computer dialogue system is an intelligent system that can communicate with users. Among them, the dialog strategy is the module in the whole system that decides how to reply to the user. The earliest design method of dialogue strategy is that designers design different logic rules according to different user inputs. The disadvantage of this method is that the dialogue strategy cannot be continuously optimized with the user's feedback to enhance the adaptive ability to the user and the environment. [0003] In recent years, deep reinforcement learning methods have gradually been used in the optimization of dialogue policies. In this method, the dialogue strategy is represented by a neural network, and the reward signal (reward) i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/183G10L15/16
CPCG10L15/063G10L15/16G10L15/183
Inventor 俞凯陈露周翔常成杨闰哲
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products