Unlock instant, AI-driven research and patent intelligence for your innovation.

Open domain dialogue reply method and system based on deep reinforcement learning

A reinforcement learning and deep technology, applied in the field of artificial intelligence, can solve problems such as unfavorable dialogue, empty content, difficulty in expanding other data sets, etc., and achieve the effect of convenient migration

Active Publication Date: 2021-02-26
NANKAI UNIV
View PDF8 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004]Using real replies as supervision information enables the neural network to learn how to generate complete and fluent reply sentences. paid more attention to the reply
The supervised method that indirectly introduces emotion annotation can achieve better results on specific dialogue datasets, but it is difficult to extend to other datasets. At present, there is no method that can be applied to any open domain dialogue dataset.
And directly use the emotion of the dialogue reply as supervisory information to control the emotion of the reply. Another problem is that it leads to the generation of an emotionally safe reply, that is, the generated reply meets the expected emotion, but the content is relatively vague, which is not conducive to the progress of the dialogue.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Open domain dialogue reply method and system based on deep reinforcement learning
  • Open domain dialogue reply method and system based on deep reinforcement learning
  • Open domain dialogue reply method and system based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046]In order to explain in detail the technical content, structural features, achieved goals and effects of the technical solution, the following will be described in detail in conjunction with specific embodiments and accompanying drawings.

[0047] refer to figure 1 As shown in , it is a schematic diagram of the open domain dialog reply control flow. The dialogue data is input into the dialogue reply control model based on deep reinforcement learning. After the training is completed, the new dialogue text is input into the dialogue generation module in the model, and the dialogue reply with coherent content and reasonable emotion is output. In a preferred embodiment of the present invention, an open-domain dialogue reply method based on deep reinforcement learning includes:

[0048] Obtain dialogue input content for preprocessing;

[0049] The preprocessed information is input into the dialog reply control model for processing. The dialog reply control model includes a d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the field of artificial intelligence, relates to natural language generation and emotion analysis, and provides an open domain dialogue reply method and system based on deep reinforcement learning in order to introduce content coherence control and emotion rationality control to an open domain dialogue system. The method comprises the steps that dialogue input content is acquired and preprocessed; the preprocessed information is input into a dialogue reply control model to be processed; the dialogue reply control model comprises a dialogue generation module, a contentcoherence control module and an emotion rationality control module, wherein the dialogue generation module is used for generating a dialogue reply, the content coherence control module is used for enabling dialogue context content to be coherent, and the emotion rationality control module is used for sentence emotion classification and judging whether reply emotion is reasonable or not; and dialogue replies with coherent content and reasonable emotion are output.

Description

technical field [0001] The invention belongs to the field of artificial intelligence and relates to natural language generation and sentiment analysis. In order to introduce content coherence control and emotional rationality control to an open domain dialogue system, an open domain dialogue reply method and system based on deep reinforcement learning is proposed. Background technique [0002] Open-domain dialog system control aims to add more control and determinism to the process of generating dialog responses by neural networks. Before the neural network-based generative dialogue system, the mainstream dialogue system was retrieval-based, and all candidate dialogue responses came from the database. For a new dialogue context, when there is no suitable context in the database, the system’s The performance will be reduced, and the lack of diversity of responses is not suitable for open-domain dialogue scenarios. The generative dialogue system can generate responses outside...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F16/33G06F16/35G06F40/242G06F40/126G06F40/289G06N3/04
CPCG06F16/3329G06F16/3344G06F16/35G06F40/242G06F40/126G06F40/289G06N3/049G06N3/045
Inventor 张莹李丹阳郭文雅蔡祥睿袁晓洁
Owner NANKAI UNIV