Dialogue generation and corpus expansion method and device, computer equipment and storage medium

A dialogue generation and corpus expansion technique that addresses the difficulty of collecting dialogue data, the resulting loss of dialogue quality, and poor training results. It expands the number of training samples, improves dialogue quality, and mitigates the shortage of samples.

Pending Publication Date: 2020-04-24
CHINA SOUTHERN POWER GRID COMPANY +1

AI Technical Summary

Problems solved by technology

[0004] The performance of a human-computer dialogue system depends on the quality and scale of its labeled data. In specific scenarios such as enterprise intelligent assistants, however, the corpus required by the dialogue management module is lacking at the start-up stage, which can easily leave the model with insufficient generalization capability and make good training results difficult to achieve.
In vertical fields, collecting dialogue data at scale is very difficult. When the amount of data is insufficient, satisfactory intent recognition and slot filling accuracy cannot be achieved, which degrades dialogue quality.

Examples

Embodiment Construction

[0075] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0076] It can be understood that the term "and/or" used in this application describes an association between associated objects and covers three cases. For example, "A and/or B" may mean: A alone, both A and B, or B alone. The character "/" generally indicates an "or" relationship between the objects before and after it.

[0077] The dialogue generation method and corpus expansion method provided by this application can be applied in the application environment shown in Figure 1. The a...

Abstract

The invention relates to a dialogue generation and corpus expansion method and device, computer equipment and a storage medium. The dialogue generation method comprises: obtaining a current question text, wherein the current question text is obtained from a question currently input by a user; performing text vectorization on the current question text and then inputting it into a pre-created dialogue generation model to obtain a target answer text, wherein the dialogue generation model is trained using question text vectors and their corresponding answer texts as training samples, and each question text vector is obtained by performing text vectorization, synonym conversion and/or sentence pattern rewriting on an initial question text, followed by conversion from word vectors to a semantic vector; and responding to the input question according to the target answer text, wherein the input question and the response to it constitute the generated dialogue. The method increases the number of samples, enhances sample availability, and can improve dialogue quality.
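
The synonym-conversion step of corpus expansion described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the synonym table `SYNONYMS`, the function names `expand_question` and `expand_corpus`, and the sample question and answer are all hypothetical, and text vectorization, sentence pattern rewriting, and the word-vector to semantic-vector conversion are deliberately left out.

```python
from itertools import product

# Hypothetical synonym table; the source does not say how synonyms are obtained.
SYNONYMS = {
    "check": ["check", "query"],
    "outage": ["outage", "blackout"],
}


def expand_question(question: str) -> list[str]:
    """Return every variant of `question` produced by per-word synonym substitution."""
    tokens = question.lower().split()
    # Each token contributes its synonym list, or just itself if it has no entry.
    options = [SYNONYMS.get(token, [token]) for token in tokens]
    return [" ".join(choice) for choice in product(*options)]


def expand_corpus(pairs: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Pair every question variant with the answer text of the original question."""
    expanded = []
    for question, answer in pairs:
        for variant in expand_question(question):
            expanded.append((variant, answer))
    return expanded


# One seed sample (assumed example data) grows into four training samples.
seed = [("check outage status", "No outage is reported in your area.")]
corpus = expand_corpus(seed)
for question, answer in corpus:
    print(question, "->", answer)
```

Because each variant keeps the answer text of its source question, the expanded pairs can be used directly as additional training samples; in practice the variants would still need filtering, since blind substitution can produce unnatural sentences.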

Description

Technical field

[0001] The present application relates to the field of electric power technology, in particular to a dialog generation and corpus expansion method, device, computer equipment and storage medium.

Background technique

[0002] With the development of power technology and the gradual growth of business and data volume in the power industry, a large number of business scenarios that require interaction have emerged within power companies, such as intelligent assistants for operation management and control, and intelligent customer service. Among them, the process of information interaction with machines through natural language understanding to realize business requirements and data calls has important research significance and application value.

[0003] Natural language processing and human-computer dialogue are the main components of speech semantic technology, and integrating various semantic analysis algorithms is one of the key supporting applications of ar...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/332, G06F40/247, G06F40/58
CPC: G06F16/3329
Inventor: 吴石松, 吴丹
Owner: CHINA SOUTHERN POWER GRID COMPANY