Dialogue system training data construction method and device, electronic equipment and storage medium

A dialogue system and training data technology, applied in the field of data processing, can solve the problems of low efficiency and high cost, achieve the effect of reducing cycle, improving accuracy and reliability, and improving construction efficiency

Active Publication Date: 2019-06-28
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF6 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The dialog system training data construction method, device, electronic equipment, and storage medium proposed in this application are used to solve the problem of high cost and low efficiency in the related art of manually labeling data to construct training data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dialogue system training data construction method and device, electronic equipment and storage medium
  • Dialogue system training data construction method and device, electronic equipment and storage medium
  • Dialogue system training data construction method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] Embodiments of the present application are described in detail below, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals designate the same or similar elements throughout. The embodiments described below by referring to the figures are exemplary, and are intended to explain the present application, and should not be construed as limiting the present application.

[0018] The embodiment of the present application aims at the problem of high cost and low efficiency in the method of manually marking data to construct training data in related technologies, and proposes a method for constructing training data of a dialog system.

[0019] The dialogue system training data construction method provided by the embodiment of the present application can perform statistical processing on the historical use data of the dialogue system, determine the historical query statement set corresponding to the dialogue system, the que...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a dialogue system training data construction method, a device, electronic equipment and a storage medium, and the method comprises the steps: carrying out the statistical processing of the historical use data of the conversation system, and determining a historical query statement set corresponding to the conversation system, the query frequency corresponding to each historical query statement, and an identification result corresponding to each historical query statement; according to the query frequency and the identification result corresponding to each historical query statement, obtaining a reference query statement from the historical query statement set;judging whether the number of all reference query statements is greater than a first threshold; and if yes, constructing a training data set of the dialogue system by utilizing all the reference query statements and the identification results corresponding to all the reference query statements. Therefore, through the dialogue system training data construction method, the labor cost is saved, the construction efficiency of the training data set is improved, and the accuracy and reliability of the dialoguesystem are further improved.

Description

technical field [0001] The present application relates to the technical field of data processing, and in particular to a method, device, electronic equipment and storage medium for constructing training data of a dialogue system. Background technique [0002] With the development of machine learning technology, especially the rapid development of neural networks in recent years, effective training data has become more and more important, and it is even called the "data oil" of the future. The Spoken Language Understanding (SLU) task in the field of Natural Language Processing (Natural Language Processing, referred to as NLP) aims to solve the problem of semantic understanding in human-computer dialogue, and parse the spoken dialogue (query) into intent (intent) and Slots are structured data for computer processing. [0003] In related technologies, machine learning technology is a main method for realizing SLU tasks. Realizing the SLU task through machine learning technolo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F16/33G06F16/335
Inventor 韩磊张红阳陈雷
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products