New dialogue intention recognition method based on pseudo label self-training and source domain retraining

A recognition method and self-training technology, applied in character and pattern recognition, neural learning methods, special data processing applications, etc., can solve the problems of reduced model robustness, coarse pseudo-label granularity, and difficulty in knowledge transfer, so as to improve the expression ability, improved recognition accuracy, and improved robustness

Active Publication Date: 2022-01-04
XI AN JIAOTONG UNIV
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The above-mentioned technical solution has the following disadvantages: First, the pseudo-labels generated by the existing model for unlabeled data are relatively coarse-grained, and the model cannot be well trained to discover new dialogue intentions.
Second, existing models only use labeled data for model initialization, but fail to make full use of labeled data during training, making knowledge transfer difficult
Third, the existing model only uses the clustering model to generate the final prediction, and does not consider integrating other models for ensemble learning, which reduces the robustness of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • New dialogue intention recognition method based on pseudo label self-training and source domain retraining
  • New dialogue intention recognition method based on pseudo label self-training and source domain retraining
  • New dialogue intention recognition method based on pseudo label self-training and source domain retraining

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0071] In order to enable those skilled in the art to better understand the solutions of the present invention, the following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is an embodiment of a part of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0072] It should be noted that the terms "first" and "second" in the description and claims of the present invention and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used are interchangeable under appropriate circums...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a new dialogue intention recognition method based on pseudo label self-training and source domain retraining and belongs to the technical field of language processing. According to the new dialogue intention recognition method based on pseudo label self-training and source domain retraining, pseudo labels are generated for unlabeled data containing new dialogue intentions, and model parameters are iteratively updated by utilizing a self-training method, so recognition accuracy is continuously improved; meanwhile, a retraining strategy is provided, so knowledge can be better migrated between a source domain and a target domain, and the expression ability of models is improved; and finally, the output of the three models is fused for integrated learning, so robustness of the models is improved.

Description

technical field [0001] The invention belongs to the technical field of language processing, and in particular relates to a new dialogue intent recognition method based on pseudo-label self-training and source domain retraining. Background technique [0002] The core module of the intelligent dialogue system is user intent recognition. New dialog intent recognition aims to discover newly generated dialog intents based on existing dialog intents, using only a small amount of labeled known intent data to discover and classify new intents from a large amount of unlabeled data. Since the data containing new intents are all unlabeled data, the existing dialog intent classification models cannot handle them, which leads to errors in the recognition of user intent and affects the subsequent response of the intelligent dialog system. [0003] In order to solve the above problems, the academic community currently mainly adopts two types of methods: 1. The method based on comparative ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F16/332G06N3/08
CPCG06F16/3329G06N3/084G06F18/23213G06F18/2415G06F18/214Y02D10/00
Inventor 田锋安文斌郑庆华
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products