Iterative incremental dialogue intention category identification method based on small sample

A recognition method and small sample technology, applied in the computer field, can solve the problems of slow query, manual labeling, time-consuming and labor-intensive, etc., and achieve the effect of ensuring robustness and accuracy

Active Publication Date: 2019-10-01
TONGJI UNIV
View PDF8 Cites 48 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The rule-based algorithm often roughly guesses the user's intention by counting the keyword information in the text. This classification method is slow in querying when the amount of data is large, and requires manual labeling, which is time-consuming and labor-intensive; probability-based The document classification algorithm of the statistical model has high requirements on the quality and distribution of the text, and the classification is inaccurate in the case of small samples; the text classification algorithm based on the machine learning classification model is better for short text classification, and is very difficult for long text. It is difficult to capture the dialogue context information, it is easy to answer the wrong question, and the model needs to be retrained for new predictions. As the number of samples increases, the training complexity is greater

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Iterative incremental dialogue intention category identification method based on small sample
  • Iterative incremental dialogue intention category identification method based on small sample
  • Iterative incremental dialogue intention category identification method based on small sample

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0053] The specific implementation of the iterative incremental dialogue intent recognition method based on small samples is as follows:

[0054] The first is to divide the data of telecom customer service intentions, and divide the customer service intentions into 50 categories according to the actual needs of the industry (X takes 50). The average number of important dialogue rounds is 10, and each round contains two sentences. Therefore, a customer service sample consists of about 20 sentences and an intent label. After word segmentation, each sentence has an average of about 5-10 words. Here, 1000 small sample data sets are first selected, and the 1000 data sets are divided into 10 groups, each group contains 100 data, and each group of data is divided into Da data set and Db data set again. For each set of data sets, 70% is used as the training set of a single model, and 30% is used as the verification set to evaluate the effect of the model. Until the accuracy of the mod...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an iterative incremental dialogue intention category identification method based on small samples. The method is based on a small sample data set. Training starts from a preliminary classification model, and along with the use of a model, the number of the preliminary models is continuously increased. Model accuracy is also gradually improved, and a training mode that a large number of samples are needed by a previous deep learning model is abandoned. According to the method, in the iterative training process, only a small number of samples are needed to train a new preliminary classification model each time. The weights of other existing historical preliminary classification models are not changed. Results of all the preliminary classification models are input into the secondary classification model for training, the calculation speed of the model cannot decrease along with increase of the number of samples, the similarity screening model can screen and removethe existing preliminary classification model, the performance is maintained under the condition that the accuracy is guaranteed. Compared with the prior art, the method is advantaged in that the number of training samples is small, the calculation performance is stable, and the model is easy to update and expand.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to an iterative incremental dialogue intent category recognition method based on small samples. Background technique [0002] In order to improve the quality of products and services, many companies have launched their own customer service systems to help users answer questions through manual customer service, so that users can enjoy better services and improve service quality and efficiency. However, with the increase in the number of product users, traditional The manual customer service cannot meet the needs of many users, and the manual customer service needs special training and learning for the business, which also brings a certain cost. The customer service hotline is busy for a long time, which will affect the user experience. Therefore, major companies have launched their own artificial customer service products, which can help users solve related businesses faster by lea...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F16/35
CPCG06F16/35G06F18/241Y02D10/00
Inventor 向阳单光旭贾圣宾徐诗瑶杨力
Owner TONGJI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products