Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text intention recognition method and system based on projection gradient descent and label smoothing

A projection gradient descent and recognition method technology, applied in neural learning methods, biological neural network models, semantic analysis, etc., can solve problems such as difficulty in adapting to limited training samples, weak semantic coding ability, high model complexity, etc., and achieve good generalization Effect, Strong Resilience, Scale-Up Effect

Active Publication Date: 2020-09-04
ZHEJIANG UNIV
View PDF6 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the problem in the field of intent recognition, the existing text classification model lacks a good trade-off between model complexity and model generalization performance. Too few parameters tend to make the semantic coding ability weak, and the accuracy rate is low in the case of many classification categories. ; Too many parameters make the model too complex and difficult to adapt to the situation of limited training samples

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text intention recognition method and system based on projection gradient descent and label smoothing
  • Text intention recognition method and system based on projection gradient descent and label smoothing
  • Text intention recognition method and system based on projection gradient descent and label smoothing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0095] In order to verify the implementation effect of the present invention, comparison and ablation experiments were conducted on two large public data sets IFLYTEK and TNEW. IFLYTEK is a long text classification data set, containing 17,000 long text annotation data about app application descriptions, including various application topics related to daily life, and a total of 119 categories: "Taxi":0,"Navigation": 1, "Free WIFI": 2,..., "Received": 117, "Other": 118, each category can be regarded as a type of intent in the question and answer system. The data set is divided into three parts: training set, validation set, and test set. There are 12133, 2599, and 2600 long texts respectively.

[0096] TNEW is a short text classification data set, from the news section of Today's Toutiao, extracting a total of 15 categories of news, including travel, education, finance, military, etc. The data set is also divided into three parts: training set, validation set, and test set. There ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text intention recognition method and system based on projection gradient descent and label smoothing, and relates to the field of natural language processing question-answering systems. The method comprises the following steps: (1) obtaining an initial vector code through an embedded layer; (2) adding disturbance meeting L2 constraints to an embedded layer by using a projection gradient descent algorithm to form an adversarial sample; (3) using a Transformer network to encode the context semantic information; (4) scaling the category of the real intention by using label smoothing; (5) inputting the encoder output features into a classifier, and calculating the cross entropy between the encoder output features and the smoothed labels; (6) optimizing a target function; and (7) after model training is completed, predicting and outputting intention categories. According to the model, in a classification task, sufficient semantic vector coding can be carried out on an input intention; meanwhile, disturbance is added to a text embedding layer to form an adversarial sample, label sliding is conducted on a final classification target, and the robustness and generalization capacity of the model can be remarkably improved.

Description

Technical field [0001] The invention relates to the field of natural language processing question answering systems, in particular to a method and system for text intention classification based on projection gradient descent and label smoothing. Background technique [0002] With a large number of publicly available online question and answer corpora, the question and answer system has attracted the attention of researchers from industry and academia. Question answering systems are usually based on intelligent products that meet the needs of B-end enterprises, which can significantly improve work efficiency and relieve customer service staff. Its greatest hidden value is to automatically accumulate standardized data in actual scenarios, reduce costs and improve efficiency in mining customer service value information, and can also be used for future precision marketing and product upgrades. The typical application of the question and answer system is to ask questions about a cert...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/30G06N3/04G06N3/08
CPCG06F40/30G06N3/084G06N3/045
Inventor 徐叶琛赵洲
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products