Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for identifying unknown intention text

A text and intent technology, applied in the direction of neural learning methods, character and pattern recognition, unstructured text data retrieval, etc., can solve the problem that the classification model cannot be classified, the category labels of the training samples are incomplete, and the classification model cannot identify the unknown Intent and other issues to achieve the best results

Pending Publication Date: 2022-07-15
ZHONGKE DINGFU BEIJING TECH DEV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, the text classification task trains the classification model through several fixed categories of training samples, so that the classification model can recognize several fixed categories of text from unknown texts. However, for unknown texts that do not belong to these fixed categories (that is, unknown intent ), the classification model cannot classify
For example: in a news classification scenario, if the training samples include labels for the three categories of sports, economy, and entertainment, then the classification model trained using these three categories of training samples can only be used for the three categories of sports, economy, and entertainment. Classify the text to be recognized, and the text to be recognized in the life class belongs to the unknown intention for the classification model, but the classification model cannot recognize this unknown intention
[0004] In addition, in some scenarios, there may be many types of text categories, and the category labels of training samples may only cover part of the categories, that is, the category labels of training samples are incomplete

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for identifying unknown intention text
  • Method and device for identifying unknown intention text
  • Method and device for identifying unknown intention text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] Text classification is one of the basic tasks in the field of natural language processing technology. It has very rich applications in real life. For example, applications based on natural language processing technology such as public opinion monitoring, news classification, and sentiment classification are all realized through text classification tasks. of.

[0033] Currently, the text classification task trains a classification model with several fixed categories of training samples, enabling the classification model to recognize several fixed categories of text from unknown texts, however, for unknown texts that do not belong to these fixed categories (i.e. unknown intent ), the classification model fails to classify. For example: in a news classification scenario, if the training samples include labels for three categories of sports, economy, and entertainment, then the classification model trained using the training samples of these three categories can only be use...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides an unknown intention text recognition method and device. According to the scheme, the method comprises the steps that K positive samples and S negative samples corresponding to each training sample are obtained, wherein K and S are both positive integers larger than or equal to 1; using a classifier to obtain sentence representations of the training samples and the corresponding positive samples and negative samples, so that the sentence representations of the samples of the same category are gathered together, and the sentence representations of different categories are far away from each other; determining a decision center of each category according to the sentence representation, and learning a decision boundary of each category; judging whether the to-be-recognized text is located outside decision boundaries of all categories or not; and if yes, determining that the to-be-recognized text is an unknown intention text. According to the embodiment of the invention, comparative learning and classification learning are introduced in a classifier training stage, so that sentence representations of samples of the same category are gathered together, sentence representations of different categories are far away from each other, the effect is better when a decision boundary is trained, and the classifier can recognize a text with an unknown intention more accurately.

Description

technical field [0001] The present application relates to the technical field of natural language processing, and in particular, to a method and device for recognizing unknown intent text. Background technique [0002] Text classification is one of the basic tasks in the field of natural language processing technology. It has very rich applications in real life. For example, applications based on natural language processing technology such as public opinion monitoring, news classification, and sentiment classification are all realized through text classification tasks. . [0003] Currently, the text classification task trains a classification model with several fixed categories of training samples, enabling the classification model to recognize several fixed categories of text from unknown texts, however, for unknown texts that do not belong to these fixed categories (i.e. unknown intent ), the classification model fails to classify. For example: in a news classification s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06K9/62G06N3/04G06N3/08
CPCG06F16/35G06N3/08G06N3/045G06F18/22G06F18/241
Inventor 李健铨刘小康穆晶晶胡加明
Owner ZHONGKE DINGFU BEIJING TECH DEV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products