Unlock instant, AI-driven research and patent intelligence for your innovation.

Intention recognition method and system based on keyword feature embedded language model

A language model and keyword technology, applied in natural language data processing, semantic analysis, instruments, etc., can solve problems such as dependence on training data, sample quality dependence, and expensive results.

Pending Publication Date: 2021-06-18
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Traditional intent recognition technology uses keyword matching and statistical information such as word frequency, TFIDF (term frequency–inverse document frequency, word frequency inverse text frequency index) and traditional machine learning models such as support vector machines and mixed Gaussian models. These methods only make use of the potential language statistics, but the deep semantic information cannot be involved. They are very dependent on the sample quality and the performance is not ideal.
[0004] Since the development of deep learning technology, advanced neural networks such as convolutional neural networks, recurrent neural networks, and transformers have been used to encode deeper semantic information, and have achieved remarkable results in various fields of natural language processing. However, these The methods are too dependent on the training data, only focus on specific tasks, and will be affected by the data set, learn other features of the data set and ignore the core semantic information of the task, once the data set changes, the performance will be greatly reduced

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Intention recognition method and system based on keyword feature embedded language model
  • Intention recognition method and system based on keyword feature embedded language model
  • Intention recognition method and system based on keyword feature embedded language model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The present invention will be further described now in conjunction with accompanying drawing.

[0057] Such as figure 1 As shown, the present invention provides an intent recognition method based on keyword features embedded in a language model, which overcomes the traditional intent recognition methods that cannot mine deep semantic information, is limited by data sets, and cannot learn common semantics for tasks. The problem is that by introducing the language model BERT pre-trained on a large-scale corpus, and embedding the external knowledge of keyword features, the target text can be identified more accurately and the result of text intent identification can be obtained.

[0058] The method includes:

[0059] Preprocessing the corpus to be recognized, using regular expressions to extract the language information of valid text in the corpus to be recognized;

[0060] Specifically, adjust the encoding format, and uniformly convert the Chinese encoding of the compli...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of natural language processing, and particularly relates to an intention recognition method based on a keyword feature embedded language model, which comprises the following steps of: performing word segmentation on extracted language information of an effective text by adopting a forward and backward maximum word segmentation algorithm to obtain different types of word segmentation results; for the obtained different types of word segmentation results, obtaining a keyword list related to candidate intentions corresponding to the different types of word segmentation results; removing universal high-frequency words and field-independent words in the keyword list related to the candidate intention corresponding to the word segmentation result of each category, obtaining a final keyword list corresponding to the word segmentation result of each category, and further obtaining different keyword feature vectors; embedding each obtained keyword feature vector into a pre-trained language model, and obtaining voice information of an effective text with keyword features; and encoding and classifying the voice information to obtain an intention recognition result of the language information of the effective text.

Description

technical field [0001] The invention belongs to the technical field of natural language processing and long text intent recognition, and in particular relates to an intent recognition method and system based on a keyword feature embedded language model. Background technique [0002] Intent recognition technology is an important technology to identify the speaker's intention hidden in long text, and it is also an important research content in the field of natural language processing. [0003] Traditional intent recognition technology uses keyword matching and statistical information such as word frequency, TFIDF (term frequency–inverse document frequency, word frequency inverse text frequency index) and traditional machine learning models such as support vector machines and mixed Gaussian models. These methods only make use of the underlying language statistical data, but the deep semantic information cannot be involved, and they are very dependent on the sample quality, and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/30G06F40/284
Inventor 颜永红林格平付瑞柳万辛张学君孙旭东孙晓晨
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI