Human-computer interactive speech recognition method and system used for intelligent equipment

A technology of human-computer interaction and intelligent equipment, which is applied in speech recognition, speech analysis, instruments, etc., and can solve the problems of combined optimization of intent recognition tasks and slot filling tasks, deviation of slot information filling, failure to meet application requirements, etc.

Inactive Publication Date: 2019-05-21
SUNING COM CO LTD
View PDF4 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] For intent recognition, it can be abstracted as a classification problem, and then the CNN+knowledge representation classifier is used to train the intent recognition model. In the intent recognition model, in addition to word embedding of the user's voice problem, the semantics of knowledge are also introduced. representation to increase the generalization ability of the presentation layer, but in practical applications, it is found that the model has the defect of slot information filling bias, which affects the accuracy of the intent recognition model
For slot filling, the essence is to formalize the sentence sequence into a label sequence. There are many commonly used methods for labeling sequences, such as hidden Markov models or conditional random field models, but these slot filling models are not suitable for specific applications. In the scene, due to the lack of context information, there will be ambiguity in slots under different semantic intentions, which cannot meet the actual application requirements
It can be seen that the training of the two models in the prior art is carried out independently, and there is no combined optimization for the intent recognition task and the slot filling task, which eventually leads to the problem of low recognition accuracy of the trained model in terms of speech recognition, reducing the user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Human-computer interactive speech recognition method and system used for intelligent equipment
  • Human-computer interactive speech recognition method and system used for intelligent equipment
  • Human-computer interactive speech recognition method and system used for intelligent equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0058] figure 1 It is a schematic flow chart of the human-computer interaction speech recognition method for smart devices in Embodiment 1 of the present invention. see figure 1 , this embodiment provides a human-computer interaction speech recognition method for smart devices, including:

[0059] Segment the user's voice problem to obtain the original word sequence, and vectorize the original word sequence through embedding processing; calculate the hidden state vector h of each word segmentation vector i and the slot context vector c i S , by adding the hidden state vector h i and the slot context vector c i S After weighting processing, the slot label model y is obtained i S ;Calculate the vectorized representation of the original word sequence hidden state vector hT and intention context vector c I , by combining the hidden state vector hT and the intention context vector c I After weighting processing, the intent prediction model y is obtained I ; use slot gate...

Embodiment 2

[0074] see figure 1 and Figure 4 , the present embodiment provides a human-computer interaction speech recognition system for smart devices, including:

[0075] The word segmentation processing unit 1 is used to process the user's voice problem word segmentation to obtain the original word sequence, and carry out vectorized representation to the original word sequence by embedding process;

[0076] The first calculation unit 2 is used to calculate the hidden state vector h of each word segmentation vector i and the slot context vector c i S , by dividing the hidden state vector h i and the slot context vector c i S After weighting processing, the slot label model y is obtained i S ;

[0077] The second calculation unit 3 is used to calculate the hidden state vector hT of the original word sequence represented by vectorization and the intention context vector c I , by combining the hidden state vector hT and the intention context vector c I After weighted processing...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a human-computer interactive speech recognition method and system used for intelligent equipment, and belongs to the technical field of speech recognition. The accuracy of speech recognition is improved through joint optimization training of intention recognition and slot filling. The method includes the following steps that user's voice problem word segmentation is processed to obtain an original word sequence, and the original word sequence is vectorized through embedding processing; a slot label model yiS is obtained by weighting an implicit state vector hi and a slot context vector ciS; an intention anticipation model yI is obtained by weighting an implicit state vector hT and an intent context vector ciS; a slot door g is used for splicing the slot context vector ciS and the intent context vector ciS, and a slot gate g is used for converting the slot label model yiS; a target function is constructed through joint optimization of the intention anticipation model yI and the slot label model yiS obtained after conversion, and the user's voice problem is subjected to intention recognition based on the target function.

Description

technical field [0001] The present invention relates to the technical field of voice recognition, in particular to a human-computer interaction voice recognition method and system for smart devices. Background technique [0002] With the development of Internet technology, there are more and more smart devices using voice for human-computer interaction. The existing voice interaction systems include Siri, Xiaomi, Cortana, Xiaoice, Dumi, etc. Compared with traditional In terms of manual input human-computer interaction, it is convenient and efficient, and has a wide range of application scenarios. In the process of speech recognition, intent recognition and slot filling technology are the keys to ensure the accuracy of speech recognition results. [0003] For intent recognition, it can be abstracted as a classification problem, and then the CNN+knowledge representation classifier is used to train the intent recognition model. In the intent recognition model, in addition to wo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/22G10L15/26G10L17/22G10L15/06
CPCG10L15/26G10L15/06G10L17/22G10L15/22
Inventor 孙鹏飞贾洪园李春生
Owner SUNING COM CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products