Human-computer interactive speech recognition method and system used for intelligent equipment

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of human-computer interaction and intelligent equipment, which is applied in speech recognition, speech analysis, instruments, etc., and can solve the problems of combined optimization of intent recognition tasks and slot filling tasks, deviation of slot information filling, failure to meet application requirements, etc.

Inactive Publication Date: 2019-05-21

SUNING COM CO LTD

View PDF4 Cites 24 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] For intent recognition, it can be abstracted as a classification problem, and then the CNN+knowledge representation classifier is used to train the intent recognition model. In the intent recognition model, in addition to word embedding of the user's voice problem, the semantics of knowledge are also introduced. representation to increase the generalization ability of the presentation layer, but in practical applications, it is found that the model has the defect of slot information filling bias, which affects the accuracy of the intent recognition model

For slot filling, the essence is to formalize the sentence sequence into a label sequence. There are many commonly used methods for labeling sequences, such as hidden Markov models or conditional random field models, but these slot filling models are not suitable for specific applications. In the scene, due to the lack of context information, there will be ambiguity in slots under different semantic intentions, which cannot meet the actual application requirements

It can be seen that the training of the two models in the prior art is carried out independently, and there is no combined optimization for the intent recognition task and the slot filling task, which eventually leads to the problem of low recognition accuracy of the trained model in terms of speech recognition, reducing the user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0058] figure 1 It is a schematic flow chart of the human-computer interaction speech recognition method for smart devices in Embodiment 1 of the present invention. see figure 1 , this embodiment provides a human-computer interaction speech recognition method for smart devices, including:

[0059] Segment the user's voice problem to obtain the original word sequence, and vectorize the original word sequence through embedding processing; calculate the hidden state vector h of each word segmentation vector i and the slot context vector c i S , by adding the hidden state vector h i and the slot context vector c i S After weighting processing, the slot label model y is obtained i S ;Calculate the vectorized representation of the original word sequence hidden state vector hT and intention context vector c I , by combining the hidden state vector hT and the intention context vector c I After weighting processing, the intent prediction model y is obtained I ; use slot gate...

Embodiment 2

[0074] see figure 1 and Figure 4 , the present embodiment provides a human-computer interaction speech recognition system for smart devices, including:

[0075] The word segmentation processing unit 1 is used to process the user's voice problem word segmentation to obtain the original word sequence, and carry out vectorized representation to the original word sequence by embedding process;

[0076] The first calculation unit 2 is used to calculate the hidden state vector h of each word segmentation vector i and the slot context vector c i S , by dividing the hidden state vector h i and the slot context vector c i S After weighting processing, the slot label model y is obtained i S ;

[0077] The second calculation unit 3 is used to calculate the hidden state vector hT of the original word sequence represented by vectorization and the intention context vector c I , by combining the hidden state vector hT and the intention context vector c I After weighted processing...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a human-computer interactive speech recognition method and system used for intelligent equipment, and belongs to the technical field of speech recognition. The accuracy of speech recognition is improved through joint optimization training of intention recognition and slot filling. The method includes the following steps that user's voice problem word segmentation is processed to obtain an original word sequence, and the original word sequence is vectorized through embedding processing; a slot label model yiS is obtained by weighting an implicit state vector hi and a slot context vector ciS; an intention anticipation model yI is obtained by weighting an implicit state vector hT and an intent context vector ciS; a slot door g is used for splicing the slot context vector ciS and the intent context vector ciS, and a slot gate g is used for converting the slot label model yiS; a target function is constructed through joint optimization of the intention anticipation model yI and the slot label model yiS obtained after conversion, and the user's voice problem is subjected to intention recognition based on the target function.

Description

technical field [0001] The present invention relates to the technical field of voice recognition, in particular to a human-computer interaction voice recognition method and system for smart devices. Background technique [0002] With the development of Internet technology, there are more and more smart devices using voice for human-computer interaction. The existing voice interaction systems include Siri, Xiaomi, Cortana, Xiaoice, Dumi, etc. Compared with traditional In terms of manual input human-computer interaction, it is convenient and efficient, and has a wide range of application scenarios. In the process of speech recognition, intent recognition and slot filling technology are the keys to ensure the accuracy of speech recognition results. [0003] For intent recognition, it can be abstracted as a classification problem, and then the CNN+knowledge representation classifier is used to train the intent recognition model. In the intent recognition model, in addition to wo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/22G10L15/26G10L17/22G10L15/06

CPCG10L15/26G10L15/06G10L17/22G10L15/22

Inventor 孙鹏飞贾洪园李春生

Owner SUNING COM CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Human-computer interactive speech recognition method and system used for intelligent equipment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology