Wake-up audio determination method, device, equipment and storage medium

A technology for determining wake-up audio, applied in speech analysis, speech recognition, instruments, etc. It addresses the problems of poor alignment-model performance, low efficiency, and degraded performance of downstream models, and achieves a reduced amount of calculation and improved recognition efficiency.

Active Publication Date: 2021-02-23

SOUNDAI TECH CO LTD

Cites: 5; cited by: 0

Problems solved by technology

[0004] In the above method of obtaining labeled data from a trained acoustic model, the alignment result greatly affects the performance of subsequent models.

For example, if the alignment model performs poorly and its alignment results are inaccurate, a model trained with those results as label data will also perform poorly.

Obtaining highly accurate labeled data requires retraining the acoustic model on large-scale sample data, which is costly and inefficient.

Embodiment Construction

[0072] To make the purpose, technical solutions and advantages of the present application clearer, the embodiments of the present application are described in further detail below with reference to the accompanying drawings.

[0073] In this application, the terms "first" and "second" are used to distinguish identical or similar items that have basically the same function. It should be understood that "first", "second" and "nth" imply no logical or timing dependencies, nor any restriction on quantity or order of execution. It should also be understood that although the following description uses the terms first, second, etc. to describe various elements, these elements should not be limited by the terms. These terms are only used to distinguish one element from another. For example, a first image could be termed a second image, and, similarly, a second image could be termed a first image, without departing from the scope of the vario...

Abstract

The application discloses a wake-up audio determination method, device, equipment and storage medium, belonging to the technical field of speech. In the embodiments of the present application, wake-up audio and non-wake-up audio are modeled separately, each corresponding to a plurality of sentence states that form a sentence state sequence, so that classifying the audio features of a piece of audio determines whether it is more like wake-up audio or non-wake-up audio. Wake-up audio and non-wake-up audio are modeled directly and independently of each other rather than phoneme by phoneme, so no model trained on frame-level annotation data is needed, and the recognition process does not need to determine a recognition result for each phoneme. This can greatly reduce the amount of calculation and improve recognition efficiency.
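As a rough illustration of the idea in the abstract, the following Python sketch scores an utterance against two independent sentence state sequences and compares the results. It is not the method claimed in the application: the left-to-right path constraint, the Viterbi-style scoring, the `margin` parameter and the names `best_path_score` and `is_wake_up` are all assumptions made for illustration, and the per-frame log-probabilities are assumed to come from some upstream classifier.

```python
import math
from typing import Sequence

NEG_INF = float("-inf")


def best_path_score(frame_log_probs: Sequence[Sequence[float]]) -> float:
    """Best left-to-right path score through one sentence state sequence.

    frame_log_probs[t][s] is the log-probability of state s at frame t.
    Assumption: a path starts in state 0, ends in the last state, and at
    each frame either stays in its state or advances by exactly one.
    """
    n_states = len(frame_log_probs[0])
    # score[s] = best score of any allowed path ending in state s so far
    score = [NEG_INF] * n_states
    score[0] = frame_log_probs[0][0]
    for frame in frame_log_probs[1:]:
        prev = score
        score = []
        for s in range(n_states):
            stay = prev[s]
            advance = prev[s - 1] if s > 0 else NEG_INF
            score.append(max(stay, advance) + frame[s])
    return score[-1]


def is_wake_up(
    wake_log_probs: Sequence[Sequence[float]],
    non_wake_log_probs: Sequence[Sequence[float]],
    margin: float = 0.0,
) -> bool:
    """True if the audio fits the wake-up state sequence better.

    The two sequences are scored independently; `margin` is a
    hypothetical tuning knob, not something taken from the patent.
    """
    wake = best_path_score(wake_log_probs)
    non_wake = best_path_score(non_wake_log_probs)
    return wake - non_wake > margin
```

In this sketch only whole-utterance state sequences are scored, so no per-phoneme decision is made at any point, which is the property the abstract credits for the reduced amount of calculation.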

Description

Technical field

[0001] The present application relates to the field of voice technology, and in particular to a wake-up audio determination method, device, equipment and storage medium.

Background

[0002] In recent years, with the continuous development of audio processing technology, intelligent voice interaction systems such as smart speakers and vehicle-mounted voice interaction systems have become popular. To reduce user operations, such systems provide a voice wake-up function: the collected voice is recognized to determine whether it is a wake-up voice, and the device is woken up accordingly.

[0003] In the related art, the wake-up speech is usually determined by performing feature extraction on the speech to be processed to obtain a fixed-length speech feature and inputting it into a wake-up acoustic model for classification. The sample data required for the training of the wake-up acoustic model needs to have frame-l...
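The fixed-length feature step mentioned for the related art can be pictured with a minimal Python sketch. Everything here is an assumption for illustration: per-frame log-energy stands in for whatever acoustic feature a real system would use, and the names `frame_log_energies` and `to_fixed_length`, the frame sizes and the padding value are hypothetical.

```python
import math
from typing import List, Sequence

LOG_FLOOR = 1e-10  # small constant so that silent frames do not hit log(0)


def frame_log_energies(
    samples: Sequence[float], frame_len: int, hop: int
) -> List[float]:
    """Log-energy of each full frame (a stand-in for a real feature)."""
    feats = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        feats.append(math.log(energy + LOG_FLOOR))
    return feats


def to_fixed_length(
    feats: Sequence[float],
    n_frames: int,
    pad_value: float = math.log(LOG_FLOOR),
) -> List[float]:
    """Truncate or right-pad so the result always has n_frames entries."""
    clipped = list(feats[:n_frames])
    return clipped + [pad_value] * (n_frames - len(clipped))
```

A classifier that expects a fixed input size could then consume `to_fixed_length(frame_log_energies(samples, 400, 160), 100)` regardless of how long the recording is; those particular numbers are placeholders, not values from the application.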

Application Information

Patent Type & Authority: Patents (China)

IPC (8): G10L15/06, G10L15/02, G10L15/10, G10L15/22

CPC: G10L15/02, G10L15/063, G10L15/10, G10L15/22, G10L2015/0631, G10L2015/223

Inventor: 陈孝良, 冯大航, 陈天峰, 常乐

Owner: SOUNDAI TECH CO LTD
