Wake-up model generation method and intelligent terminal wake-up method and device

A model generation and model technology, which is applied in the field of data security, can solve the problems of unrecognizable wake-up speech, short wake-up word time, and insufficient training of neural networks, etc., to reduce manual data processing, reduce computing time and power consumption , Improve the effect of wake-up effect

Active Publication Date: 2020-04-07
SUNING CLOUD COMPUTING CO LTD
View PDF9 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the data preparation stage, it is necessary to manually intercept the positive sample data to a fixed length of time t, and the duration of recording the wake-up words cannot exceed this length of time t, which will greatly increase labor costs, and cannot be used for slow-speed wake-up voices. In addition, due to the short time of the wake-up word, the training of the neural network is insufficient, which will eventually affect the wake-up effect of the smart terminal; in addition, in the terminal wake-up phase, because the neural network needs to process the time in the terminal memory each time Audio with a length of t, so that there will be a large amount of repeated data to be processed between two adjacent time lengths t, thus increasing the calculation time and power consumption of the terminal

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Wake-up model generation method and intelligent terminal wake-up method and device
  • Wake-up model generation method and intelligent terminal wake-up method and device
  • Wake-up model generation method and intelligent terminal wake-up method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0064] An embodiment of the present invention provides a wake-up model generation method, which can be applied to a server, such as figure 1 As shown, the method may include the steps of:

[0065] 101. Mark the start and end time of each wake-up word included in the wake-up word audio in the sample audio set, and obtain the marked wake-up word audio, wherein the time length of the wake-up word audio is not fixed.

[0066] Wherein, the sample audio set includes multiple wake-up word audios, and each wake-up word audio includes at least one wake-up word. During specific implementation, multiple wake-up word audios containing wake-up words can be recorded in a quiet environment, wherein, when recording a wake-up word audio, a certain time interval needs to be reserved between adjacent wake-up words, and each wake-up word The contents are all the same, such as "small biu small biu". In this embodiment, the audio duration of each wake-up word is approximately several seconds to s...

Embodiment 2

[0088] An embodiment of the present invention provides a method for waking up a smart terminal, which can be applied to a smart terminal. The smart terminal is pre-deployed with a wake-up model generated based on the method for generating a wake-up model in the first embodiment above, as shown in Figure 5 As shown, the method may include the steps of:

[0089] 501. The smart terminal acquires real-time audio at the current moment.

[0090] Specifically, the smart terminal can use a microphone to collect real-time audio at the current moment in the scene. Among them, smart terminals include but are not limited to robots, smart phones, wearable devices, smart homes, and vehicle terminals.

[0091] 502. Extract multiple audio frame features from real-time audio.

[0092] Specifically, with the preset window width W, moving step size S and Mel frequency cepstral coefficient C Mel , respectively extracting Mel-frequency cepstral coefficient features from each audio frame of the...

Embodiment 3

[0101] As an implementation of the wake-up model generation method provided in the first embodiment above, the embodiment of the present invention provides a wake-up model generation device, such as Figure 7 As shown, the device includes:

[0102] The first marking module 71 is used to mark the start and end time of each wake-up word included in the wake-up word audio in the sample audio set, and obtain the marked wake-up word audio, wherein the time length of the wake-up word audio is not fixed;

[0103] The noise-adding processing module 72 is used to add noise to the marked wake-up word audio by using the negative sample audio containing background noise to obtain the positive sample audio;

[0104] Feature extraction module 73, for extracting a plurality of audio frame features respectively from positive sample audio and negative sample audio;

[0105] The second labeling module 74 is used for carrying out the labeling of frame label to positive sample audio frequency an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a wake-up model generation method, an intelligent terminal wake-up method and device. The invention discloses a wake-up model generation method, belongs to the technical fieldof voice wake-up, and the wake-up model generation method comprises the following steps: marking the start and stop time of each wake-up word contained in wake-up word audios in a sample audio set, and obtaining marked wake-up word audios, and the time length of the wake-up word audios is not fixed; adding noise to the marked wake-up word audio by using a negative sample audio containing background noise to obtain a positive sample audio; respectively extracting a plurality of audio frame features from the positive sample audio and the negative sample audio, and performing frame label labelingon the positive sample audio and the negative sample audio to obtain a plurality of audio training samples; and training the recurrent neural network by using the plurality of audio training samplesto generate a wake-up model. According to the embodiment of the invention, the model training is carried out by using the variable-length input recurrent neural network, so that the operation of manually intercepting the sample can be avoided, and the wake-up effect of the intelligent terminal is improved.

Description

technical field [0001] The present invention relates to the technical field of data security, in particular to a wake-up model generation method, a smart terminal wake-up method and a device. Background technique [0002] At present, voice wake-up has a wide range of applications, such as robots, mobile phones, wearable devices, smart homes, and vehicles. Different smart terminals will have different wake-up words. When the user speaks a specific wake-up word, the smart terminal can switch from the standby state to the working state. Only when the state switching is completed quickly and accurately can the user use it almost unconsciously. Other functions of the smart terminal, therefore, it is very important to improve the wake-up effect. [0003] In the prior art, a wake-up technology based on a neural network is mainly used for waking up an intelligent terminal. In the data preparation stage, it is necessary to manually intercept the positive sample data to a fixed time...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/02G10L15/22G10L25/18G10L25/30
CPCG10L15/063G10L15/02G10L15/22G10L25/18G10L25/30G10L2015/225Y02D30/70
Inventor 白二伟倪合强宋志姚寿柏
Owner SUNING CLOUD COMPUTING CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products