Wake-up model generation method and intelligent terminal wake-up method and device
A model generation and model technology, which is applied in the field of data security, can solve the problems of unrecognizable wake-up speech, short wake-up word time, and insufficient training of neural networks, etc., to reduce manual data processing, reduce computing time and power consumption , Improve the effect of wake-up effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0064] An embodiment of the present invention provides a wake-up model generation method, which can be applied to a server, such as figure 1 As shown, the method may include the steps of:
[0065] 101. Mark the start and end time of each wake-up word included in the wake-up word audio in the sample audio set, and obtain the marked wake-up word audio, wherein the time length of the wake-up word audio is not fixed.
[0066] Wherein, the sample audio set includes multiple wake-up word audios, and each wake-up word audio includes at least one wake-up word. During specific implementation, multiple wake-up word audios containing wake-up words can be recorded in a quiet environment, wherein, when recording a wake-up word audio, a certain time interval needs to be reserved between adjacent wake-up words, and each wake-up word The contents are all the same, such as "small biu small biu". In this embodiment, the audio duration of each wake-up word is approximately several seconds to s...
Embodiment 2
[0088] An embodiment of the present invention provides a method for waking up a smart terminal, which can be applied to a smart terminal. The smart terminal is pre-deployed with a wake-up model generated based on the method for generating a wake-up model in the first embodiment above, as shown in Figure 5 As shown, the method may include the steps of:
[0089] 501. The smart terminal acquires real-time audio at the current moment.
[0090] Specifically, the smart terminal can use a microphone to collect real-time audio at the current moment in the scene. Among them, smart terminals include but are not limited to robots, smart phones, wearable devices, smart homes, and vehicle terminals.
[0091] 502. Extract multiple audio frame features from real-time audio.
[0092] Specifically, with the preset window width W, moving step size S and Mel frequency cepstral coefficient C Mel , respectively extracting Mel-frequency cepstral coefficient features from each audio frame of the...
Embodiment 3
[0101] As an implementation of the wake-up model generation method provided in the first embodiment above, the embodiment of the present invention provides a wake-up model generation device, such as Figure 7 As shown, the device includes:
[0102] The first marking module 71 is used to mark the start and end time of each wake-up word included in the wake-up word audio in the sample audio set, and obtain the marked wake-up word audio, wherein the time length of the wake-up word audio is not fixed;
[0103] The noise-adding processing module 72 is used to add noise to the marked wake-up word audio by using the negative sample audio containing background noise to obtain the positive sample audio;
[0104] Feature extraction module 73, for extracting a plurality of audio frame features respectively from positive sample audio and negative sample audio;
[0105] The second labeling module 74 is used for carrying out the labeling of frame label to positive sample audio frequency an...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com