Far-field speech recognition processing method and device
A speech recognition and processing method technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of poor denoising processing effect and high equipment cost investment, achieve the best denoising processing effect, low equipment cost investment, and realize Simple and convenient effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0058] Embodiment 1 of the present invention provides a far-field speech recognition processing method, the flow of which is as followsfigure 1 shown, including the following steps:
[0059] Step S101: receiving far-field voice.
[0060] The device used for far-field voice processing receives far-field voice through the set receiving module, and performs subsequent de-reverberation and de-noising processing to obtain better-quality voice.
[0061] Step S102: Input the received far-field speech into the pre-trained speech training model based on the neural network.
[0062] After receiving the far-field voice, input the far-field voice into the voice training model for de-reverberation and de-noising processing, where the voice training model can choose a pre-trained voice training model based on a neural network (Deep Neural Network, DNN) .
[0063] The training process of the voice training model is also a learning process. By recording near-field sounds, near-field audio f...
Embodiment 2
[0070] Embodiment 2 of the present invention provides the training process of the neural network-based speech training model in the above-mentioned far-field speech recognition processing method, and its flow is as follows figure 2 shown, including the following steps:
[0071] Step S201: Record near-field voice.
[0072] The training of the neural network-based speech training model is actually a learning process. First, the characteristics of the near-field speech are learned by recording the near-field speech.
[0073] Step S202: Obtain near-field audio features from the recorded near-field voice.
[0074] After the near-field sound is recorded, near-field audio features are extracted from the near-field sound to realize the learning of near-field speech features.
[0075] Step S203: adding the ambient sound of the far-field speech to the near-field speech to obtain the simulated far-field speech.
[0076] In the training process, after learning the audio characteristic...
Embodiment 3
[0089] Embodiment 3 of the present invention provides a specific implementation method for far-field speech recognition processing, the process of which is as follows Figure 4 shown, including the following steps:
[0090] Step S301: Receive far-field voice.
[0091] Step S302: Input the received far-field speech into the pre-trained speech training model based on the neural network.
[0092] The neural network-based speech training model in this embodiment is a speech training model that does not incorporate an acoustic model, and this model only realizes the processing from far-field speech to near-field speech.
[0093] Step S303: Obtain the audio features of the far-field speech and the near-field speech included in the speech training model.
[0094] Step S304: According to the acquired audio features, de-interference processing is performed on the audio features of the received far-field voice to obtain the processed far-field voice.
[0095] Step S305: Input the pro...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com