Speech recognition method and apparatus, terminal, and computer readable storage medium
A speech recognition and speech technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of low recognition result confidence, high false recognition rate, and high recognition result confidence, so as to reduce the false recognition rate and avoid recognition as Effects of command words
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0025] figure 1 It is a flow chart of the speech recognition method provided in Embodiment 1 of the present invention. This embodiment is applicable to the situation of command word recognition. The method can be executed by a speech recognition device, and specifically includes the following steps:
[0026] Step 110, according to the acoustic characteristics of the collected speech, calculate the acoustic similarity probability between the speech and the phoneme sequence in the decoding network;
[0027] Wherein, the decoding network includes multiple groups of phoneme sequences; each group of phoneme sequences corresponds to a preset command word content or corresponding noise content; since the embodiment of the present invention is applied to the recognition of voice commands, any non-command word speech is In terms of command word recognition, it is all interference, so it is all noise, and the noise in the embodiment of the present invention refers to any non-command wor...
Embodiment 2
[0040] figure 2 It is a flow chart of the speech recognition method provided by Embodiment 2 of the present invention. This embodiment is applicable to command word recognition, and the method can be executed by a speech recognition device. In this embodiment, on the basis of the speech recognition method in Embodiment 1, a step of automatically adjusting decoding network parameters is added, so that the speech recognition method can dynamically modify parameters and continuously reduce the misrecognition rate. The voice recognition method provided in this embodiment includes:
[0041] Step 210, according to the acoustic characteristics of the collected speech, calculate the acoustic similarity probability between the speech and the phoneme sequence in the decoding network; wherein, the decoding network includes multiple sets of phoneme sequences; each set of phoneme sequences corresponds to a preset Command word content or corresponding noise content;
[0042] Step 220, ac...
Embodiment 3
[0051] image 3 It is a schematic structural diagram of the speech recognition device provided by Embodiment 3 of the present invention. The speech recognition device includes:
[0052] Calculation module 310, for calculating the acoustic similarity probability between the speech and the phoneme sequence in the decoding network according to the acoustic features of the collected speech; wherein, the decoding network includes multiple groups of phoneme sequences; each group of phoneme sequences corresponds to a Preset command word content or corresponding noise content;
[0053] A matching module 320, configured to obtain a matching probability between the speech and the phoneme sequence according to the acoustic similarity probability;
[0054] The recognition module 330 is configured to recognize the speech as the content corresponding to the phoneme sequence with the highest matching probability.
[0055] Preferably, said decoding network is constructed using weighted fin...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com