Voice recognition method and device
A speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., that addresses the problem of low speech recognition performance and robustness and achieves the effect of improving performance and robustness.
Examples
no. 1 example
[0066] Please refer to Figure 1, which is a flow chart of an embodiment of the speech recognition method provided by the present application. The executing body of the method includes a speech recognition device. The speech recognition method provided by the present application includes:
[0067] Step S101: Obtain voice data to be recognized and image data corresponding to the voice data.
[0068] The voice data to be recognized, and the way it is obtained, are described first.
[0069] The voice data is a sequence of sampled values of the voice signal, ordered by time. The magnitude of each sampled value represents the energy of the voice signal at that sampling point: the energy of the silent part is small, and the energy of the active voice part is relatively large. The voice signal is a one-dimensional continuous function with time as the independent variable. In the voice signal, the amplitude of the sound wave in the silent part is very small, ...
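The relationship between sample values and energy described in [0069] can be illustrated with a minimal sketch. It is not taken from the application: the function names `frame_energies` and `is_active`, the frame length, the threshold, and the toy signal are all assumptions chosen for illustration. The sketch computes the mean squared amplitude of each non-overlapping frame and flags frames whose energy exceeds a threshold as active voice.

```python
from typing import List, Sequence


def frame_energies(samples: Sequence[float], frame_len: int) -> List[float]:
    """Mean squared amplitude of each non-overlapping frame of `samples`."""
    if frame_len <= 0:
        raise ValueError("frame_len must be positive")
    energies = []
    # Step through the signal one full frame at a time;
    # a trailing partial frame is dropped.
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame) / frame_len)
    return energies


def is_active(energies: Sequence[float], threshold: float) -> List[bool]:
    """True for frames whose energy exceeds the (assumed) threshold."""
    return [e > threshold for e in energies]


# Toy signal (illustrative values): four near-silent samples, then four louder ones.
signal = [0.01, -0.02, 0.01, 0.0, 0.5, -0.6, 0.4, -0.5]
energies = frame_energies(signal, frame_len=4)
flags = is_active(energies, threshold=0.01)
```

In this toy signal the first frame has much lower energy than the second, so only the second frame is flagged as active. A fixed threshold is the simplest choice; it is used here only to make the silent/active distinction concrete.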
no. 2 example
[0125] Please refer to Figure 5, which is a schematic diagram of an embodiment of the speech recognition device of the present application. Since the device embodiment is basically similar to the method embodiment, the description is relatively brief; for relevant parts, refer to the corresponding description of the method embodiment. The device embodiments described below are illustrative only.
[0126] The present application additionally provides a speech recognition device, including:
[0127] A data acquisition unit 501, configured to acquire voice data to be recognized and image data corresponding to the voice data;
[0128] A feature extraction unit 502, configured to extract the acoustic features of the voice data through the acoustic feature extraction subnetwork included in the acoustic model; and to extract, from the image data, through the visual feature extraction subnetwork included in the acoustic model, the visual features corresponding to the voic...
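The division of labor in units 501 and 502 can be sketched as follows. This is a structural illustration only, under stated assumptions: the class `AcousticModel`, the method `extract_features`, and the two placeholder functions are hypothetical names, and the placeholders compute simple statistics where the application would use trained subnetworks. The sketch covers only the two extraction steps named above; what the model does with the features afterwards is not shown because the source text is truncated at that point.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Vector = List[float]
Image = Sequence[float]  # one flattened image frame (hypothetical representation)


def toy_acoustic_subnet(voice: Sequence[float]) -> Vector:
    # Placeholder for a trained subnetwork: mean absolute amplitude and peak amplitude.
    mags = [abs(x) for x in voice]
    return [sum(mags) / len(mags), max(mags)]


def toy_visual_subnet(images: Sequence[Image]) -> Vector:
    # Placeholder for a trained subnetwork: mean pixel intensity of each image frame.
    return [sum(img) / len(img) for img in images]


@dataclass
class AcousticModel:
    """Holds the two feature extraction subnetworks as plain callables."""
    acoustic_subnet: Callable[[Sequence[float]], Vector]
    visual_subnet: Callable[[Sequence[Image]], Vector]

    def extract_features(
        self, voice: Sequence[float], images: Sequence[Image]
    ) -> Tuple[Vector, Vector]:
        # Each modality goes through its own subnetwork.
        return self.acoustic_subnet(voice), self.visual_subnet(images)


model = AcousticModel(toy_acoustic_subnet, toy_visual_subnet)
acoustic_feat, visual_feat = model.extract_features(
    voice=[0.1, -0.3, 0.2],
    images=[[0.0, 1.0], [0.5, 0.5]],
)
```

Passing the subnetworks in as callables keeps the sketch independent of any particular neural network library; in practice each callable would wrap a trained model.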
no. 3 example
[0142] Please refer to Figure 7, which is a schematic diagram of an embodiment of the electronic device of the present application. Since the device embodiment is basically similar to the method embodiment, the description is relatively brief; for relevant parts, refer to the corresponding description of the method embodiment. The device embodiments described below are illustrative only.
[0143] The electronic device of this embodiment includes a processor 701 and a memory 702. The memory is used to store a program implementing the speech recognition method. After the device is powered on and the processor runs the program, the following steps are performed: acquire the voice data to be recognized and the image data corresponding to the voice data; extract the acoustic features of the voice data through the acoustic feature extraction subnetwork included in the acoustic model; and, through the visual featur...