Voice adaptive recognition method, system and device and storage medium
A recognition method and self-adaptive technology, applied in speech recognition, speech analysis, instruments, etc., to achieve the effect of improving the recognition rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0036] Such as figure 2 As shown, a speech adaptive recognition method mainly includes the following steps:
[0037] Step 1. Use the acoustic model to encode the input speech sequence into a deep feature sequence.
[0038] In the embodiment of the present invention, the acoustic model is a CTC-based end-to-end model.
[0039] Step 2. Apply the CTC criterion to the depth feature sequence, and convert the depth feature sequence into a probability distribution sequence. During the conversion process, each depth feature is activated through several hidden layers of the acoustic model, and at least one hidden layer is activated through the corresponding The attention-based gated scaling adaptation layer generates scaling transformation vectors for the corresponding deep features, and uses the scaling transformation vectors to reweight the activation outputs of the corresponding hidden layers.
[0040] In the embodiment of the present invention, the attention-based gated scaling ...
Embodiment 2
[0096] The present invention also provides a speech recognition system, which mainly includes: an acoustic model, and an attention-based gating scaling adaptive network composed of a plurality of attention-based gating scaling adaptive layers. Based on the acoustic model and the attention-based gated scaling adaptive network, the speech recognition is realized by adopting the solutions introduced in the foregoing method embodiments. The specific identification process has been introduced in detail in the aforementioned method embodiments, so it will not be repeated here.
Embodiment 3
[0098] The present invention also provides a processing device, such as Image 6 As shown, it mainly includes: one or more processors; memory for storing one or more programs; wherein, when the one or more programs are executed by the one or more processors, the One or more processors implement the methods provided in the foregoing embodiments.
[0099] Further, the processing device further includes at least one input device and at least one output device; in the processing device, the processor, memory, input device, and output device are connected through a bus.
[0100] In the embodiment of the present invention, the specific types of the memory, input device and output device are not limited; for example:
[0101] The input device can be a touch screen, an image acquisition device, a physical button or a mouse, etc.;
[0102] The output device can be a display terminal;
[0103] The memory may be random access memory (Random Access Memory, RAM), or non-volatile memory ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com