Chinese speech recognition method combining Transformer and CNN-DFSMN-CTC
A CNN-DFSMN-CTC, speech recognition technology, applied in speech recognition, speech analysis, instruments and other directions, can solve the problem of not being applied speech recognition and so on
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0073] The technical solutions in the embodiments of the present invention will be described clearly and in detail below with reference to the drawings in the embodiments of the present invention. The described embodiments are only some of the embodiments of the invention.
[0074] The technical scheme that the present invention solves the problems of the technologies described above is:
[0075] Such as figure 1 As shown, the present invention provides a kind of acoustic model based on CNN-DFSMN-CTC, Transformer is the speech recognition method of language model, it is characterized in that, comprises the following steps:
[0076] S1, the speech signal is preprocessed, combined with the low frame rate LFR, the speech signal is pre-emphasized first, and then analyzed through a fixed 10ms frame shift 25ms Hamming window, and 80 mel filter banks are used to extract 80-dimensional Take the logarithmic Mel filter (Filter banks, Fbank) feature;
[0077] S2, the extracted 80-dime...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com