Plurilingual voice decoding diagram establishment method, device, server and medium
A decoding map and multilingual technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of single-language recognition, different habits of each person, and low recognition effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0025] figure 1 It is a flow chart of a method for constructing a multilingual speech decoding map provided by Embodiment 1 of the present invention. This embodiment is applicable to the situation of recognizing multiple languages in speech, and the method can be provided by a multilingual speech Decode graph construction means to execute. see figure 1 , the method specifically includes:
[0026] Step 101 , transliterate the main language words and the secondary language words included in the sample corpus, and obtain the pronunciation phonemes of the main language words and the secondary language words.
[0027] Wherein, the sample corpus refers to the language material used for training or optimizing the acoustic model and the language model, the sample corpus may be a corpus including at least one sample text, and the sample corpus in this embodiment includes at least two languages. In order to improve the standardization of the words in the sample corpus and improve t...
Embodiment 2
[0041] figure 2 It is a schematic structural diagram of a device for constructing a multilingual speech decoding graph provided in Embodiment 2 of the present invention. The device can execute the method for constructing a multilingual speech decoding graph provided in any embodiment of the present invention, and has corresponding functional modules for executing the method and beneficial effects. Such as figure 2 As shown, the device may include:
[0042] The phonetic marking module 21 is used to carry out phonetic marking on the main language words and the secondary language words included in the sample corpus, and obtain the pronunciation phonemes of the main language words and the secondary language words;
[0043] The acoustic feature determination module 22 is used to determine the acoustic features of the main language words and the secondary language words according to the sample speech associated with the sample corpus in the sample corpus;
[0044] The decoding ...
Embodiment 3
[0051] image 3 A schematic structural diagram of a server provided by Embodiment 3 of the present invention, such as image 3 As shown, the server includes a processor 30, a memory 31, an input device 32 and an output device 33; the number of processors 30 in the server can be one or more, image 3 Take a processor 30 as an example; the processor 30, memory 31, input device 32 and output device 33 in the server can be connected by bus or other methods, image 3 Take connection via bus as an example.
[0052] Memory 31, as a computer-readable storage medium, can be used to store software programs, computer-executable programs and modules, such as program instructions / modules corresponding to the construction method of the multilingual speech decoding map in the embodiment of the present invention (for example, phonetic transcription module 21, acoustic feature determination module 22 and decoding map construction module 23). The processor 30 executes various functional appl...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com