Plurilingual voice decoding diagram establishment method, device, server and medium

A multilingual decoding graph technique, applied in speech analysis, speech recognition, instruments, etc., that addresses the limits of single-language recognition and the poor recognition performance caused by speakers' differing pronunciation habits.

Active Publication Date: 2019-04-12

北京如布科技有限公司


Problems solved by technology

With the development of global diversification, users are no longer satisfied with single-language recognition.

[0003] Most existing speech recognition systems are monolingual. Even where multilingual speech recognition methods exist, taking Chinese and English as an example, recognition performance is poor because of the particularities of Chinese-accented English pronunciation and the differing habits of individual speakers.

Examples

Embodiment 1

[0025] Figure 1 is a flow chart of a method for constructing a multilingual speech decoding graph provided by Embodiment 1 of the present invention. This embodiment is applicable to recognizing multiple languages in speech, and the method can be executed by a device for constructing a multilingual speech decoding graph. Referring to Figure 1, the method specifically includes:

[0026] Step 101: phonetically transcribe the main language words and the secondary language words included in the sample corpus, and obtain the pronunciation phonemes of the main language words and the secondary language words.

[0027] Here, the sample corpus refers to the language material used to train or optimize the acoustic model and the language model. The sample corpus may be a corpus including at least one sample text, and the sample corpus in this embodiment includes at least two languages. In order to improve the standardization of the words in the sample corpus and improve t...
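As an illustration of Step 101, the following minimal sketch looks up pronunciation phonemes for a sample text that mixes a main language (Chinese) and a secondary language (English). The lexicon contents, the phoneme symbols, and the names `MAIN_LEXICON`, `SECONDARY_LEXICON` and `transcribe` are all hypothetical; the source does not say which dictionaries or phoneme inventories are used.

```python
# Hypothetical toy lexicons (assumption): a real system would load full
# pronunciation dictionaries for each language.
MAIN_LEXICON = {
    "打开": ["d", "a3", "k", "ai1"],
    "音乐": ["y", "in1", "y", "ue4"],
}
SECONDARY_LEXICON = {
    "wifi": ["W", "AY", "F", "AY"],
    "app": ["AE", "P"],
}


def transcribe(words):
    """Return (word, language, phonemes) for each word in a sample text."""
    result = []
    for word in words:
        key = word.lower()  # normalise case so "WiFi" matches "wifi"
        if key in MAIN_LEXICON:
            result.append((word, "main", MAIN_LEXICON[key]))
        elif key in SECONDARY_LEXICON:
            result.append((word, "secondary", SECONDARY_LEXICON[key]))
        else:
            raise KeyError(f"no pronunciation for {word!r}")
    return result


if __name__ == "__main__":
    for word, lang, phones in transcribe(["打开", "WiFi"]):
        print(word, lang, " ".join(phones))
```

Lower-casing before lookup is one simple way to standardize secondary language words; whether the patented method normalizes words this way is not stated in the visible text.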

Embodiment 2

[0041] Figure 2 is a schematic structural diagram of a device for constructing a multilingual speech decoding graph provided in Embodiment 2 of the present invention. The device can execute the method for constructing a multilingual speech decoding graph provided in any embodiment of the present invention, and has the functional modules and beneficial effects corresponding to that method. As shown in Figure 2, the device may include:

[0042] The phonetic transcription module 21, used to phonetically transcribe the main language words and the secondary language words included in the sample corpus, and obtain the pronunciation phonemes of the main language words and the secondary language words;

[0043] The acoustic feature determination module 22, used to determine the acoustic features of the main language words and the secondary language words according to the sample speech associated with the sample corpora in the sample corpus bank;

[0044] The decoding ...
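The module structure of the device can be sketched as three small functions, one per module (phonetic transcription module 21, acoustic feature determination module 22, decoding graph construction module 23). This is only a toy under stated assumptions: the log-energy feature, the phoneme prefix tree, and every function name here are hypothetical stand-ins. The source does not specify the feature type or the graph representation, and the features computed below are not used to train an acoustic model.

```python
import math


def phonetic_transcription(words, lexicon):
    """Stand-in for module 21: look up phonemes for each corpus word."""
    return {w: lexicon[w] for w in words}


def acoustic_features(samples, frame_len=4):
    """Stand-in for module 22: one log-energy value per non-overlapping frame."""
    feats = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        feats.append(math.log(energy + 1e-10))  # offset avoids log(0)
    return feats


def build_decoding_graph(pronunciations):
    """Stand-in for module 23: phoneme prefix tree whose nodes emit words."""
    root = {"next": {}, "words": []}
    for word, phones in pronunciations.items():
        node = root
        for p in phones:
            node = node["next"].setdefault(p, {"next": {}, "words": []})
        node["words"].append(word)
    return root


def lookup(graph, phones):
    """Follow a phoneme sequence through the graph; return the words it spells."""
    node = graph
    for p in phones:
        node = node["next"].get(p)
        if node is None:
            return []
    return node["words"]


if __name__ == "__main__":
    # Hypothetical mixed-language lexicon (assumption).
    lexicon = {"打开": ["d", "a3", "k", "ai1"], "wifi": ["W", "AY", "F", "AY"]}
    graph = build_decoding_graph(phonetic_transcription(["打开", "wifi"], lexicon))
    print(lookup(graph, ["W", "AY", "F", "AY"]))
    print(len(acoustic_features([0.0, 0.5, -0.5, 0.0, 1.0, -1.0, 1.0, -1.0])))
```

Because main and secondary language words share one graph, a single pass over a phoneme sequence can yield words from either language, which is the property a mixed-language decoding graph needs. Production recognizers commonly use weighted graphs that also encode acoustic and language model scores; that is omitted here.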

Embodiment 3

[0051] Figure 3 is a schematic structural diagram of a server provided by Embodiment 3 of the present invention. As shown in Figure 3, the server includes a processor 30, a memory 31, an input device 32 and an output device 33. The server may contain one or more processors 30; Figure 3 takes one processor 30 as an example. The processor 30, memory 31, input device 32 and output device 33 in the server may be connected by a bus or by other means; Figure 3 takes connection via a bus as an example.

[0052] The memory 31, as a computer-readable storage medium, can be used to store software programs, computer-executable programs and modules, such as the program instructions/modules corresponding to the method for constructing the multilingual speech decoding graph in the embodiments of the present invention (for example, the phonetic transcription module 21, the acoustic feature determination module 22 and the decoding graph construction module 23). The processor 30 executes various functional appl...


Abstract

The embodiments of the invention disclose a method, device, server and medium for constructing a multilingual speech decoding graph, and relate to the technical field of speech recognition. The method comprises the following steps: phonetically transcribing the main language words and secondary language words in a sample corpus bank to obtain the pronunciation phonemes of the main language words and the secondary language words; determining the acoustic features of the main language words and the secondary language words according to the sample speech associated with the sample corpora in the sample corpus bank; and determining a decoding graph for multilingual recognition according to the main language words and secondary language words in the sample corpora, together with their pronunciation phonemes and acoustic features. In this way, the pronunciation phonemes of the main language words and the secondary language words are obtained from the sample corpus bank, the acoustic features associated with those words are then determined, and a decoding graph for multilingual recognition is finally obtained, which meets the speech recognition needs of users who mix several languages when speaking.

Description

Technical field

[0001] The embodiments of the present invention relate to the technical field of speech recognition, and in particular to a method, device, server and medium for constructing a multilingual speech decoding graph.

Background

[0002] Speech recognition is a technology that allows machines to convert voice signals into corresponding text or commands through a process of recognition and understanding. For example, in a smart home, after waking up a smart device, the user only needs to speak the corresponding command to operate it. Once the device correctly recognizes the user's speech, it acts according to the intent of that speech. Speech recognition plays an important role in human-computer interaction and has broad development prospects in many fields of modern society. However, with the development of global diversification, users are no longer satisfied with single-language recognition.

[0003] Most of the ...


Application Information


IPC(8): G10L15/02, G10L15/06, G10L15/14, G10L15/18, G10L15/26

CPC: G10L15/02, G10L15/063, G10L15/14, G10L15/18, G10L15/26, G10L2015/025

Inventor: 何金来韩虎雷宇

Owner: 北京如布科技有限公司
