Multi-dialect mixed speech recognition method, device, system and storage medium
A technology of mixed speech and recognition methods, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of high speech recognition accuracy, difficult to guarantee, unable to guarantee the accuracy of multi-dialect mixed speech speech recognition, etc., to achieve effective Recognition, high accuracy effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030] figure 1 It is a schematic flowchart of a multi-dialect mixed speech recognition method provided by Embodiment 1 of the present invention. This embodiment can be applied to the situation where effective recognition of multi-dialect mixed speech is realized based on the existing dialect recognition subsystem corresponding to each dialect. , the method can be executed by a multi-dialect mixed speech recognition device, the device can be realized by software and / or hardware, and can be integrated in a multi-dialect mixed speech recognition system.
[0031] It is understandable that the existing speech recognition technology mostly recognizes the speech of a single language, and can achieve a high accuracy rate of speech recognition; however, it cannot effectively recognize the mixed speech of multiple dialects, let alone guarantee a high accuracy. speech recognition accuracy. The purpose of the present invention is to use the existing speech recognition technology for a s...
Embodiment 2
[0068] image 3 It is a schematic flowchart of a multi-dialect mixed speech recognition method provided in Embodiment 2 of the present invention. This embodiment is further optimized on the basis of Embodiment 1. In this embodiment, adding each semantic text and timeline information to the historical word segmentation set of the corresponding dialect recognition subsystem is embodied as: for each semantic text, it is judged whether the target voice corresponding to the semantic text is the The initial speech to be recognized; if the target speech corresponding to the semantic text is the initial speech to be recognized, then the semantic text is determined to be the first semantic text corresponding to the dialect recognition subsystem that generates the semantic text, and Add the binary information group composed of the first semantic text and timeline information to the historical word segmentation set corresponding to the dialect recognition subsystem that generates the fir...
Embodiment 3
[0096] Figure 4 It is a structural schematic diagram of a multi-dialect mixed speech recognition device provided in Embodiment 3 of the present invention. This embodiment can be applied to the situation where effective recognition of multi-dialect mixed speech is realized based on the existing dialect recognition subsystem corresponding to each dialect. , the device can be implemented by software and / or hardware, and specifically includes: a semantic acquisition module 301 , a semantic addition module 302 , an unprocessed acquisition module 303 , a sequence formation module 304 , and a result determination module 305 . in,
[0097] The semantic acquisition module 301 is configured to use the initial speech to be recognized as the target speech, and obtain the semantic text obtained by processing the target speech by at least one dialect recognition subsystem and the timeline information corresponding to the semantic text, each of the dialect recognition The type of dialect c...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


