Speech recognition method, device, equipment and storage medium supporting multi-language mixing

A speech recognition method and speech recognition model technology, applied in speech recognition, speech analysis, instruments, etc. It addresses the problems of high computing-resource consumption, poor recognition performance, and low recognition accuracy, and achieves high-accuracy speech recognition with a wide scope of application.

Status: Pending; Publication Date: 2021-07-30
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0004] Among the above-mentioned schemes, Schemes 1 and 2 rely on high-quality language classifiers and consume substantial computing resources. Scheme 3 has a simple system design and low computational complexity, but it lacks sufficient language discrimination. For similar pronunciation units, and for languages that make up only a small proportion of the training corpus duration, its recognition performance is generally poor, resulting in low recognition accuracy.

Examples

Detailed Description of the Embodiments

[0049] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0050] The invention provides a speech recognition method supporting multilingual mixing. Figure 1 is a schematic flowchart of the speech recognition method supporting multilingual mixing provided by an embodiment of the present invention. The method may be performed by a device, and the device may be implemented in software and/or hardware.

[0051] In this embodiment of the present invention, the speech recognition method supporting multilingual mixing mainly includes: obtaining speech features of training data; obtaining high-dimensional features respectively corresponding to the speech features through at least two parallel networks; splicing the high-dimensional features output by the parallel networks to obtain the spliced features corresponding to the training data; training the neural network model...
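The parallel-network and feature-splicing steps can be illustrated with a minimal sketch. The source does not specify the network architectures or feature dimensions, so everything here is an assumption made for illustration: each "network" is a hypothetical single tanh layer with fixed random weights, and the names `make_branch` and `splice_features` are invented for this example.

```python
import math
import random


def make_branch(in_dim, out_dim, seed):
    """Build one hypothetical branch: a single tanh layer with fixed random weights."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)]
               for _ in range(out_dim)]

    def branch(frame):
        # Map one frame of speech features to a higher-dimensional vector.
        return [math.tanh(sum(w * x for w, x in zip(row, frame)))
                for row in weights]

    return branch


def splice_features(frames, branches):
    """Run every frame through each parallel branch and concatenate the outputs."""
    spliced = []
    for frame in frames:
        joined = []
        for branch in branches:
            joined.extend(branch(frame))
        spliced.append(joined)
    return spliced


# Toy input: 3 frames of 4-dimensional speech features (placeholder values).
frames = [[0.1, 0.2, 0.3, 0.4],
          [0.0, -0.1, 0.5, 0.2],
          [0.3, 0.3, -0.2, 0.1]]

# Two parallel branches with different (assumed) output sizes: 8 and 6.
branches = [make_branch(4, 8, seed=0), make_branch(4, 6, seed=1)]
spliced = splice_features(frames, branches)
```

Each spliced vector has as many dimensions as the branch outputs combined (8 + 6 here), and every frame keeps its position in the sequence, so the spliced features can be fed to a downstream model frame by frame.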

Abstract

The invention relates to artificial intelligence and provides a speech recognition method supporting multi-language mixing. The method comprises the following steps: acquiring speech features of training data; acquiring high-dimensional features respectively corresponding to the speech features through at least two parallel networks; performing feature splicing on the high-dimensional features output by the parallel networks to obtain spliced features corresponding to the training data; training a neural network model on the spliced features until the neural network model converges to a preset range, thereby forming a speech recognition model; and performing speech recognition on a to-be-recognized multi-language mixed signal through the speech recognition model. The invention can improve the recognition accuracy of multi-language mixed speech.
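The "train until the model converges to a preset range" step can be sketched as a loop with a stopping criterion. The abstract does not define the preset range, the loss function, or the model, so this sketch assumes the criterion is a mean cross-entropy loss at or below a threshold, and it substitutes a small logistic-regression classifier for the neural network. The function name `train_until_converged`, its parameters, and the toy data are all hypothetical.

```python
import math


def train_until_converged(samples, labels, lr=0.5,
                          loss_threshold=0.1, max_epochs=10000):
    """Train a logistic-regression stand-in until mean loss <= loss_threshold.

    Returns (weights, bias, final_loss, epochs_run).
    """
    dim = len(samples[0])
    n = len(samples)
    weights = [0.0] * dim
    bias = 0.0
    loss = float("inf")
    for epoch in range(1, max_epochs + 1):
        grad_w = [0.0] * dim
        grad_b = 0.0
        loss = 0.0
        for x, y in zip(samples, labels):
            z = sum(w * xi for w, xi in zip(weights, x)) + bias
            p = 1.0 / (1.0 + math.exp(-z))
            p = min(max(p, 1e-12), 1.0 - 1e-12)  # keep log() finite
            loss -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
            for i, xi in enumerate(x):
                grad_w[i] += (p - y) * xi
            grad_b += p - y
        loss /= n
        # Assumed convergence test: stop once the loss is inside the preset range.
        if loss <= loss_threshold:
            return weights, bias, loss, epoch
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias, loss, max_epochs


# Toy stand-ins for spliced features (2-dimensional) with binary labels.
samples = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
labels = [1, 1, 0, 0]
weights, bias, loss, epochs = train_until_converged(samples, labels)
```

The `max_epochs` cap is a safeguard added for the sketch so the loop terminates even if the threshold is never reached; the source does not mention one.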

Description

Technical field

[0001] The present invention relates to the field of artificial intelligence, and in particular to a speech recognition method, device, electronic equipment and computer-readable storage medium supporting multilingual mixing.

Background technique

[0002] With the development of artificial intelligence technology, more and more intelligent hardware has entered people's lives, and voice input, as the most natural and convenient means of human-computer interaction, has gradually become the mainstream interaction method. The performance of speech recognition therefore directly determines the quality of interaction. At the same time, as globalization deepens, cultural and language exchanges between different regions are becoming more and more frequent. People's speech often mixes different languages, such as Mandarin-English, Cantonese-English, and Mandarin-Cantonese. The current speech recognition system has a good pe...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/00, G10L15/02, G10L15/06, G10L15/16
CPC: G10L15/02, G10L15/063, G10L15/16, G10L15/005
Inventor: 鄢楷强, 魏韬, 马骏, 王少军
Owner: PING AN TECH (SHENZHEN) CO LTD