
Multi-dialect accent Mandarin speech recognition model training method, device and equipment

A training method for Mandarin speech recognition models, applied in speech recognition, speech analysis, instruments, etc., that addresses the problem of low recognition accuracy.

Active Publication Date: 2021-03-12
北京远鉴信息技术有限公司
Cites: 5; Cited by: 0

AI Technical Summary

Problems solved by technology

[0005] The object of the present invention is to address the deficiencies of the above-mentioned prior art by providing a multi-dialect accent Mandarin speech recognition model training method, device and equipment that make full use of unlabeled speech data to strengthen model training, so that the final recognition accuracy is not held back by the shortage of labeled training samples in practical applications.


Examples


Embodiment Construction

[0073] To make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below in conjunction with the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them.

[0074] To make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments are described clearly and completely below in conjunction with the accompanying drawings. It should be understood that the appended figures are only for the purpose of illustration and description, and are not used to limit the protection scope of the present application. Additionally, it should be understood that t...


Abstract

The present application provides a multi-dialect accent Mandarin speech recognition model training method, device and equipment, which relate to the technical field of speech recognition. The method includes: obtaining training samples; training an initial acoustic model on labeled standard Mandarin speech data and an initial language model on text data; iteratively training the initial acoustic model on unlabeled dialect-accented Mandarin speech data to obtain a target acoustic model; training a temporary language model on the to-be-trained text recognized by the target acoustic model and the initial language model, and combining the temporary language model with the initial language model to obtain a target language model; and combining the target acoustic model and the target language model into a multi-dialect accent Mandarin speech recognition model. By iteratively training on a large amount of unlabeled dialect-accented Mandarin speech data, the method obtains a speech recognition model for Mandarin with multiple dialect accents and improves recognition accuracy for dialect-accented Mandarin.
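The two training stages named in the abstract can be illustrated with a small sketch. It rests on assumptions the abstract does not confirm: that the unlabeled audio is used through pseudo-labeling with a confidence threshold, and that the temporary and initial language models are combined by linear interpolation. The names `self_train`, `decode`, `retrain`, `min_conf`, `train_unigram_lm`, `interpolate_lms` and `weight` are hypothetical, and a unigram model stands in for whatever language model the patent actually uses.

```python
from collections import Counter
from typing import Callable, Dict, List, Tuple, TypeVar

Model = TypeVar("Model")


def self_train(
    model: Model,
    unlabeled: List[str],
    decode: Callable[[Model, str], Tuple[str, float]],
    retrain: Callable[[Model, List[Tuple[str, str]]], Model],
    rounds: int = 3,
    min_conf: float = 0.8,
) -> Model:
    """Iterative pseudo-labeling (an assumed reading of 'iterative training').

    Each round decodes the unlabeled utterances with the current model,
    keeps hypotheses whose confidence is at least min_conf, and retrains
    on those (utterance, hypothesis) pairs.
    """
    for _ in range(rounds):
        pseudo_labeled = []
        for utterance in unlabeled:
            text, confidence = decode(model, utterance)
            if confidence >= min_conf:
                pseudo_labeled.append((utterance, text))
        model = retrain(model, pseudo_labeled)
    return model


def train_unigram_lm(sentences: List[str]) -> Dict[str, float]:
    """Maximum-likelihood unigram model over whitespace-separated tokens."""
    counts = Counter(tok for s in sentences for tok in s.split())
    total = sum(counts.values())
    return {tok: c / total for tok, c in counts.items()}


def interpolate_lms(
    initial: Dict[str, float],
    temporary: Dict[str, float],
    weight: float = 0.5,
) -> Dict[str, float]:
    """Linear interpolation (assumed): P = weight * P_temp + (1 - weight) * P_init."""
    vocab = set(initial) | set(temporary)
    return {
        w: weight * temporary.get(w, 0.0) + (1.0 - weight) * initial.get(w, 0.0)
        for w in vocab
    }


# Toy data: the initial LM comes from clean text, the temporary LM from
# text that the recognizer produced for accented speech.
initial_lm = train_unigram_lm(["turn on the light", "turn off the light"])
temporary_lm = train_unigram_lm(["turn on the lamp"])
target_lm = interpolate_lms(initial_lm, temporary_lm, weight=0.5)
print(target_lm["lamp"])  # 0.125
```

With equal weights, a word seen only in the recognized text ("lamp") still receives probability mass in the target model, while the interpolated distribution continues to sum to 1. In a real system `decode` and `retrain` would wrap an acoustic-model toolkit; here they are left as caller-supplied functions.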

Description

Technical field

[0001] The present invention relates to the technical field of speech recognition, in particular to a multi-dialect Mandarin speech recognition model training method, device and equipment.

Background

[0002] As the performance of the Internet and mobile terminals improves, intelligent products based on speech recognition technology, such as voice dialogue robots, voice assistants and interactive tools, are increasingly common in industrial production and daily life. However, China is a country with many dialects. People living in different regions often speak Mandarin with a strong accent, which causes a mismatch with a standard Mandarin model and results in low speech recognition accuracy.

[0003] At present, multi-model research can be carried out based on temporal neural networks to realize the recognition of accented Mandarin. Among them, based on the multi-model recognition method, differe...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/00, G10L15/06, G10L15/26, G10L15/16, G10L15/14
CPC: G10L15/005, G10L15/063, G10L15/142, G10L15/16, G10L15/26
Inventor: 胡广宇
Owner: 北京远鉴信息技术有限公司