
Multi-dialect accent Mandarin speech recognition model training method, device, and equipment

A Mandarin speech recognition model training technology, applied in speech recognition, speech analysis, instruments, etc., that addresses the problem of low recognition accuracy.

Active Publication Date: 2021-01-15
北京远鉴信息技术有限公司
5 cites, 7 cited by

AI Technical Summary

Problems solved by technology

[0005] The object of the present invention is to address the deficiencies of the prior art described above by providing a multi-dialect accent Mandarin speech recognition model training method, device, and equipment that make full use of unlabeled speech data to strengthen model training, so that the final recognition accuracy is not held back by the scarcity of labeled training samples in practical applications.



Examples


Embodiment Construction

[0073] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention.

[0074] To make the purpose, technical solutions, and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. It should be understood that the appended figures are for illustration and description only and are not intended to limit the protection scope of the present application. Additionally, it should be understood that t...



Abstract

The invention provides a multi-dialect accent Mandarin speech recognition model training method, device, and equipment, and relates to the technical field of speech recognition. The method comprises: obtaining training samples; training an initial acoustic model on labeled standard Mandarin speech data, and training an initial language model on text data; iteratively training the initial acoustic model on unlabeled dialect-accented Mandarin speech data to obtain a target acoustic model; training a temporary language model on to-be-trained text recognized by the target acoustic model and the initial language model, and combining the temporary language model with the initial language model to obtain a target language model; and combining the target acoustic model and the target language model into a multi-dialect accent Mandarin speech recognition model. Because a large amount of unlabeled dialect-accented Mandarin speech data is used for iterative training, the resulting model improves recognition accuracy for dialect-accented Mandarin speech.
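The steps in the abstract can be illustrated with a deliberately tiny sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the "acoustic model" is a nearest-mean classifier over one-dimensional features, the confidence filter (`min_margin`) and the number of rounds are hypothetical, the language model is a unigram model, and the temporary and initial language models are combined by linear interpolation, which the abstract does not specify. Recognition here also uses only the toy acoustic model, whereas the abstract decodes with the acoustic and language models together.

```python
from collections import Counter


def train_acoustic(samples):
    """Toy stand-in for acoustic training: one mean feature per label."""
    sums, counts = Counter(), Counter()
    for feature, label in samples:
        sums[label] += feature
        counts[label] += 1
    return {label: sums[label] / counts[label] for label in counts}


def recognize(model, feature):
    """Return (best_label, margin); the margin is a crude confidence score."""
    ranked = sorted(model, key=lambda label: abs(model[label] - feature))
    best = ranked[0]
    if len(ranked) == 1:
        return best, float("inf")
    margin = abs(model[ranked[1]] - feature) - abs(model[best] - feature)
    return best, margin


def self_train(labeled, unlabeled, rounds=3, min_margin=0.5):
    """Iteratively retrain on labeled data plus confident pseudo-labels."""
    model = train_acoustic(labeled)  # initial acoustic model
    for _ in range(rounds):
        pseudo = []
        for feature in unlabeled:
            label, margin = recognize(model, feature)
            if margin >= min_margin:  # hypothetical confidence filter
                pseudo.append((feature, label))
        model = train_acoustic(labeled + pseudo)
    return model  # target acoustic model


def unigram_lm(tokens):
    """Maximum-likelihood unigram language model."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {token: count / total for token, count in counts.items()}


def interpolate(initial_lm, temporary_lm, weight=0.5):
    """Linear interpolation (assumed combination rule, not from the patent)."""
    vocab = set(initial_lm) | set(temporary_lm)
    return {
        token: weight * temporary_lm.get(token, 0.0)
        + (1.0 - weight) * initial_lm.get(token, 0.0)
        for token in vocab
    }


# Labeled "standard Mandarin" frames and unlabeled "accented" frames (made up).
labeled = [(0.0, "a"), (0.2, "a"), (2.0, "b"), (2.2, "b")]
unlabeled = [0.6, 0.8, 2.6, 2.8]

initial_am = train_acoustic(labeled)
target_am = self_train(labeled, unlabeled)

initial_lm = unigram_lm(["a", "b", "b", "b"])  # from text data
recognized = [recognize(target_am, f)[0] for f in unlabeled]
temporary_lm = unigram_lm(recognized)  # from recognized text
target_lm = interpolate(initial_lm, temporary_lm, weight=0.5)
```

In this toy data the accented frames are shifted relative to the labeled ones, so the class means move toward them after self-training: a frame with feature value 1.3 is assigned to "b" by `initial_am` and to "a" by `target_am`. A real system would replace each function with a full acoustic or language model trainer and would need a more careful confidence criterion.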

Description

Technical field

[0001] The present invention relates to the technical field of speech recognition, and in particular to a multi-dialect Mandarin speech recognition model training method, device, and equipment.

Background

[0002] As the performance of the Internet and mobile terminals improves, intelligent products based on speech recognition technology, such as voice dialogue robots, voice assistants, and interactive tools, are increasingly common in industrial production and daily life. However, China has many dialects, and people in different regions often speak Mandarin with a strong accent, which causes a mismatch with the standard Mandarin model and results in low speech recognition accuracy.

[0003] At present, multi-model research based on temporal neural networks can be used to recognize accented Mandarin. Among them, based on the multi-model recognition method, differe...


Application Information

IPC(8): G10L15/00, G10L15/06, G10L15/26, G10L15/16, G10L15/14
CPC: G10L15/005, G10L15/063, G10L15/142, G10L15/16, G10L15/26
Inventor: 胡广宇
Owner: 北京远鉴信息技术有限公司