Multi-dialect accent mandarin voice recognition model training method and device, and equipment

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech recognition model and Mandarin technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of low recognition accuracy

Active Publication Date: 2021-01-15

北京远鉴信息技术有限公司

View PDF5 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The object of the present invention is to, aiming at the deficiencies in the above-mentioned prior art, provide a kind of multi-dialect accent Mandarin speech recognition model training method, device and equipment, so that make full use of the speech data without label to strengthen the training of the model, avoid in practice Due to the limitation of the lack of labeled training sample data in the application, the final recognition accuracy is not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0073] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments.

[0074] In order to make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. It should be understood that the appended The figures are only for the purpose of illustration and description, and are not used to limit the protection scope of the present application. Additionally, it should be understood that t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a multi-dialect accent mandarin voice recognition model training method and device, and equipment, and relates to the technical field of language recognition. The method comprises the steps of obtaining a training sample; training by using standard mandarin voice data with labels to obtain an initial acoustic model, and training by using text data to obtain an initial language model; iteratively training an initial acoustic model based on the unlabeled dialect mandarin voice data to obtain a target acoustic model; training to obtain a temporary language model by using ato-be-trained text recognized by the target acoustic model and the initial language model, and combining the temporary language model with the initial language model to obtain a target language model;and combining the target acoustic model and the target language model into a multi-dialect accent mandarin speech recognition model. A large amount of dialect accent mandarin voice data which are notlabeled are used for iterative training, a multi-dialect accent mandarin voice recognition model is obtained, and the dialect accent mandarin voice recognition accuracy is improved.

Description

technical field [0001] The present invention relates to the technical field of speech recognition, in particular to a multi-dialect Mandarin speech recognition model training method, device and equipment. Background technique [0002] With the improvement of the performance of the Internet and other mobile terminals, intelligent products based on speech recognition technology are more and more popular in industrial production and daily life, such as voice dialogue robots, voice assistants, interactive tools, etc. However, my country is a country with multiple dialects. People living in various regions will have a large degree of accent when expressing in Mandarin, which will cause a mismatch with the standard Mandarin model and result in low speech recognition accuracy. [0003] At present, multi-model research can be carried out based on temporal neural networks to realize the recognition of Mandarin accents. Among them, based on the multi-model recognition method, differe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/00G10L15/06G10L15/26G10L15/16G10L15/14

CPCG10L15/005G10L15/063G10L15/142G10L15/16G10L15/26

Inventor胡广宇

Owner北京远鉴信息技术有限公司

Multi-dialect accent mandarin voice recognition model training method and device, and equipment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology