Training method for hybrid frequency acoustic recognition model and speech recognition method

A technique relating to mixed-frequency acoustic recognition models, applied in speech recognition, speech analysis, instruments, etc. It addresses the problems of limited model robustness and generalization, insufficient training data, and cumbersome updating and maintenance of recognition models, achieving good robustness and generalizability.

Active Publication Date: 2018-09-07
YUTOU TECH HANGZHOU

AI Technical Summary

Problems solved by technology

[0003] Although using a dedicated acoustic model can make the test environment match the training environment more closely, it also brings several disadvantages. First, updating and maintaining the recognition models becomes very cumbersome, since each dedicated acoustic model must be trained, updated, and maintained separately. Second, training each dedicated acoustic model separately leaves each model with insufficient training data, which also limits the models' robustness and generalization.




Embodiment Construction

[0049] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments, but not as a limitation of the present invention.

[0050] In view of the above problems in the prior art, a method for training a mixed-frequency acoustic recognition model is now provided. In this method, a unified mixed-frequency acoustic recognition model is formed through training, so as to acoustically recognize both a first speech signal with a first sampling frequency and a second speech signal with a second sampling frequency. In other words, this training method trains a single unified acoustic recognition model that recognizes speech data with different sampling frequencies, rather than training a dedicated acoustic recognition model for each kind of speech data, as in the traditional method.
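The excerpt does not detail how a single model can consume both sampling frequencies. As a minimal, hypothetical sketch (the duration-based framing, the function names, and the mapping of 8 kHz spectra into the lower bins of a 16 kHz bin layout are our illustrative assumptions, not the patented procedure), features from 8 kHz and 16 kHz audio could be brought into one fixed-size representation like this:

```python
import numpy as np

def power_spectrum_frames(signal, fs, frame_ms=25, hop_ms=10, n_fft=512):
    """Frame a signal with duration-based windows (so 8 kHz and 16 kHz
    audio yield the same number of frames per second) and return the
    per-frame power spectrum. Assumes len(signal) >= one frame."""
    frame_len = int(fs * frame_ms / 1000)
    hop = int(fs * hop_ms / 1000)
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    spec = np.fft.rfft(frames, n=n_fft, axis=1)        # zero-padded FFT
    return (np.abs(spec) ** 2) / n_fft                 # (n_frames, n_fft//2 + 1)

def to_shared_feature_space(signal, fs, fs_ref=16000, n_fft=512):
    """Map spectra from any sampling rate into the reference-rate bin layout.

    An 8 kHz signal only covers 0-4 kHz, i.e. the lower half of the 16 kHz
    band, so its bins are resampled onto the covered portion of the
    reference grid and the missing upper bins are zero-padded. One model
    then sees a fixed-size input regardless of the source sampling rate.
    """
    ps = power_spectrum_frames(signal, fs, n_fft=n_fft)
    n_ref_bins = n_fft // 2 + 1
    if fs == fs_ref:
        return ps
    covered = int(n_ref_bins * fs / fs_ref)            # bins this bandwidth reaches
    out = np.zeros((ps.shape[0], n_ref_bins))
    src = np.linspace(0.0, 1.0, ps.shape[1])
    dst = np.linspace(0.0, 1.0, covered)
    for t in range(ps.shape[0]):
        out[t, :covered] = np.interp(dst, src, ps[t])
    return out
```

With this mapping, a 1 kHz tone recorded at either rate produces a spectral peak near the same reference bin, which is the property a unified model needs.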

[0051] The above training method, as shown in Figure 1, includes:

[0052] Step S1, ...



Abstract

The invention discloses a training method for a hybrid frequency acoustic recognition model and a speech recognition method, and belongs to the technical field of speech recognition. The training method comprises the steps of obtaining a first type of speech features of a first speech signal, and processing the first type of speech features so as to obtain corresponding first speech training data; obtaining a first type of speech features of a second speech signal, and processing the first type of speech features so as to obtain corresponding second speech training data; obtaining a second type of speech features of the first speech signal and a second type of speech features of the second speech signal according to a power spectrum; forming a preliminary recognition model of the hybrid frequency acoustic recognition model according to pre-training of the first speech signal and the second speech signal; and performing supervised parameter training on the preliminary recognition model according to the first speech training data, the second speech training data and the second type of speech features so as to form the hybrid frequency acoustic recognition model. The technical scheme has the beneficial effect that the recognition model has good robustness and generalization.
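The pipeline in the abstract (pool the features of both signals, pre-train a preliminary recognition model, then perform supervised parameter training) can be sketched roughly as follows. The linear autoencoder for pretraining and the softmax output layer for fine-tuning are illustrative stand-ins, since this excerpt does not specify the model types; all names and hyperparameters are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def pretrain(X, hidden, epochs=100, lr=0.01):
    """Preliminary-model analogue: unsupervised pretraining on the pooled
    features of the first and second speech signals, here a linear
    autoencoder (an illustrative stand-in, not the patent's model)."""
    d = X.shape[1]
    W = rng.normal(scale=0.1, size=(d, hidden))    # encoder weights
    V = rng.normal(scale=0.1, size=(hidden, d))    # decoder weights
    for _ in range(epochs):
        H = X @ W
        E = H @ V - X                              # reconstruction error
        V -= lr * (H.T @ E) / len(X)
        W -= lr * (X.T @ (E @ V.T)) / len(X)
    return W

def supervised_finetune(X, y, W, classes, epochs=500, lr=0.1):
    """Supervised-parameter-training analogue: fit a softmax output layer
    on top of the pretrained encoder, using the labelled pooled data."""
    H = X @ W
    U = np.zeros((H.shape[1], classes))
    Y = np.eye(classes)[y]
    for _ in range(epochs):
        Z = H @ U
        Z -= Z.max(axis=1, keepdims=True)          # numerical stability
        P = np.exp(Z)
        P /= P.sum(axis=1, keepdims=True)
        U -= lr * (H.T @ (P - Y)) / len(X)
    return U

# Toy stand-in for pooled "mixed-frequency" training data with labels.
X1 = rng.normal(loc=-1.0, size=(60, 8))   # e.g. features from 8 kHz data
X2 = rng.normal(loc=+1.0, size=(60, 8))   # e.g. features from 16 kHz data
X = np.vstack([X1, X2])
y = np.array([0] * 60 + [1] * 60)

W = pretrain(X, hidden=4)                  # preliminary recognition model
U = supervised_finetune(X, y, W, classes=2)
pred = np.argmax((X @ W) @ U, axis=1)
```

The key structural point matching the abstract is that pretraining sees the pooled data from both sampling frequencies, and the supervised stage then adjusts the parameters of that single preliminary model.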

Description

technical field

[0001] The invention relates to the technical field of speech recognition, and in particular to a training method for a mixed-frequency acoustic recognition model and a speech recognition method.

Background technique

[0002] In the prior art, owing to differences in usage environments, data transmission requirements, and the technical means used for transmission, there are large differences between recording devices and storage methods, the main difference being the sampling frequency. For example, voice data with a sampling frequency of 8 kHz usually comes from telephone recordings, so in traditional speech recognition methods the telephone data is used specifically to train an acoustic recognition model for 8 kHz data. Correspondingly, speech data with a sampling frequency of 16 kHz usually comes from desktop recordings, and the desktop data is likewise used to train an acoustic recognition model for 16 kHz data. Therefore, the trad...
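To make the background concrete: with a fixed FFT size, the same acoustic content lands in different frequency bins depending on the sampling rate, which is one reason traditional systems trained a dedicated model per rate. A small illustration (the helper name is ours, not from the patent):

```python
import numpy as np

def peak_bin(fs, n_fft=256, f0=1000.0):
    """FFT bin in which a pure f0 tone lands for a given sampling rate."""
    t = np.arange(n_fft) / fs
    spec = np.abs(np.fft.rfft(np.sin(2 * np.pi * f0 * t) * np.hanning(n_fft)))
    return int(np.argmax(spec))

# The same 1 kHz tone occupies different bins at the two rates:
# bin = f0 * n_fft / fs  ->  32 at 8 kHz, but 16 at 16 kHz.
print(peak_bin(8000), peak_bin(16000))  # -> 32 16
```

A model trained only on 8 kHz spectra therefore sees inputs whose bin-to-frequency mapping differs from 16 kHz spectra, which is the mismatch the unified mixed-frequency model is meant to overcome.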

Claims


Application Information

Patent Type & Authority: Application (China)
IPC(8): G10L15/02
CPC: G10L15/02; G10L15/063; G10L25/30; G10L15/142; G10L15/16; G10L25/21; G10L25/24
Inventor: 范利春
Owner: YUTOU TECH HANGZHOU