Training method and device of voice processing model, voice recognition method, system and device

A speech processing and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc. It addresses the low robustness of methods that ignore the uncertainty of the acoustic environment, and has the effect of improving robustness, reducing distribution differences, and improving processing power.

Active Publication Date: 2019-12-20

TENCENT TECH (SHENZHEN) CO LTD

6 Cites, 29 Cited by

AI Technical Summary

Problems solved by technology

[0004] Prior-art speech enhancement methods usually design a filter to perform the enhancement. Such filters typically assume that the acoustic environment is stable and ignore the uncertainty of the acoustic environment in real scenes. They therefore cannot be applied to scenes with poor acoustic environment stability, and their robustness is low.

Embodiment Construction

[0048] To make the object, technical solution and beneficial effects of the present invention clearer, the present invention is described in further detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit it.

[0049] In order to facilitate the understanding of the embodiments of the present invention, several concepts are briefly introduced below:

[0050] Artificial Intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technique of computer science that attempts to understand the nature of intelligence and ...

Abstract

The application provides a training method and device for a voice processing model, and a voice recognition method, system and device, and relates to the technical field of artificial intelligence and machine learning. The training method of the voice processing model comprises the following steps: performing iterative joint training on a voice enhancement model, a voice recognition model and a voice discrimination model; for each training iteration, obtaining a joint loss function of the voice enhancement model, the voice recognition model and the voice discrimination model, and a voice discrimination loss function of the voice discrimination model; and, after each training iteration, adjusting the model parameters of the voice enhancement model and/or the voice recognition model according to the joint loss function, and adjusting the model parameters of the voice discrimination model according to the voice discrimination loss function. Training continues until the joint loss function and the voice discrimination loss function simultaneously satisfy a convergence condition, at which point the trained voice processing model is obtained. The robustness of the voice processing model is thereby improved.
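
The control flow of the training steps in the abstract can be illustrated with a minimal Python sketch. Everything concrete in it is an assumption made for illustration: the three models are reduced to single scalar parameters (`enh`, `rec`, `disc`), the two losses are toy quadratics with no acoustic meaning, the updates are plain gradient descent, and the convergence condition is taken to be "both losses change by less than `tol` between iterations". The patent abstract does not specify any of these; only the alternating structure (joint loss drives the enhancement and recognition parameters, the discrimination loss drives the discrimination parameters, and training stops when both converge together) comes from the text.

```python
def joint_train(lr=0.1, tol=1e-10, max_iters=1000):
    """Toy alternating-update loop; all losses and parameters are hypothetical."""
    # Scalar stand-ins for the parameters of the three models (assumption).
    enh, rec, disc = 0.0, 0.0, 0.0
    prev_joint = prev_disc = None

    for iteration in range(1, max_iters + 1):
        # Toy joint loss over all three models (assumed form, not the patent's).
        joint_loss = (enh - 1.0) ** 2 + (rec - 2.0) ** 2 + 0.1 * (enh - disc) ** 2
        # Toy discrimination loss of the discrimination model (assumed form).
        disc_loss = (disc - enh) ** 2

        # Assumed convergence condition: both losses stop changing simultaneously.
        if (prev_joint is not None
                and abs(prev_joint - joint_loss) < tol
                and abs(prev_disc - disc_loss) < tol):
            return {"converged": True, "iterations": iteration,
                    "enh": enh, "rec": rec, "disc": disc}
        prev_joint, prev_disc = joint_loss, disc_loss

        # Gradients, all evaluated at the current parameter values.
        grad_enh = 2.0 * (enh - 1.0) + 0.2 * (enh - disc)   # d(joint_loss)/d(enh)
        grad_rec = 2.0 * (rec - 2.0)                        # d(joint_loss)/d(rec)
        grad_disc = 2.0 * (disc - enh)                      # d(disc_loss)/d(disc)

        # Enhancement and recognition parameters follow the joint loss;
        # the discrimination parameter follows only its own loss.
        enh -= lr * grad_enh
        rec -= lr * grad_rec
        disc -= lr * grad_disc

    return {"converged": False, "iterations": max_iters,
            "enh": enh, "rec": rec, "disc": disc}
```

Calling `joint_train()` returns a dictionary with the final toy parameters and a `converged` flag. In a real system the scalar updates would be replaced by optimizer steps on neural network weights, and the loss definitions would come from the full patent description rather than from this sketch.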

Description

Technical field

[0001] The embodiments of the present invention relate to the field of information technology, and in particular to a training method for a speech processing model and to a speech recognition method, system and device.

Background technique

[0002] With the development of communication technology and the popularization of smart terminals, network communication tools have become one of the main means of public communication. Because voice information is convenient to operate and transmit, it has become the main kind of information transmitted by these tools. Using them also involves converting voice information into text, which is the task of voice recognition technology.

[0003] Speech recognition technology enables machines to convert voice information into corresponding text or commands through a process of recognition and understanding. Usual...

Application Information

IPC(8): G10L15/14, G10L15/16, G10L21/0208

CPC: G10L15/144, G10L15/16, G10L21/0208

Inventor: 梁山, 刘斌, 李冠君, 刘文举, 于蒙, 陈联武

Owner: TENCENT TECH (SHENZHEN) CO LTD
