Training method and device of voice processing model, voice recognition method, system and device

A speech processing and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc. It addresses the low robustness of methods that ignore the uncertainty of the acoustic environment, and aims to improve robustness, reduce distribution differences, and improve processing capability.

Active Publication Date: 2019-12-20
TENCENT TECH (SHENZHEN) CO LTD
Cites: 6; Cited by: 29

AI Technical Summary

Problems solved by technology

[0004] Prior-art speech enhancement methods usually design a filter to perform the enhancement. Such filters typically assume that the acoustic environment is stable and ignore the uncertainty of the acoustic environment in real scenes. They therefore cannot be applied to scenes where the acoustic environment is unstable, and their robustness is low.



Examples


Embodiment Construction

[0048] To make the object, technical solution and beneficial effects of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit it.

[0049] In order to facilitate the understanding of the embodiments of the present invention, several concepts are briefly introduced below:

[0050] Artificial Intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technique of computer science that attempts to understand the nature of intelligence and ...


Abstract

The application provides a training method and device for a voice processing model, and a voice recognition method, system and device, and relates to the field of information technology related to artificial intelligence and machine learning. The training method comprises the following steps: performing iterative joint training on a voice enhancement model, a voice recognition model and a voice discrimination model; for each training iteration, obtaining a joint loss function of the voice enhancement model, the voice recognition model and the voice discrimination model, and a voice discrimination loss function of the voice discrimination model; and, after each training iteration, adjusting the model parameters of the voice enhancement model and/or the voice recognition model according to the joint loss function, and adjusting the model parameters of the voice discrimination model according to the voice discrimination loss function. Training continues until the joint loss function and the voice discrimination loss function simultaneously satisfy a convergence condition, at which point the trained voice processing model is obtained. The robustness of the voice processing model is thereby improved.
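The control flow described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the three models are reduced to single scalar parameters, `joint_loss` and `discrimination_loss` are placeholder quadratics (the abstract does not give the actual loss formulas), and "convergence" is taken to mean that each loss changes by less than a tolerance between iterations. Only the overall loop structure follows the abstract: update the enhancement and recognition parameters from the joint loss, update the discrimination parameters from the discrimination loss, and stop when both losses converge at the same time.

```python
from dataclasses import dataclass


@dataclass
class Params:
    """Scalar stand-ins for the three models' parameters (hypothetical)."""
    enhance: float       # voice enhancement model
    recognize: float     # voice recognition model
    discriminate: float  # voice discrimination model


def joint_loss(p: Params) -> float:
    # Placeholder quadratic; the source does not give the actual formula.
    return ((p.enhance - 1.0) ** 2 + (p.recognize - 2.0) ** 2
            + 0.5 * (p.enhance - p.discriminate) ** 2)


def discrimination_loss(p: Params) -> float:
    # Placeholder quadratic; the source does not give the actual formula.
    return (p.discriminate - p.enhance) ** 2


def train_jointly(p: Params, lr: float = 0.1, tol: float = 1e-9,
                  max_iters: int = 1000):
    """Alternate the two updates until both losses stop changing."""
    prev_joint = prev_disc = float("inf")
    for step in range(1, max_iters + 1):
        # Adjust enhancement and recognition parameters from the joint loss
        # (hand-derived gradients of the placeholder joint_loss).
        grad_enhance = 2.0 * (p.enhance - 1.0) + (p.enhance - p.discriminate)
        grad_recognize = 2.0 * (p.recognize - 2.0)
        p.enhance -= lr * grad_enhance
        p.recognize -= lr * grad_recognize

        # Adjust the discrimination parameter from the discrimination loss.
        grad_disc = 2.0 * (p.discriminate - p.enhance)
        p.discriminate -= lr * grad_disc

        joint, disc = joint_loss(p), discrimination_loss(p)
        # Stop only when both losses satisfy the convergence test together.
        if abs(joint - prev_joint) < tol and abs(disc - prev_disc) < tol:
            return p, step, True
        prev_joint, prev_disc = joint, disc
    return p, max_iters, False


params, steps, converged = train_jointly(Params(0.0, 0.0, 0.0))
```

In a real system each scalar would be a full network and the gradients would come from an automatic differentiation library; the sketch only shows how the two loss functions drive separate parameter groups within one training loop.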

Description

Technical field

[0001] The embodiments of the present invention relate to the field of information technology, and in particular to a training method for a speech processing model, a speech recognition method, system and device.

Background

[0002] With the development of communication technology and the popularization of smart terminals, network communication tools have become one of the main means of public communication. Because voice information is convenient to produce and transmit, it has become the main type of information carried by these tools. Using these tools also involves converting voice information into text, which is the task of speech recognition technology.

[0003] Speech recognition technology is a technology that enables machines to convert voice information into corresponding text or commands through a process of recognition and understanding. Usual...


Application Information

IPC(8): G10L15/14, G10L15/16, G10L21/0208
CPC: G10L15/144, G10L15/16, G10L21/0208
Inventor: 梁山, 刘斌, 李冠君, 刘文举, 于蒙, 陈联武
Owner: TENCENT TECH (SHENZHEN) CO LTD