Voice separation model training method and device, storage medium and computer equipment

A technology of speech separation and model training, applied in the computer field, can solve the problem of high cost

Active Publication Date: 2020-06-05
TENCENT TECH (SHENZHEN) CO LTD
View PDF2 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Based on this, it is necessary to provide a voice separation model training method, device, storage medium and computer equipment for the technical problem of high cost of existing model training methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice separation model training method and device, storage medium and computer equipment
  • Voice separation model training method and device, storage medium and computer equipment
  • Voice separation model training method and device, storage medium and computer equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0031] Artificial Intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technique of computer science that attempts to understand the nature of intelligence and produce a new kind of intelligent machine that can respond in a similar way to human intelligence. Artificial intelligence is t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a voice separation model training method and device, a computer readable storage medium and a computer device. The method comprises the steps of obtaining a first audio and asecond audio; wherein the first audio comprises a target audio and correspondingly has a marked audio; wherein the second audio comprises noise audio; obtaining a coding model, an extraction model andan initial estimation model; performing unsupervised training on the coding model, the extraction model and the estimation model according to the second audio, and adjusting model parameters of the extraction model and the estimation model; performing supervised training on the coding model and the extraction model according to the first audio and the annotated audio corresponding to the first audio, and adjusting model parameters of the coding model; and continuously carrying out unsupervised training and supervised training, so that the unsupervised training and the supervised training arecarried out in an overlapping manner, and ending the training until a training stopping condition is met. According to the scheme provided by the invention, the model training cost can be reduced.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a speech separation model training method, device, storage medium and computer equipment. Background technique [0002] Speech, as the acoustic representation of language, is one of the most natural and effective ways for humans to communicate information. In the process of voice communication, people will inevitably be disturbed by environmental noise or other speakers. These interferences make the collected audio not pure speaker's voice. In recent years, many speech separation models have been trained to separate target speaker speech from mixed audio. However, current speech separation models are usually trained in a supervised learning manner, which requires manual collection or labeling of high-quality training samples, and such a training process is expensive. Contents of the invention [0003] Based on this, it is necessary to provide a speech separation ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0308
CPCG10L21/0308Y02T10/40G10L15/05G10L15/063G10L15/16
Inventor 王珺林永业苏丹俞栋
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products