Human voice extraction method, human voice extraction device and related products

A human voice extraction technology applied in the field of electronic audio signal processing. It addresses the inability of existing methods to extract a pure human voice from mixed audio and achieves cleaner human voice extraction.

Active Publication Date: 2019-08-02
TENCENT MUSIC ENTERTAINMENT TECH SHENZHEN CO LTD

AI Technical Summary

Problems solved by technology

The prior art cannot extract a completely pure human voice from mixed audio.


Embodiment Construction

[0024] The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.

[0025] The terms "first", "second", "third" and "fourth" in the specification, claims and drawings of the present application are used to distinguish different objects, rather than to describe a specific order. Furthermore, the terms "include" and "have", as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, product or device comprising a series of steps or units is not limited to the listed ste...


Abstract

The embodiments of the application provide a human voice extraction method. The method comprises the steps of performing vocal extraction on mixed audio based on a human voice extraction model to obtain intermediate audio, wherein the intermediate audio comprises a human voice audio frame and a non-human voice audio frame; filtering out the non-human voice audio frame in the intermediate audio based on a human voice filtering model to obtain human voice audio. After the human voice extraction method is adopted, the pure human voice audio can be extracted, so that the user experience is improved.
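The two stages described in the abstract can be sketched as follows. This is a minimal illustration, not the patent's implementation: the names `split_frames`, `extract_vocals`, `toy_extraction_model` and `toy_filter_model` are hypothetical, the two models are replaced by toy stand-ins (an identity function and an energy threshold), and "filtering out" is assumed to mean dropping the non-voice frames.

```python
from typing import Callable, List, Sequence

Frame = List[float]


def split_frames(samples: Sequence[float], frame_len: int) -> List[Frame]:
    """Cut a sample sequence into consecutive frames (the last may be shorter)."""
    return [list(samples[i:i + frame_len]) for i in range(0, len(samples), frame_len)]


def extract_vocals(
    mixed: Sequence[float],
    extraction_model: Callable[[Sequence[float]], List[float]],
    filter_model: Callable[[Frame], bool],
    frame_len: int,
) -> List[float]:
    """Two-stage pipeline: extract intermediate audio, then drop non-voice frames."""
    # Stage 1: the extraction model yields intermediate audio that may still
    # contain non-voice frames (e.g. a prelude mistaken for voice).
    intermediate = extraction_model(mixed)
    # Stage 2: the filtering model labels each frame; only voice frames are kept.
    frames = split_frames(intermediate, frame_len)
    voice_frames = [frame for frame in frames if filter_model(frame)]
    return [sample for frame in voice_frames for sample in frame]


# Toy stand-ins (assumptions): a real system would use trained models here.
def toy_extraction_model(mixed: Sequence[float]) -> List[float]:
    return list(mixed)  # identity: pretend the mix is already the intermediate audio


def toy_filter_model(frame: Frame) -> bool:
    energy = sum(s * s for s in frame) / len(frame)
    return energy > 0.01  # treat low-energy frames as non-voice


mixed_audio = [0.5, -0.5, 0.0, 0.0, 0.3, 0.3]
voice = extract_vocals(mixed_audio, toy_extraction_model, toy_filter_model, frame_len=2)
```

Dropping frames shortens the output. If the voice track has to stay aligned with the original mix, zeroing the rejected frames instead would be a reasonable alternative.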

Description

Technical field

[0001] This application relates to the field of electronic audio signal processing, in particular to a human voice extraction method, a human voice extraction device and related products.

Background

[0002] Human voice extraction technology is a widely studied audio processing method, and there are many categories of existing human voice extraction algorithms. However, because of limitations in the algorithms themselves or in the training samples, no existing vocal extraction algorithm can cleanly extract the human voice. For example, in the prior art, the human voice is extracted from mixed audio using the Hourglass model. Although the extracted result is relatively clean and highly recognizable, some instrumental passages, such as preludes and interludes, are misidentified as human voice and retained in error. Therefore, it is impossible to extract completely pure human voice from mixed...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G10L21/0272, G10L21/0208, G10L17/04, G10L15/04, G10L25/18, G10L25/78
CPC: G10L15/04, G10L17/04, G10L21/0208, G10L21/0272, G10L25/18, G10L25/78, Y02T10/40
Inventor: 王征韬
Owner: TENCENT MUSIC ENTERTAINMENT TECH SHENZHEN CO LTD