Human voice separation method and device, user terminal and storage medium

A separation device and user terminal technology, applied in speech analysis, electro-acoustic musical instruments, instruments, etc., can solve the problems of audio quality degradation and audio auditory effect, achieve good auditory effect, save time and labor costs, and accuracy high effect

Inactive Publication Date: 2019-08-23
成都嗨翻屋科技有限公司
View PDF8 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Currently, extracting karaoke music is done during the recording process, which requires a lot of manual work and time
[0003] Most of the existing deep learning technologies for human voice separation im

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Human voice separation method and device, user terminal and storage medium
  • Human voice separation method and device, user terminal and storage medium
  • Human voice separation method and device, user terminal and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. The components of the embodiments of the invention generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations.

[0071] Accordingly, the following detailed description of the embodiments of the invention provided in the accompanying drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art wi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a human voice separation method and device, a user terminal and a storage medium, and relates to the technical field of audio processing. The human voice separation method comprises the steps: a sampled to-be-separated audio file sound channel is separated to obtain an initial waveform sequence; the initial waveform sequence is subjected to discrete Fourier transform to obtain an initial two-dimensional array; the initial two-dimensional array is subjected to module obtaining to obtain an initial speech spectrum; the initial two-dimensional array is subjected to phase obtaining to obtain an initial phase image; the initial speech spectrum is guided into a convolutional neural network model to obtain a mask through calculation; the mask and the initial phase image are subjected to first point multiplication operation to obtain a human voice source speech spectrum; the human voice source speech spectrum and the initial phase image are subjected to second point multiplication operation; the result of the second point multiplication operation is subjected to inverse discrete Fourier transform to obtain single human voice source audio waveforms; and the single human voice source audio waveforms are spliced to obtain stereo audio. According to the human voice separation method and device, the user terminal and the storage medium, automatic human voice separation of the audio can be achieved.

Description

technical field [0001] The present invention relates to the technical field of audio processing, in particular to a human voice separation method, device, user terminal and storage medium. Background technique [0002] Usually for popular music, the human voice is the main theme, and the accompaniment is the rhythm of the music. Since the human voice is usually accompanied by background music, vocal separation is a challenging task. A prerequisite for musical instrument classification, and these techniques can be used in applications such as recommendation systems and label classification. One of the commercial applications of the vocal separation system is karaoke, meaning a musical track without the vocals. Karaoke music helps music lovers learn to sing an existing piece or sing it in a concert. Currently, extracting karaoke music is done during the recording process, which requires a lot of manual operations and time. [0003] Most of the existing deep learning technol...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L21/028G10L25/30G10H1/36
CPCG10H1/361G10L21/028G10L25/30
Inventor 尹学渊江天宇陈洪宇梁超
Owner 成都嗨翻屋科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products