Method and device for human voice enhancement of audio signal

An audio-signal human voice enhancement technology, applied in the multimedia field, that addresses the problem that existing human voice enhancement algorithms are too complex to meet real-time audio requirements. It improves human voice dialogue enhancement with low data processing complexity and meets the requirements of real-time transmission.

Active Publication Date: 2022-04-26
BEIJING DAJIA INTERNET INFORMATION TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] The present disclosure provides a method and device for human voice enhancement of an audio signal, to at least solve the problem that human voice enhancement in the prior art has high algorithm complexity and therefore cannot meet real-time audio requirements, so as to effectively improve human voice dialogue enhancement in audio files with low data processing complexity.



Detailed Description of Embodiments

[0028] To enable persons of ordinary skill in the art to better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure are described clearly and completely below with reference to the accompanying drawings.

[0029] It should be noted that the terms "first" and "second" in the specification and claims of the present disclosure and in the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances, such that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the following exemplary examples do not represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatuses and methods consi...


Abstract

The disclosure relates to a method and device for human voice enhancement of an audio signal in the field of multimedia technology, and addresses the technical problem of enhancing human voice dialogue during real-time audio file transmission with relatively low data processing complexity. The method includes: performing windowing and framing on the original audio signal to obtain a plurality of audio signal segments; obtaining, from the plurality of audio signal segments, fundamental frequency information and a plurality of characteristic parameters of each audio signal segment, wherein the characteristic parameters of each audio signal segment include characteristic parameters of the multiple Bark subbands into which the segment is divided on the amplitude spectrum; enhancing each audio signal segment in sequence according to a neural network algorithm to obtain a human voice enhancement signal for each audio signal segment; and overlap-adding the human voice enhancement signals of the audio signal segments in sequence to obtain the target enhancement signal.
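The steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the square-root Hann window, the 50% overlap, the Zwicker-style Bark approximation, the use of band energies as the per-band feature, and the names `split_frames`, `bark_band_energies`, `predict_band_gains` and `enhance` are all assumptions made for illustration. The neural network is replaced by a hypothetical stand-in that returns unity gains, and the fundamental frequency feature is omitted.

```python
import cmath
import math

def sqrt_hann(n):
    """Periodic square-root Hann window; its square sums to 1 at 50% overlap."""
    return [math.sqrt(0.5 - 0.5 * math.cos(2 * math.pi * i / n)) for i in range(n)]

def split_frames(signal, frame_len):
    """Window and frame the signal with 50% overlap (frame_len must be even)."""
    hop = frame_len // 2
    # Zero-pad so that every input sample is covered by exactly two frames.
    padded = [0.0] * hop + list(signal) + [0.0] * frame_len
    win = sqrt_hann(frame_len)
    return [[padded[s + i] * win[i] for i in range(frame_len)]
            for s in range(0, len(padded) - frame_len + 1, hop)]

def overlap_add(frames, frame_len, out_len):
    """Apply the synthesis window and overlap-add the frames."""
    hop = frame_len // 2
    win = sqrt_hann(frame_len)
    out = [0.0] * (hop * (len(frames) - 1) + frame_len)
    for k, frame in enumerate(frames):
        for i, value in enumerate(frame):
            out[k * hop + i] += value * win[i]
    return out[hop:hop + out_len]  # drop the leading padding

def dft(x):
    """Naive O(n^2) discrete Fourier transform, adequate for short frames."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(spec):
    """Naive inverse DFT; returns the real part of each sample."""
    n = len(spec)
    return [(sum(spec[k] * cmath.exp(2j * math.pi * k * t / n)
                 for k in range(n)) / n).real for t in range(n)]

def hz_to_bark(freq):
    """Zwicker-style approximation of the Bark scale (an assumed choice)."""
    return 13.0 * math.atan(0.00076 * freq) + 3.5 * math.atan((freq / 7500.0) ** 2)

def bark_band_of_bin(k, n, sample_rate):
    """Bark subband index of DFT bin k; negative-frequency bins are mirrored."""
    return int(hz_to_bark(min(k, n - k) * sample_rate / n))

def bark_band_energies(spec, sample_rate):
    """Sum the squared amplitude spectrum inside each Bark subband."""
    n = len(spec)
    energies = {}
    for k in range(n // 2 + 1):
        band = bark_band_of_bin(k, n, sample_rate)
        energies[band] = energies.get(band, 0.0) + abs(spec[k]) ** 2
    return energies

def predict_band_gains(features):
    """Hypothetical stand-in for the neural network: unity gain per subband."""
    return {band: 1.0 for band in features}

def enhance(signal, frame_len=32, sample_rate=16000):
    """Frame, compute Bark features, apply per-band gains, then overlap-add."""
    out_frames = []
    for frame in split_frames(signal, frame_len):
        spec = dft(frame)
        gains = predict_band_gains(bark_band_energies(spec, sample_rate))
        n = len(spec)
        scaled = [spec[k] * gains[bark_band_of_bin(k, n, sample_rate)]
                  for k in range(n)]
        out_frames.append(idft(scaled))
    return overlap_add(out_frames, frame_len, len(signal))
```

Because the placeholder gains are all 1.0, `enhance` should return the input signal up to floating-point error, which is a convenient check that the framing and overlap-add stages are consistent. A trained model would replace `predict_band_gains` and attenuate subbands dominated by noise.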

Description

Technical field

[0001] The present disclosure relates to the field of multimedia technology, and in particular to a method and device for human voice enhancement of an audio signal.

Background

[0002] With the development of multimedia technology, live video and video sharing have become fashionable and common forms of entertainment. However, in addition to human voices, videos usually contain obvious noise, such as wind, ringtones, or traffic horns. Especially when users record videos outdoors or broadcast live video, the external environment is usually noisy, and this noise makes it difficult for users to hear the human voice dialogue in the video clearly, which seriously degrades the listening experience.

[0003] The current technical solutions for vocal dialogue enhancement use a Recurrent Neural Network (RNN) or Convolutional Neural Network (CNN) for deep learning. Although better vocal enhancement effects can be obtained, the network Th...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L21/0264, G10L21/0316, G10L21/0324
CPC: G10L21/0316, G10L21/0324, G10L21/0264
Inventor: 邓峰, 姜涛, 李岩
Owner: BEIJING DAJIA INTERNET INFORMATION TECH CO LTD