Method and device for human voice enhancement of audio signal

This audio signal and human voice technology, applied in the multimedia field, addresses the problem that existing human voice enhancement has high algorithm complexity and cannot satisfy real-time audio. It improves the effect of human voice dialogue, lowers data processing complexity, and meets the requirements of real-time transmission.

Active Publication Date: 2022-04-26

BEIJING DAJIA INTERNET INFORMATION TECH CO LTD

Cites: 10 | Cited by: 0

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The present disclosure provides a method and device for human voice enhancement of an audio signal, to at least solve the problem in the prior art that human voice enhancement has high algorithm complexity and cannot satisfy real-time audio, so that the human voice dialogue in audio files can be effectively enhanced with low data processing complexity.

Method used

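The method performs windowing and framing on the original audio signal to obtain a plurality of audio signal segments, obtains fundamental frequency information and a plurality of characteristic parameters (including Bark-subband parameters on the amplitude spectrum) for each segment, enhances each segment in turn with a neural network algorithm, and overlap-adds the enhanced segments to obtain the target enhancement signal.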
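The disclosure derives characteristic parameters by dividing each audio signal segment's amplitude spectrum into Bark subbands. The following is a minimal sketch of that idea, not the patented implementation. It assumes the per-band feature is the summed squared amplitude, uses the Zwicker and Terhardt approximation for the Hz-to-Bark mapping, and computes the spectrum with a naive DFT for self-containment. The function names `hz_to_bark`, `amplitude_spectrum`, and `bark_band_energies` are hypothetical.

```python
import cmath
import math


def hz_to_bark(freq_hz):
    """Zwicker-Terhardt approximation of the Bark scale (an assumed choice)."""
    return 13.0 * math.atan(0.00076 * freq_hz) + 3.5 * math.atan((freq_hz / 7500.0) ** 2)


def amplitude_spectrum(frame):
    """One-sided amplitude spectrum via a naive O(N^2) DFT."""
    n = len(frame)
    return [
        abs(sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2 + 1)
    ]


def bark_band_energies(frame, sample_rate):
    """Sum squared amplitudes of the DFT bins that fall in each integer Bark band."""
    n = len(frame)
    amps = amplitude_spectrum(frame)
    n_bands = int(hz_to_bark(sample_rate / 2.0)) + 1
    energies = [0.0] * n_bands
    for k, amp in enumerate(amps):
        band = int(hz_to_bark(k * sample_rate / n))  # Bark band index of bin k
        energies[band] += amp * amp
    return energies


# Usage: a 1 kHz tone sampled at 16 kHz concentrates its energy in one band.
tone = [math.sin(2.0 * math.pi * 1000.0 * t / 16000.0) for t in range(64)]
features = bark_band_energies(tone, 16000)
```

Grouping bins into a couple of dozen bands gives the network far fewer inputs than the full spectrum, which is consistent with the low-complexity goal stated above. A production system would use an FFT rather than the naive DFT shown here.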

Embodiment Construction

[0028] To enable persons of ordinary skill in the art to better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure will be described clearly and completely below in conjunction with the accompanying drawings.

[0029] It should be noted that the terms "first" and "second" in the specification and claims of the present disclosure and in the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances, such that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the following exemplary examples do not represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatuses and methods consi...

Abstract

The disclosure relates to a method and device for human voice enhancement of an audio signal, belongs to the field of multimedia technology, and can solve the technical problem of enhancing human voice dialogue during real-time audio file transmission with relatively low data processing complexity. The method includes: performing windowing and framing on the original audio signal to obtain a plurality of audio signal segments; obtaining fundamental frequency information and a plurality of characteristic parameters of each audio signal segment from the plurality of audio signal segments, wherein the characteristic parameters of each audio signal segment include parameters obtained by dividing the segment's amplitude spectrum into multiple Bark subbands; enhancing each audio signal segment in turn according to a neural network algorithm to obtain a human voice enhancement signal for each audio signal segment; and overlap-adding the human voice enhancement signals of the audio signal segments in sequence to obtain the target enhancement signal.
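The first and last steps of the method (windowing and framing, then overlap-adding the enhanced segments) can be sketched as follows. This is an illustration under stated assumptions rather than the disclosed implementation: the periodic Hann window, the 50% overlap, and all function names are hypothetical choices, and `enhance` is an identity placeholder standing in for the per-segment neural network.

```python
import math


def hann_periodic(n):
    """Periodic Hann window; two copies shifted by n/2 sum to exactly 1."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def frame_signal(x, frame_len, hop):
    """Split x into overlapping, windowed segments (the tail is zero-padded)."""
    win = hann_periodic(frame_len)
    n_frames = max(1, math.ceil((len(x) - frame_len) / hop) + 1)
    pad = (n_frames - 1) * hop + frame_len - len(x)
    padded = list(x) + [0.0] * pad
    return [
        [padded[k * hop + i] * win[i] for i in range(frame_len)]
        for k in range(n_frames)
    ]


def enhance(frame):
    """Placeholder for the neural network enhancement step (identity here)."""
    return list(frame)


def overlap_add(frames, hop):
    """Add each segment back at its original offset to rebuild one signal."""
    frame_len = len(frames[0])
    out = [0.0] * ((len(frames) - 1) * hop + frame_len)
    for k, frame in enumerate(frames):
        for i, value in enumerate(frame):
            out[k * hop + i] += value
    return out


# Usage: with the identity placeholder, samples covered by two segments
# are reconstructed; the first and last half-segments are tapered.
signal = [math.sin(0.3 * t) for t in range(64)]
segments = frame_signal(signal, frame_len=16, hop=8)
rebuilt = overlap_add([enhance(s) for s in segments], hop=8)
```

Because each segment is processed independently and then summed, the output can be produced segment by segment as audio arrives, which suits the real-time use case the disclosure targets.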

Description

Technical field

[0001] The present disclosure relates to the field of multimedia technology, and in particular to a method and device for human voice enhancement of an audio signal.

Background

[0002] With the development of multimedia technology, live video and video sharing have become a fashionable and common form of entertainment. However, in addition to human voices, videos usually contain obvious noise, such as wind, ringtones, or traffic horns. Especially when users record videos outdoors or broadcast live video, the external environment is usually noisy, and this noise makes it difficult for users to clearly hear the human voice dialogue in the video, which seriously degrades the listening experience.

[0003] The current technical solutions for vocal dialogue enhancement use a Recurrent Neural Network (RNN) or a Convolutional Neural Network (CNN) for deep learning. Although better vocal enhancement effects can be obtained, the network Th...

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G10L21/0264, G10L21/0316, G10L21/0324

CPC: G10L21/0316, G10L21/0324, G10L21/0264

Inventor: 邓峰, 姜涛, 李岩

Owner: BEIJING DAJIA INTERNET INFORMATION TECH CO LTD
