A generative confrontation network training method and audio encoding and decoding method for frequency band expansion

A technology of network training and frequency band expansion, applied in the field of audio coding and decoding, can solve problems such as difficult convergence, and achieve the effect of low space complexity

Active Publication Date: 2021-06-01
PEKING UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Aiming at the shortcomings of generative adversarial networks that are not easy to converge and the particularity of the sound signal frequency band expansion task, the traditional generative adversarial network is improved by introducing real low-frequency information and high-frequency envelopes, and a complete single-channel network is built on this basis codec system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A generative confrontation network training method and audio encoding and decoding method for frequency band expansion
  • A generative confrontation network training method and audio encoding and decoding method for frequency band expansion
  • A generative confrontation network training method and audio encoding and decoding method for frequency band expansion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] In order to facilitate those skilled in the art to understand the technical content of the present invention, the content of the present invention will be further explained below in conjunction with the accompanying drawings.

[0040] The invention includes three parts: improvement and training of the generative confrontation network, an encoder based on the frequency band extension algorithm of the generative confrontation network, and a decoder based on the frequency band extension algorithm of the generative confrontation network.

[0041] Improving and Training Generative Adversarial Networks

[0042]In 2014, Ian J.Goodfellow of the University of Montreal and others proposed the main idea of ​​the generative confrontation network as follows: through competitive learning, use a discriminant network to evaluate the generative network. The generative confrontation network consists of two networks: one is the generative model (Generative model) G, which is used to simul...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a generative confrontation network training method and an audio encoding and decoding method oriented to frequency band expansion. The training method of the generative confrontation network of the present invention is as follows: detect the transient signal of the audio signal; then perform MDCT transformation on it according to the detection result, and use the obtained spectrum as real data; divide the spectrum into bands, and calculate the high and low frequency spectrum Energy envelope ratio, and then quantize and inverse quantize the energy envelope ratio of the high and low frequency spectrum; input the low frequency spectrum obtained by band division into the generation network GAN to generate high frequency spectrum; use the dequantized high frequency energy envelope to correct the generated The high-frequency spectrum is obtained to obtain the final generated high-frequency spectrum; the final generated high-frequency spectrum and the low-frequency spectrum obtained by banding are synthesized into a full-band generated spectrum, and the full-band generated spectrum is used as false data; the real data, Fake data is used as the input of the discriminative network D to train the generative adversarial network. The network trained by the invention is easy to converge.

Description

technical field [0001] The invention belongs to the field of audio coding and decoding, and relates to a frequency band expansion method, in particular to a frequency band expansion-oriented generative confrontation network training method, an audio coding method, and a decoding method. Background technique [0002] Audio codec technology, also known as audio compression technology, compresses and encodes audio files to reduce the file bit rate, making the results easy to record, store, and transmit, and has a wide range of uses. When the target bit rate is low, the traditional mono audio codec technology will discard high-frequency information to ensure low-frequency compression effect, but due to the lack of high-frequency information, the sound of the codec result will cause hollowness, dullness and other discomfort a feeling of. In order to improve the codec quality, the decoding result of the single-channel core encoder is usually band-extended. Such methods are colle...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L19/02G10L19/24G10L21/038
CPCG10L19/02G10L19/24G10L21/038
Inventor 曲天书吴玺宏黄庆博
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products