Generative adversarial network training method oriented to band spreading, and audio encoding and decoding methods

A technology of network training and frequency band expansion, which is applied in the fields of decoding and audio coding, can solve problems such as difficult convergence, and achieve the effect of low space complexity

Active Publication Date: 2018-04-20
PEKING UNIV
View PDF5 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Aiming at the shortcomings of generative adversarial networks that are not easy to converge and the particularity of the sound signal frequency band expansion task, the traditional generative adversarial network is improved by introducing real low-frequency information and high-frequency envelopes, and a complete single-channel network is built on this basis codec system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Generative adversarial network training method oriented to band spreading, and audio encoding and decoding methods
  • Generative adversarial network training method oriented to band spreading, and audio encoding and decoding methods
  • Generative adversarial network training method oriented to band spreading, and audio encoding and decoding methods

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] In order to facilitate those skilled in the art to understand the technical content of the present invention, the content of the present invention will be further explained below in conjunction with the accompanying drawings.

[0040] The invention includes three parts: the improvement and training of the generative confrontation network, an encoder based on the frequency band extension algorithm of the generative confrontation network, and a decoder based on the frequency band extension algorithm of the generative confrontation network.

[0041] Improving and Training Generative Adversarial Networks

[0042]In 2014, Ian J.Goodfellow of the University of Montreal and others proposed the main idea of ​​the generative confrontation network as follows: through competitive learning, use a discriminant network to evaluate the generative network. The generative confrontation network consists of two networks: one is the generative model (Generative model) G, which is used to s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a generative adversarial network training method oriented to band spreading, an audio encoding method and an audio decoding method. The generative adversarial network trainingmethod comprises the steps of: performing transient signal detection on an audio signal; then performing MDCT transformation on the audio signal separately according to detection results, and regarding an obtained frequency spectrum as real data; performing band division on the spectra, calculating a high and low frequency spectrum energy envelope ratio, then performing quantization and inverse quantization on the high and low frequency spectrum energy envelope ratio; inputting a low frequency spectrum obtained through band division into a generative adversarial network GAN to obtain a high frequency spectrum; correcting the generated high frequency spectrum by utilizing the high and low frequency spectrum energy envelope ratio after inverse quantization, so as to obtain a finally generated high frequency spectrum; synthesizing the finally generated high frequency spectrum and the low frequency spectrum obtained through band division into a generated spectrum of the whole band, and regarding the generated spectrum of the whole band as false data; and taking the obtained real data and false data as input of a discrimination network D, and training the generative adversarial network.The network trained by adopting the generative adversarial network training method is easy to converge.

Description

technical field [0001] The invention belongs to the field of audio coding and decoding, and relates to a frequency band expansion method, in particular to a frequency band expansion-oriented generative confrontation network training method, an audio coding method, and a decoding method. Background technique [0002] Audio codec technology, also known as audio compression technology, compresses and encodes audio files to reduce the file bit rate, making the results easy to record, store, and transmit, and has a wide range of uses. When the target bit rate is low, the traditional mono audio codec technology will discard high-frequency information to ensure low-frequency compression effect, but due to the lack of high-frequency information, the sound of the codec result will cause hollowness, dullness and other discomfort a feeling of. In order to improve the codec quality, the decoding result of the single-channel core encoder is usually band-extended. Such methods are colle...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L19/02G10L19/24G10L21/038
CPCG10L19/02G10L19/24G10L21/038
Inventor 曲天书吴玺宏黄庆博
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products