Unlock instant, AI-driven research and patent intelligence for your innovation.

A kind of audio encoding method and audio encoder

An audio coding and audio signal technology, applied in the field of audio coding and decoding, can solve the problems of high hardware equipment requirements, high power consumption, and difficulty in implementation, and achieve the effects of reducing complexity, reducing power consumption, and being easy to implement.

Inactive Publication Date: 2011-12-28
HUAWEI TECH CO LTD +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In the prior art, when the encoder encodes the speech signal according to the obtained masking threshold, the psychoacoustic model established for obtaining the masking threshold requires very complicated calculations, is not easy to implement, and requires high hardware equipment. high power consumption

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A kind of audio encoding method and audio encoder
  • A kind of audio encoding method and audio encoder
  • A kind of audio encoding method and audio encoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0066] This embodiment provides an audio coding method, which utilizes Warped Linear Prediction (WLP, Warped Linear Prediction) and traditional Linear Prediction (LP, Linear Prediction) to analyze the frequency resolution characteristic very close to the critical frequency band in the characteristic of human hearing and the characteristics of the masking feature, and finally obtain the masking threshold. see figure 2 As shown, the method includes:

[0067] Step 1: The encoder receives the time-domain audio signal;

[0068] The time-domain audio signal received by the encoder can be a speech signal, an audio signal, or a mixture of various sound signals that can be heard by the human ear. to 24000Hz). The audio signal received by the encoder is usually in frame format, and the length of a frame is generally between 5 milliseconds and 30 milliseconds.

[0069] Step 2: the encoder samples the received audio signal to obtain the sampled audio signal x(n);

[0070] In this em...

Embodiment 2

[0144] An embodiment of the present invention provides an audio coding method, see Figure 9 shown, and with reference to the audio encoder shown in FIG. 1 . The method for obtaining the global masking threshold of the psychoacoustic model in the audio coding method utilizes the method for establishing the psychoacoustic model provided in the first embodiment. An embodiment of the present invention provides an audio coding method comprising:

[0145] Step H1: the encoder receives the time-domain audio signal;

[0146] The time-domain audio signal received by the encoder is the same step performed in step 1 in the first embodiment.

[0147] Step H2: the encoder establishes a psychoacoustic model according to the received time-domain audio signal, and obtains a global masking threshold;

[0148] Wherein, for the execution method of step H3, reference may be made to the description in the first embodiment.

[0149] Step H3: The encoder encodes the received time-domain audio s...

Embodiment 3

[0153] This embodiment provides an audio encoder, see Figure 10 As shown, it includes: receiving unit 10, sampling unit 20, linear prediction LP unit 30, acquiring LP filter amplitude-frequency response unit 40, curling linear prediction WLP unit 50, acquiring WLP filter amplitude-frequency response unit 60, acquiring local masking curve Unit 70 , obtaining a global masking curve unit 80 , obtaining a masking threshold unit 90 and an audio coding unit 100 .

[0154] The receiving unit 10 receives a time-domain audio signal. The received time-domain audio signal may be a voice signal, an audio signal, or a mixture of various sound signals that can be heard by the human ear. The frequency bandwidth of the audio signal is usually For the human ear can hear the frequency range (ie 0Hz to 24000Hz), the audio signal is usually in the format of frames, and the length of a frame is generally between 5 milliseconds and 30 milliseconds.

[0155] The adopting unit 20 adopts the receive...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an audio coding method and an audio coder method. The embodiment of the invention also provides a corresponding audio coder. As the feature that the frequency resolution characteristics of linear prediction (LP) and winding linear prediction (WLP) are very close to the critical band and masking characteristic in human auditory characteristic is utilized in the technical scheme of the invention, a psychoacoustics module is built, the masking threshold is obtained, and coding is carried out on audio signals according to the obtained masking threshold, thereby decreasing the complexity of building the psychoacoustic model, being realized easily, decreasing the hardware implementation cost of the psychoacoustic model, and lowering power consumption of hardware.

Description

technical field [0001] The present invention relates to the technical field of audio coding and decoding, in particular to an audio coding method and an audio encoder. Background technique [0002] In the audio coding technology, the distorted audio coding technology can usually obtain a higher compression ratio, but in order to obtain good audio quality, it is necessary to control the degree of coding distortion in the audio coding technology. A psychoacoustic model is a mathematical model commonly used to control the degree of coding distortion. The psychoacoustic model is a mathematical model abstracted by people based on the study of the human auditory system to reflect the characteristics of human auditory perception. It reflects the human auditory system's ability to perceive and mask audio and noise. The parameter in the psychoacoustic model specifically used in the audio coding technology is usually the masking threshold, which is the sum of the values ​​masked by a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L19/12G10L19/14G10L19/13G10L19/18
Inventor 马鸿飞柳巍李倩宋少鹏许丽净
Owner HUAWEI TECH CO LTD