Audio coding method and audio coder

An audio coding and audio signal technology, which is applied in the field of audio coding and decoding, can solve the problems of high hardware equipment requirements, high power consumption, and difficult implementation, and achieve the effects of reducing complexity, reducing power consumption, and being easy to implement

Inactive Publication Date: 2010-06-16
HUAWEI TECH CO LTD +1
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In the prior art, when the encoder encodes the speech signal according to the obtained masking threshold, the psychoacoustic model established for obtaining the masking threshold requires very complicated calculations, is not easy to implement, and requires high hardware equipment. high power consumption

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio coding method and audio coder
  • Audio coding method and audio coder
  • Audio coding method and audio coder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0057] The present embodiment provides a kind of audio coding method, and this method is to utilize the frequency resolution characteristic of warped linear prediction (WLP, Warped Linear Prediction) and traditional linear prediction (LP, Linear Prediction) analysis to be very close to the critical frequency band in the human auditory characteristic and the characteristics of the masking feature, and finally obtain the masking threshold. see figure 2 As shown, the method includes:

[0058] Step 1: The encoder receives the time-domain audio signal;

[0059] The time-domain audio signal received by the encoder can be a speech signal, an audio signal, or a mixture of various sound signals that can be heard by the human ear. The frequency bandwidth of the audio signal is usually within the frequency range that can be heard by the human ear (ie 0Hz to 24000Hz). The audio signal received by the encoder is usually in a frame format, and the length of a frame is generally between ...

Embodiment 2

[0134] The embodiment of the present invention provides an audio coding method, see Figure 9 , and refer to the audio encoder shown in FIG. 1 . The method for obtaining the global masking threshold of the psychoacoustic model in the audio coding method utilizes a method for establishing a psychoacoustic model provided in Embodiment 1. An embodiment of the present invention provides an audio coding method including:

[0135] Step H1: the encoder receives a time-domain audio signal;

[0136] Wherein, the time-domain audio signal received by the encoder is the same step as that performed in step 1 in the first embodiment.

[0137] Step H2: The encoder establishes a psychoacoustic model based on the received time-domain audio signal to obtain a global masking threshold;

[0138] Wherein, for the execution method of step H3, reference may be made to the description in Embodiment 1.

[0139] Step H3: The encoder encodes the received time-domain audio signal according to the glo...

Embodiment 3

[0143] This embodiment provides an audio encoder, see Figure 10 As shown, it includes: receiving unit 10, sampling unit 20, linear prediction LP unit 30, obtaining LP filter amplitude-frequency response unit 40, curl linear prediction WLP unit 50, obtaining WLP filter amplitude-frequency response unit 60, obtaining local masking curve A unit 70 is configured to obtain a global masking curve unit 80 , a masking threshold unit 90 and an audio encoding unit 100 .

[0144] Wherein, the receiving unit 10 receives a time-domain audio signal, and the received time-domain audio signal may be a speech signal, an audio signal, or mixed information of various sound signals that can be heard by various human ears, and the frequency bandwidth of the audio signal is generally Since the human ear can hear the frequency range (ie 0 Hz to 24000 Hz), the audio signal is usually in a frame format, and the length of a frame is generally between 5 milliseconds and 30 milliseconds.

[0145] The a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an audio coding method and an audio coder method. The embodiment of the invention also provides a corresponding audio coder. As the feature that the frequency resolution characteristics of linear prediction (LP) and winding linear prediction (WLP) are very close to the critical band and masking characteristic in human auditory characteristic is utilized in the technical scheme of the invention, a psychoacoustics module is built, the masking threshold is obtained, and coding is carried out on audio signals according to the obtained masking threshold, thereby decreasing the complexity of building the psychoacoustic model, being realized easily, decreasing the hardware implementation cost of the psychoacoustic model, and lowering power consumption of hardware.

Description

technical field [0001] The invention relates to the technical field of audio coding and decoding, in particular to an audio coding method and an audio coder. Background technique [0002] In the audio coding technology, the audio coding technology with distortion can usually obtain a higher compression ratio, but in order to obtain good audio quality, it is necessary to control the degree of coding distortion in the audio coding technology. The psychoacoustic model is a mathematical model commonly used to control the degree of encoding distortion. The psychoacoustic model is a mathematical model that reflects the characteristics of human auditory perception abstracted on the basis of the study of the human auditory system. It reflects the human auditory system's ability to perceive and mask audio and noise. The parameter in the psychoacoustic model specifically used in the audio coding technology is usually the masking threshold, which is the sum of the value of the signal ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L19/12G10L19/14G10L19/13G10L19/18
Inventor 马鸿飞柳巍李倩宋少鹏许丽净
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products