Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

40 results about "Perceptual weighting" patented technology

A perceptual weighting device for producing a perceptually weighted signal in response to a wideband signal comprises a signal pre-emphasis filter, a synthesis filter calculator, and a perceptual weighting filter. The signal pre-emphasis filter enhances the high frequency content of the wideband signal to thereby produce a pre-emphasized signal.

Perceptual weighting device and method for efficient coding of wideband signals

A perceptual weighting device for producing a perceptually weighted signal in response to a wideband signal comprises a signal pre-emphasis filter, a synthesis filter calculator, and a perceptual weighting filter. The signal pre-emphasis filter enhances the high frequency content of the wideband signal to thereby produce a pre-emphasized signal. The signal pre-emphasis filter has a transfer function of the form: P(z)=1−μz−1, wherein μ is a pre-emphasis factor having a value located between 0 and 1. The synthesis filter calculator is responsive to the pre-emphasized signal for producing synthesis filter coefficients. Finally, the perceptual weighting filter processes the pre-emphasized signal in relation to the synthesis filter coefficients to produce the perceptually weighted signal. The perceptual weighting filter has a transfer function, with fixed denominator, of the form: W(z)=A(z / γ1) / (1−γ2z−1) where 0<γ2<γ1≦1.
Owner:SAINT LAWRENCE COMM

Method and apparatus for improved quality voice transcoding

A method and apparatus for a voice transcoder that converts a bitstream representing frames of data encoded according to a first voice compression standard to a bitstream representing frames of data according to a second voice compression standard using perceptual weighting that uses tuned weighting factors, such that the bitstream of a second voice compression standard to produce a higher quality decoded voice signal than a comparable tandem transcoding solution. The method includes pre-computing weighting factors for a perceptual weighting filter optimized to a specific source and destination codec pair, pre-configuring the transcoding strategies, mapping CELP parameters in the CELP parameter space according to the selected coding strategy, performing Linear Prediction analysis if specified by the transcoding strategy, perceptually weighting the speech using with tuned weighting factors, and searching for adaptive codebook and fixed-codebook parameters to obtain a quantized set of destination codec parameters.
Owner:ONMOBILE GLOBAL LTD +1

Coding/decoding of digital audio signals

The invention relates to the coding / decoding of a signal into several sub-bands, in which at least a first and a second sub-bands which are adjacent are transform coded (601, 602). In particular, in order to apply a perceptual weighting, in the transformed domain, to at least the second sub-band, the method comprises:—determining at least one frequency masking threshold (606) to be applied on the second sub-band; and normalizing said masking threshold in order to provide a spectral continuity between the above-mentioned first and second sub-bands. An advantageous application of the invention involves a perceptual weighting of the high-frequency band in the TDAC transform coding of a hierarchical encoder according to standard G.729.1.
Owner:FRANCE TELECOM SA

Coding method and decoding method for ultra wide band expansion, coder and decoder as well as system for ultra wide band expansion

ActiveCN101527138AUWB extension implementationImplement extensionsSpeech analysisDecoding methodsMean square
The invention relates to a coding method and a decoding method for ultra wide band expansion, a coder and a decoder as well as a system for the ultra wide band expansion, wherein the coding method and the coder obtain a coding parameter of a reconstructed 7-8 kHz signal through wide band enhancement, perform perception weighting on the base, obtain a frequency band expansion parameter of a reconstructed 8-14 kHz high frequency signal according to a principle of the smallest mean square error; and a decoding end reconstructs a 7-14 kHz high frequency signal through coding transmission so as to achieve the expansion of a 7-14 kHz ultra wide band. The decoding method and the decoder decode a residual error MDCT coding parameter to obtain a 7-8 kHz residual error MDCT recovery coefficient, perform perception weighting on the base, and reconstruct a 8-14 kHz signal according to the principle of the smallest mean square error so as to achieve the reconstruction of the 7-14 kHz high frequency signal and obtain a 0-14 kHz ultra wide band signal.
Owner:江苏三晟信息科技有限公司

Encoding and decoding method for audio processing frame

The invention provides encoding and decoding methods for an audio processing framework, wherein, the steps of the encoding method are that: A. a low-frequency stage signal after being preprocessed is firstly selected through PCX / ACELP mode, and the signal is treated with LPC analysis; B. according to the result of selection, the signal enters into the ACELP or the PCX mode for decoding; for the PCX mode, the inputted low frequency signal is treat with LPC aggregate and perceptual weighted process to acquire an LPC residual; then the LPC residual is treated with an extraction of model parameters; finally, a LPC coefficient and the model parameters are all treated with quantization encoding. The decoding steps are that: C. an input code stream acquires an encoding mode, ACELP parameter or PCX parameter via analysis and anti-quantization; D. decoding enters into different decoding branches according to the acquired mode; for the decoding of PCX, a LPC excite signal is synthesized through the model parameters using a synthetic method of a model corresponding to the encoding end, and the signal acquires a final low frequency signal through time domain LPC filter.
Owner:SPREADTRUM COMM (SHANGHAI) CO LTD

Method and apparatus for speech coding using training and quantizing

A perceptually weighted speech coder system samples a speech signal and determines its pitch. The speech signal is characterized as fully voiced, partially voiced or weakly voiced. A Lloyd-Max quantizer is trained with the pitch values of those speech signals characterized as being substantially fully voiced. The quantizer quantizes the trained fully voiced pitch values and the pitch values of the non-fully voiced speech signals. The quantizer can also quantize gain values in a similar manner. Sampling is increased for fully-voice signals to improve coding accuracy. This limits application to non-real time speech storage. Mixed excitation is used to synthesize the speech signal
Owner:GOOGLE TECH HLDG LLC

Device for perceptual weighting in audio encoding/decoding

The invention relates to a hierarchical audio encoder in a frequency band divided into a first sub-band and a second sub-band which are adjacent to each other, said encoder comprising: a core encoder (305) for encoding an original signal in the first sub-band of the frequency band; a calculation stage (306) for calculating a residual signal {e} from the original signal and from the signal supplied by the core encoder; and a device (307) for perceptual weighting of the residual signal {e}. According to the invention, the perceptual weighting device comprises a perceptual weighting filter (307) with gain compensation that can perform the spectral continuity between the signal at the output of the perceptual weighting filter with gain compensation and the signal in the second sub-band. The invention can be applied to the transmission and storage of digital signals, such as the audio-frequency signals of speech, music, etc.
Owner:FRANCE TELECOM SA

Layered celp system and method with varying perceptual filter or short-term postfilter strengths

ActiveUS7606703B2Weaker perceptual filteringWeaker short-term postfilteringSpeech analysisCode-excited linear predictionPerceptual weighting
Layered code-excited linear prediction (CELP) speech encoders have progressively weaker perceptual weighting filters for each of the successive enhancement layers and decoders have progressively weaker short-term postfilters for increased bit rates (increased number of enhancement layers decoded) and a long-term postfilter for all bit rates.
Owner:TEXAS INSTR INC

Voice pitch period estimation method and device

The invention relates to a voice pitch period estimation method and device. The device comprises a signal preprocessing unit, a normalized autocorrelation function computing element and a pitch period postprocessing unit. The method includes the steps of firstly, conducting preprocessing including direct current component removal, perception weighting and under-signal sampling on voice signals; secondly, computing normalized autocorrelation function values of the processed voice signals; thirdly, determining the maximum of the normalized autocorrelation function values in the pitch period searching range, and determining a pitch period candidate value corresponding to the maximum to be a pitch period estimation value of the voice signals. According to the voice pitch period estimation method and device, frequency doubling errors and frequency halving errors in the pitch period estimation are well overcome, the noise resistance performance of the pitch period estimation method is improved, meanwhile, the algorithm complexity of an algorithm is lowered, and the corresponding digital audio / speech coding efficiency is improved. The voice pitch period estimation method and device can be applied to pitch searching of various voice coding and decoding algorithms and have a wide application range.
Owner:广东广晟研究开发院有限公司

Encoding apparatus for processing an input signal and decoding apparatus for processing an encoded signal

ActiveUS20170270941A1Reduces magnitude-rangeReduced magnitude-rangeSpeech analysisTransmissionFrequency spectrumPerceptual weighting
Disclosed is an apparatus for processing an input signal, having a perceptual weighter and a quantizer. The perceptual weighter has a model provider and a model applicator. The model provider provides a perceptual weighted model based on the input signal. The model applicator provides a perceptually weighted spectrum by applying the perceptual weighted model to a spectrum based on the input signal. The quantizer is configured to quantize the perceptually weighted spectrum and for providing a bitstream. The quantizer has a random matrix applicator and a sign function calculator. The random matrix applicator is configured for applying a random matrix to the perceptually weighted spectrum in order to provide a transformed spectrum. The sign function calculator is configured for calculating a sign function of components of the transformed spectrum in order to provide the bitstream. The invention further refers to an apparatus for processing an encoded signal and to corresponding methods.
Owner:FRAUNHOFER GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG EV

Perception weighting filtering wave method and perception weighting filter thererof

The invention discloses a method for perceptual weighted filtering, which comprises: A. to make a spectrum slant filtering processing to the inputted speech or audio signal. B. to make a traditional perceptual weighted filtering on the output speech or audio signal processed by the spectrum slant filtering. C. to output the speech or audio signal processed by the traditional perceptual weighted filtering and takes the partial coefficient of the coefficient in transfer function used in the perceptual weighted filtering as the coefficient in transfer function used in the said perceptual weighted filtering. Besides, the invention also discloses a perceptual weighted filter designed according to the said method for perceptual weighted filtering. The invention is to achieve the purpose of simulating the formant structure of the inputted speech or audio signal spectrum of the broadband to improve the coding efficiency and enhance the subjective auditory effect by taking full advantages of the human auditory masking effect.
Owner:HUAWEI TECH CO LTD +1

Pre-emphasis filter, perception weighted filtering method and system

InactiveCN101770778AImprove the subjective auditory effectTightly controlled complexitySpeech analysisPerceptual weightingPattern perception
The embodiment of the invention discloses a pre-emphasis filter, a perception weighted filtering method and a system; the pre-emphasis filter comprises a parameter determining module, a configuration module and a filtering module; the perception weighted filtering method comprises the following steps: the parameters of the pre-emphasis filter are determined according to low-frequency energy and high-frequency energy of an input signal; the pre-emphasis filter is arranged according to the parameters, the input signal is processed to obtain a first signal by the pre-emphasis filter; linear prediction LP analysis is carried out to the first signal, the coefficient of the perception weighted filter is determined; the perception weighted filter is arranged according to parameters and coefficients, and filtering is carried out to the first signal through the perception weighted filter; the perception weighted system mainly comprises the pre-emphasis filter, an LP filter, the configuration module and the perception weighted filter; by adopting the embodiment of the invention, the subjective hearing effect of voice coding is improved, and the invention is widely suitable for hybrid coding mode and can strictly control the complexity of the algorithm.
Owner:HUAWEI TECH CO LTD +1

Transcoder for speech codecs of different CELP type and method therefor

InactiveUS20050010403A1Minimizes spectral distortionQuality improvementSpeech analysisPerceptual weightingSpeech sound
A transcoder for use between speech codecs using different Code-Excited Linear Prediction (CELP) type and a method therefor are disclosed. The transcoder includes a decoding unit of an input CELP codec, a transcoding filter, a transcoding filter design unit, and an encoding unit of an output CELP codec. By substituting a post-filter and a perceptual weighting filter of a prior art with one transcoding filter, the calculation amount of the transcoder is reduced, and speech quality decoded at a receiving end is improved.
Owner:ELECTRONICS & TELECOMM RES INST

Method for high quality audio transcoding

A method and apparatus for a voice transcoder that converts a bitstream representing frames of data encoded according to a first voice compression standard to a bitstream representing frames of data according to a second voice compression standard using perceptual weighting that uses tuned weighting factors, such that the bitstream of a second voice compression standard to produce a higher quality decoded voice signal than a comparable tandem transcoding solution. The method includes pre-computing weighting factors for a perceptual weighting filter optimized to a specific source and destination codec pair, pre-configuring the transcoding strategies, mapping CELP parameters in the CELP parameter space according to the selected coding strategy, performing Linear Prediction analysis if specified by the transcoding strategy, perceptually weighting the speech using with tuned weighting factors, and searching for adaptive codebook and fixed-codebook parameters to obtain a quantized set of destination codec parameters.
Owner:ONMOBILE GLOBAL LTD

Perception weighting filtering wave method and perception weighting filter thererof

The present invention discloses a perceptual weighing filtering method. The method mainly comprises treating the input voice or audio signals through spectrum inclined filtering, processing the output voice or audio signals treated through perceptual weighing filtering, selecting the corresponding weighing factor value according to the spectral flatness of the input signals, and outputting the voice or audio signals processed through the traditional perceptual weighing filtering. Moreover, part of the coefficients of the transmission function acquired in the processing of perceptual weighing filtering are directly used as the coefficients of the transmission function acquired in the processing of spectrum inclined filtering. Simultaneously, the present invention also provides a perceptual weighing filter. The perceptual weighing filter can well simulate the formant structure of the frequency spectrum of the input wideband voice or audio signals without additional encoding bit rate and reducing the computation complexity, make full use of he masking effects of human acoustic feelings, and achieve the purposes of improving the encoding efficiency and enhancing the subjective hearing effects.
Owner:HUAWEI TECH CO LTD +1

Audio output method and device

The invention discloses an audio output method and device. The audio output method includes: determining a masking effect curve of frequency domain noise signals, performing perceptual weighting on energy information of source audio signals to acquire energy information after weighting, performing time domain to frequency domain conversion on the source audio signals to acquire frequency domain source audio signals, determining that the energy information after weighting is energy information corresponding to the frequency domain source audio signals, enhancing or weakening the frequency domain source audio signals according to the masking effect curve and the energy information corresponding to the frequency domain source audio signals, performing frequency domain to time domain conversion on the frequency domain source audio signals after being enhanced or weakened to acquire source audio signals after processing, reversely weighting the source audio signals after processing, and outputting audio signals. By combining and analyzing masking effect and an audio weighting mode, interference resistance of audio output equipment is improved without excessively changing output audio energy.
Owner:CHINA MOBILE COMM GRP CO LTD

Optimized windows and methods therefore for gradient-descent based window optimization for linear prediction analysis in the ITU-T G.723.1 speech coding standard

Primary and alternate optimization procedures are used to improve the ITU-T G.723.1 speech coding standard (the “Standard”) by replacing the Hamming window of the Standard with an optimized window, with two windows, or with two windows and an additional performance of an autocorrelation method. When two windows replace the Hamming window, at least one of which is an optimized window, generally the first is used to determine optimized unquantized LP coefficients which are used to define an optimized perceptual weighting filter, and the second is used to determine optimized unquantized LP coefficients which are used to determine optimized synthesis coefficients. Optimized windows created using the primary and alternate optimization procedures and used in the Standard yield improvements in the objective and subjective quality of synthesized speech produced by the Standard. The improved Standard, methods, and window can all be implemented as computer readable software code.
Owner:NTT DOCOMO INC

Optimized windows and methods therefore for gradient-descent based window optimization for linear prediction analysis in the ITU-T G.723.1 speech coding standard

Primary and alternate optimization procedures are used to improve the ITU-T G.723.1 speech coding standard (the “Standard”) by replacing the Hamming window of the Standard with an optimized window, with two windows, or with two windows and an additional performance of an autocorrelation method. When two windows replace the Hamming window, at least one of which is an optimized window, generally the first is used to determine optimized unquantized LP coefficients which are used to define an optimized perceptual weighting filter, and the second is used to determine optimized unquantized LP coefficients which are used to determine optimized synthesis coefficients. Optimized windows created using the primary and alternate optimization procedures and used in the Standard yield improvements in the objective and subjective quality of synthesized speech produced by the Standard. The improved Standard, methods, and widow can all be implemented as computer readable software code.
Owner:NTT DOCOMO INC

Transcoder for speech codecs of different CELP type and method therefor

A transcoder for use between speech codecs using different Code-Excited Linear Prediction (CELP) type and a method therefor are disclosed. The transcoder includes a decoding unit of an input CELP codec, a transcoding filter, a transcoding filter design unit, and an encoding unit of an output CELP codec. By substituting a post-filter and a perceptual weighting filter of a prior art with one transcoding filter, the calculation amount of the transcoder is reduced, and speech quality decoded at a receiving end is improved.
Owner:ELECTRONICS & TELECOMM RES INST

Speech coding apparatus with perceptual weighting and method therefor

A speech coding apparatus including a perceptual linear prediction (plp) analysis buffer configured to output a pitch period with respect to an original input speech signal and to analyze the input speech signal using a plp process to output a plp coefficient, an excitation signal generator configured to generate and output an excitation signal, a pitch synthesis filter configured to synthesize the pitch period output from the plp analysis buffer and the excitation signal output from the excitation signal generator, a spectral envelop filter configured to apply the plp coefficient output from the plp analysis buffer to an output of the pitch synthesis filter to output a synthesized speech signal, an adder configured to subtract the synthesized signal output from the spectral envelope filter from the original input speech signal output from the plp analysis buffer and to output a difference signal, a perceptual weighting filter configured to calculate an error by providing a weight value corresponding to a consideration of a person's auditory effect to the difference signal output from the adder, and a minimum error calculator configured to discover an excitation signal having a minimum error corresponding to the error output from the perceptual weighting filter.
Owner:LG ELECTRONICS INC

Method for coding variable speed audio frequency switching between adjacent high/low speed coding modes

InactiveCN102254562AAverage encoding rate reductionImprove coding efficiencySpeech analysisLow speedPerceptual weighting
The invention relates to a method for coding a variable speed audio frequency switching between adjacent high / low speed coding modes, which belongs to the field of audio coding and is particularly suitable for a multi-speed audio encoder. The method is technically characterized by comprising the following steps of: coding and decoding each frame of audio signals at a high speed, making coding input signals and decoding output signals in various coding modes at the speed pass through a sensing and weighting filter, calculating an average segmentation signal to noise ratio (SNR), and selecting a coding mode with the maximum sensing and weighting average segmentation SNR; selecting a coding mode with the maximum sensing and weighting average segmentation SNR of the coding input signals and the decoding output signals at a low coding speed close to the high speed; and finally, calculating average segmentation SNR of the coding input signals and the decoding output signals relative to the coding modes selected at the high speed and the low speed respectively, if the average segmentation SNR in the low-speed coding mode is greater than the average segmentation SNR in the high-speed coding mode, switching to the low-speed coding mode, otherwise, switching to the high-speed coding mode. By the method, each frame of audio signals are switched among various coding modes at adjacent high / low coding speeds according to distortion of the output signals relative to the input signals, so that high coding quality is kept, and the average coding speed of the audio signals is reduced simultaneously; therefore, the coding efficiency of the multi-speed audio encoder is improved.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Speech pitch estimation method and device

The invention relates to a voice pitch period estimation method and device. The device comprises a signal preprocessing unit, a normalized autocorrelation function computing element and a pitch period postprocessing unit. The method includes the steps of firstly, conducting preprocessing including direct current component removal, perception weighting and under-signal sampling on voice signals; secondly, computing normalized autocorrelation function values of the processed voice signals; thirdly, determining the maximum of the normalized autocorrelation function values in the pitch period searching range, and determining a pitch period candidate value corresponding to the maximum to be a pitch period estimation value of the voice signals. According to the voice pitch period estimation method and device, frequency doubling errors and frequency halving errors in the pitch period estimation are well overcome, the noise resistance performance of the pitch period estimation method is improved, meanwhile, the algorithm complexity of an algorithm is lowered, and the corresponding digital audio / speech coding efficiency is improved. The voice pitch period estimation method and device can be applied to pitch searching of various voice coding and decoding algorithms and have a wide application range.
Owner:广东广晟研究开发院有限公司

Method and device for enhancing voice signal

InactiveCN102054482AWeakening rangeNoise reduction intensity is smallSpeech analysisTime domainFrequency spectrum
The embodiment of the invention discloses a method and a device for enhancing a voice signal. The method comprises the steps of: obtaining a noising voice signal, and carrying out perception weighted filtering on the noising voice signal; converting the noising voice signal subjected to the perception weighted filtering into a frequency domain, carrying out spectrum subtraction and phase synthesis on the noising voice signal in the frequency domain, and converting the voice signal subjected to the spectrum subtraction and phase synthesis into a time domain; and carrying out inverse perception weighted filtering on the voice signal subjected to the spectrum subtraction and the phase synthesis to obtain an enhanced voice signal. Through using the invention, the noising voice signal is subjected to the perception weighted filtering, the interference of the noising voice signal to the noise is effectively eliminated, the enhanced voice signal is obtained and the human vision is met.
Owner:CHINA MOBILE COMM GRP CO LTD

UWB extension coding, decoding method, codec and UWB extension system

A super-wideband extending coding and decoding method, coder and super-wideband extending system are provided. The coding method includes separating the super-wideband speech signal into high frequency sub-band signal and low frequency sub-band signal (101); speech decoding the low frequency signal parameter obtained by speech coding the low frequency sub-band signal to obtain the low band recover signal; time-frequency domain transforming and band extending the low band recover signal respectively to obtain the low frequency domain recover coefficient, the residual frequency domain coefficient and the residual frequency domain coding coefficient; perception weighting the low frequency domain recover coefficient and the residual frequency domain coefficient  based on the low frequency sub-band signal to obtain the model frequency domain coefficient, and spectral folding and time-frequency transforming the high frequency sub-band signal to obtain the high frequency domain coefficient, and matching the model frequency domain coefficient and the high frequency domain coefficient according to the Minimum Mean Square Error rule to obtain the frequency band extended parameter; transmitting the low frequency signal parameter, the residual frequency domain coding coefficient and coded extended parameter(109).
Owner:江苏三晟信息科技有限公司

A h.264 code rate control method

The invention discloses a H.264 code rate control method, which includes the following steps: Step 1: Calculate the motion complexity SAD and visual perception weight of the image macroblock MB by performing pre-motion estimation; Step 2: Synthesize the image target code rate, the target frame rate and the filling degree of the code stream buffer to calculate the frame-level target bit number of the image; step 3: perform frame-level code rate control on the image; step 4: perform macroblocking on the image Level code rate control, its code rate control model according to the macroblock MB motion complexity and frame level target number of bits obtained in the above-mentioned process, distributes suitable quantization parameter QP for each macroblock MB; Step 5: carry out described image Formal Motion Estimation. Compared with the existing method, the present invention significantly reduces the computational complexity and improves the code rate control accuracy for re-encoding the decoded and restored video. The processing process of the present invention has the characteristics of low delay, and is suitable for video communication with relatively high real-time requirements. , the effect is more obvious.
Owner:成都随锐云科技有限公司

Multi-speed audio encoding method

The embodiment of the invention discloses a multi-rate voice frequency encoding method. The method comprises the following steps: computing frequency spectrum of a perceptual weighting filter and a first ratio of the frequency spectrum of two previous layers of synthetic voice which are firstly encoded and then decoded by an input signal for each adjacent lattice point; and encoding index values of the corresponding lattice point of the first ratio into a code stream according to a descending order of the first ratio. The method can help improve the quality of output voice or sound signals when lacking bits during transform domain encoding.
Owner:HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products