Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

MELP-based (Mixed Excitation Linear Prediction-based) multi-frame joint quantization low-rate speech coding and decoding method

An encoding and decoding method and multi-frame joint technology are applied in the field of multi-frame joint quantization low-rate speech encoding and decoding based on MELP, which can solve the problems of large parameter reconstruction distortion, low parameter quantization execution efficiency, and reduced encoding rate.

Inactive Publication Date: 2013-04-17
BEIHANG UNIV
View PDF4 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The papers "A VARIBLE LOW BIT RATESPEECH CODER BASED ON MELP" and "A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS" mentioned that joint quantization of four frames and six frames is used for speech signals respectively, and the document "A MELP-based In 600b / s Very Low Rate Speech Coding, three-frame joint quantization is adopted for the speech signal at the encoding end, but only the first frame and the last frame are passed when the parameters are passed, and the inter-frame linear interpolation of the hard decision nature is adopted for the parameters at the decoding end It is predicted that the paper "Joint Optimization Algorithm of Multi-parameter Codebook Size Based on Superframe Mode" adopts dynamic codebook quantization based on signal voicing for all multi-frame parameters, which reduces the encoding rate to a certain extent, but the standard voice Subjective and objective test results show that the combination of too many single-frame parameters leads to too many dimensions of the transmitted parameters, and the time required for vector quantization in the parameter quantization process is longer, and the effect on the delay effect of the coding scheme is poor; for multi-frame parameters Too simple inter-frame linear prediction under hard decision at the decoding end leads to large distortion of parameter reconstruction, which can easily lead to interference of multiplied signals, and poor intelligibility of reconstructed signals, which is different from parameter coding to reconstruct signals. Intelligibility is inconsistent with the first goal, and all parameters of the multi-frame signal are quantized based on the dynamic codebook size of the voicing situation, which leads to the need to prepare a large number of codebooks at the encoding end when quantizing the parameters. In the specific implementation process The medium occupies a large amount of storage, resulting in low execution efficiency of parameter quantization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • MELP-based (Mixed Excitation Linear Prediction-based) multi-frame joint quantization low-rate speech coding and decoding method
  • MELP-based (Mixed Excitation Linear Prediction-based) multi-frame joint quantization low-rate speech coding and decoding method
  • MELP-based (Mixed Excitation Linear Prediction-based) multi-frame joint quantization low-rate speech coding and decoding method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] Attached below figure 1 , 2 , 3, 4, 5, take the mandarin voice file of 1min as an example to specifically introduce the encoding scheme provided by the present invention, the voice signal sampling rate is 8KHz, the single frame signal time length is set to 25ms, and the sample point length is 200. See Figure 6 , as shown,

[0062] Encoder:

[0063] Step 1: Set the target encoding rate to 0.8kb / s, adopt a three-frame joint quantization scheme, assign 60 bits to the parameters of each three-frame signal for quantization, and the code rate is 60bit / 75ms=0.8kb / s, and adopt partial transmission for the parameters, The transmitted parameters include: line spectrum pair frequency lsf, pitch period pitch, gain G, and bandpass signal vp. The specific bit allocation scheme is shown in Table 1.

[0064]

[0065]

[0066] Table 1

[0067]Step 2: Perform de-power frequency processing for the input voice signal, divide the frame according to the set frame length of 25ms, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an MELP-based multi-frame joint quantization low-rate speech coding and decoding method comprising the steps that an coding end first processes signals by adopting the length of 25ms per frame, parameters, i.e. line spectrum pair frequency (lsf), pitch period (pitch), band pass unvoicing / voicing (vp) and gain (G), are sequentially extracted, joint quantization is then carried out with each three neighboring frames as a unit, wherein a three-dimensional single-code book is adopted to quantize the vp, a code book, the size of which is dynamic according to signal unvoicing / voicing, is adopted to quantize the pitch after the pitch is logarithmizd, the G is first deequalized and then quantized by a single-code book, four-stage residual vector quantization is only carried out on a 20-dimenstional vector composed of the lsf of the first frame and the lsf of the final frame, a decoding end first adopts a decoding book to process the G, the lsf and the pitch, and then adopts interpolation factor-weighted interframe linear prediction by aiming at the lsf to obtain the lsf of the middle frame, the unvoicing / voicing information of five neighboring frames of signals is embedded into the interpolation factor r solution process, and the frequency spectrum continuity and stability of the voice signals are taken into full consideration. Consequently, the method can effectively decrease the coding rate to be lower than 1.2kb / s, and has great reference value for the research and application of the low-rate speech coding technology.

Description

technical field [0001] This method relates to a low-rate speech coding method in a wireless communication system, in particular to a multi-frame joint quantization low-rate speech coding method based on mixed excitation linear predictive coding (MELP), which is suitable for wireless communication systems in communication In an environment with poor conditions and complex background noise, it occupies very few spectrum resources to realize reliable transmission of voice signals, and belongs to the field of wireless communication technology. technical background [0002] With the continuous expansion of current wireless communication services and the continuous increase in the amount of transmitted data, future wireless communication systems require higher data transmission efficiency and transmission accuracy, especially the most basic daily voice communication. However, the current wireless communication spectrum resources are becoming increasingly tight, the electromagnetic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L19/135
Inventor 修春娣苏兆安刘建伟
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products