A 1.2 kb/s low-rate speech codec method based on mixed excitation linear prediction (MELP)

A speech coding and decoding method based on linear prediction, applied in the field of voice communication. It addresses the increased coding complexity, codebook memory requirements, and implementation cost of existing vocoders. The synthesized speech has good clarity and intelligibility and the method is cheap to implement, which gives it good application prospects and practical value.

Active Publication Date: 2018-12-28
CHONGQING UNIV OF POSTS & TELECOMM

AI Technical Summary

Problems solved by technology

NATO's 1.2 kbps vocoder groups three 20 ms subframes into a superframe for joint quantization. With three subframes, many parameters must be extracted per superframe, yet the encoded output of each superframe is only 72 bits. The vocoder therefore applies vector quantization to the pitch period, the line spectrum frequencies, the residual harmonic amplitudes, the gain, and the band-pass voicing strength. This increases the coding complexity and the memory needed to store the vector quantization codebooks, and it raises the cost of implementing the vocoder.



Embodiment Construction

[0030] The present invention is further described below with reference to the accompanying drawings:

[0031] At the encoding end, the coding frame structure is designed first, which fixes the number of bits allotted to quantizing each speech feature parameter. The input speech signal is preprocessed, for example by denoising. The subframe length is 30 ms, and two subframes form a superframe. The speech feature parameters of the superframe are jointly quantized, the bits left over in the frame structure are used for error control coding of some important speech feature parameters, and the resulting binary bit stream is transmitted. At the decoding end, the speech feature parameters are parsed from the received bit stream and used to generate an excitation signal, which is passed through a synthesis filter to obtain the reconstructed speech signal. Gain adjustment and pulse shaping filtering are performed on the re...
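The frame-structure arithmetic in this paragraph can be checked with a short sketch. The function names and the per-parameter bit counts in the usage note are hypothetical placeholders chosen for illustration; the source does not state the actual allocation, only the 30 ms subframe length, the two-subframe superframe, and the 1.2 kb/s rate.

```python
def bits_per_superframe(bit_rate_bps: int, subframe_ms: int, subframes: int) -> int:
    """Return the number of bits available for one superframe."""
    superframe_ms = subframe_ms * subframes
    total = bit_rate_bps * superframe_ms
    if total % 1000 != 0:
        raise ValueError("bit rate and superframe length give a fractional bit count")
    return total // 1000


def remaining_bits(allocation: dict, budget: int) -> int:
    """Return the bits left for error control after parameter quantization.

    `allocation` maps a parameter name to the bits spent on quantizing it.
    """
    used = sum(allocation.values())
    if used > budget:
        raise ValueError(f"allocation uses {used} bits, budget is {budget}")
    return budget - used


# Two 30 ms subframes at 1.2 kb/s.
budget = bits_per_superframe(1200, 30, 2)

# Hypothetical allocation, NOT taken from the patent.
example_allocation = {"LSF": 40, "Pitch": 9, "VP": 4, "Fsmag": 8, "G": 8}
spare = remaining_bits(example_allocation, budget)
```

With these inputs the budget is 72 bits per 60 ms superframe, the same per-superframe budget as three 20 ms subframes at the same rate. Whatever the real allocation is, the bits it leaves unused are what the encoder can spend on error control coding.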


Abstract

The invention relates to a 1.2 kb/s low-rate speech encoding and decoding method based on mixed excitation linear prediction (MELP), comprising the following steps. The encoding end frames the speech signal with a subframe length of 30 ms, groups two adjacent subframes into a superframe, and performs multi-frame joint quantization encoding on the extracted speech feature parameters LSF, Pitch, VP, Fsmag and G. The encoding end uses the remaining bits in the frame structure for error control coding of the important speech feature parameters, and the resulting binary bit stream is transmitted. The decoding end parses the quantization index values of the speech feature parameters from the received bit stream, obtains initial values of the parameters from those indices, and then fully reconstructs the speech feature parameters. It generates an excitation signal from the reconstructed parameters and obtains the synthesized speech signal through adaptive spectral enhancement, a synthesis filter, gain control, and pulse dispersion filtering. The method effectively reduces the speech encoding rate, and the speech synthesized at the receiving end has high clarity and intelligibility.
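The framing step in the abstract can be sketched as below. This is a minimal illustration, not the patent's implementation: the 8 kHz sampling rate is an assumption (the excerpt does not give one), the function name is hypothetical, and a trailing partial superframe is simply dropped.

```python
def split_into_superframes(samples, sample_rate_hz=8000, subframe_ms=30,
                           subframes_per_superframe=2):
    """Split a sample sequence into superframes of equal-length subframes.

    The 8000 Hz default is an assumed sampling rate, not a value from the source.
    Samples that do not fill a whole superframe are discarded.
    """
    subframe_len = sample_rate_hz * subframe_ms // 1000
    superframe_len = subframe_len * subframes_per_superframe
    superframes = []
    # Step through the signal one whole superframe at a time.
    for start in range(0, len(samples) - superframe_len + 1, superframe_len):
        block = samples[start:start + superframe_len]
        subframes = [
            block[i * subframe_len:(i + 1) * subframe_len]
            for i in range(subframes_per_superframe)
        ]
        superframes.append(subframes)
    return superframes


# 1000 dummy samples: enough for two 480-sample superframes, 40 samples left over.
signal = list(range(1000))
frames = split_into_superframes(signal)
```

Under the assumed sampling rate each subframe holds 240 samples and each superframe 480. The feature parameters of the two subframes in one superframe are the ones the encoder would quantize jointly.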

Description

Technical Field

[0001] The invention belongs to the field of voice communication, and in particular relates to a MELP-based low-rate voice codec, which is widely used in voice application services such as secure communication, satellite mobile communication, deep-sea communication and voice mailbox.

Background

[0002] Voice is the main carrier for human beings to transmit information to each other. It is the most direct, convenient and effective way of communication in modern communication, and it is also the main means of human-computer interaction in the future. With the development of communication technology, the proportion of non-voice information such as images and data in information transmission is increasing, but the effective transmission of voice information is still one of the necessary functions of many communication systems.

[0003] Although the introduction of optical fiber transmission technology in recent years has provided huge transmission capa...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L19/087
Inventor: 李强付余涛舒勤军陈丁当陈浩朱兰明艳夏绪玖
Owner: CHONGQING UNIV OF POSTS & TELECOMM