A 1.2kb/s low-rate speech codec method based on mixed excitation linear prediction melp

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A coding and decoding method and linear prediction technology, applied in the field of voice communication, can solve the problems of increased coding complexity, memory capacity, cost of vocoder implementation, etc., and achieve good application prospects and practical value, good clarity and intelligibility degree, to achieve the effect of low cost

Active Publication Date: 2018-12-28

CHONGQING UNIV OF POSTS & TELECOMM

View PDF4 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

NATO's 1.2kbps vocoder uses three 20ms subframes to form a superframe for joint quantization. Due to the large number of subframes, there are many parameters extracted from each subframe, and the encoding output of each superframe is only 72bit. Therefore, the sound The coder adopts the vector quantization coding technology for the pitch period, the line spectrum for the frequency, the residual harmonic amplitude value, the gain and the band-pass voiced sound intensity, which leads to the increase of the coding complexity and the memory capacity required for storing the vector quantization codebook, and the acoustic Encoder implementation costs increase

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0030] Below in conjunction with accompanying drawing, the present invention will be further described:

[0031] At the encoding end, the encoding frame structure is designed to determine the number of bits required for the quantization of each speech feature parameter; preprocessing such as denoising is performed on the input speech signal. The length of the subframe is 30ms, and two subframes form a superframe. Jointly quantize the speech feature parameters of the superframe, and use the remaining bits in the frame structure to perform error control coding on some important speech feature parameters, and finally transmit the binary bit stream. At the decoding end, the speech feature parameters are analyzed from the received bit stream, and the speech feature parameters analyzed are used to generate an excitation signal, and the reconstructed speech signal is obtained after passing through a synthesis filter. Gain adjustment and pulse shaping filtering are performed on the re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a 1.2kb / s low-rate speech encoding and decoding method based on mixed excitation linear prediction MELP, comprising the following steps: an encoding end frames a speech signal with 30ms as the sub frame length, groups two adjacent sub frames into a super frame, and carries out multi-frame joint quantization encoding on extracted speech feature parameters LSF, Pitch, VP, Fsmag and G; the encoding end uses the remaining bits in the frame structure to carry out error control encoding on important speech feature parameters, and finally, a binary bit stream is formed and transmitted; and a decoding end parses out the quantitative index values of the speech feature parameters from the received bit stream, works out the initial values of the speech feature parameters through quantitative indexing, then, carries out speech feature parameter integrity reconstruction, generates an excitation signal based on the reconstructed speech feature parameters, and obtains a synthetic speech signal through adaptive spectral enhancement, a synthesis filter, gain control, and distribute pulse filtering. By adopting the method, the speech encoding rate can be effectively reduced. The speech synthesized by a receiving end has high clarity and intelligibility.

Description

technical field [0001] The invention belongs to the field of voice communication, and in particular relates to a MELP-based low-rate voice codec, which is widely used in voice application services such as secure communication, satellite mobile communication, deep-sea communication and voice mailbox. Background technique [0002] Voice is the main carrier for human beings to transmit information to each other. It is the most direct, convenient and effective way of communication in modern communication, and it is also the main means of human-computer interaction in the future. With the development of communication technology, the proportion of non-voice information such as images and data in information transmission is increasing, but the effective transmission of voice information is still one of the necessary functions of many communication systems. [0003] Although the introduction of optical fiber transmission technology in recent years has provided huge transmission capa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L19/087

Inventor 李强付余涛舒勤军陈丁当陈浩朱兰明艳夏绪玖

Owner CHONGQING UNIV OF POSTS & TELECOMM

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A 1.2kb/s low-rate speech codec method based on mixed excitation linear prediction melp

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology