Low-rate voice coding method based on depth self-coding machine

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A voice coding, low-rate technology, applied in speech analysis, instruments, etc., can solve the problems of lower coding rate, high voice coding quality, etc., and achieve the effect of digitization and compression coding

Pending Publication Date: 2020-06-05

NAT UNIV OF DEFENSE TECH

View PDF3 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] In view of the deficiencies in the prior art, the purpose of the present invention is to provide a novel low-rate speech encoding method based on a deep autoencoder to solve the technical problem that it is difficult to further reduce the encoding rate while maintaining a higher speech encoding quality in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0047] This embodiment provides a low-rate speech coding method based on a deep autoencoder, as shown in Figure 1, including:

[0048] Step 1, input the original speech signal s(n), and process the original speech signal s(n) into frames to obtain each frame of speech signal s m ; Wherein, n represents the time subscript, 0≤n≤L-1, L represents the frame length, m represents the subscript of each frame of speech signal, m=1,2,...,M, M represents the total number of speech frame number;

[0049] Wherein, by formula (1), the original speech signal s(n) is subjected to frame processing to obtain each frame of speech signal s m ;

[0050] the s m (n)=s m (mR+n)ω(n) (1)

[0051] In formula (1), R represents the frame shift, and ω(n) represents the Hamming window.

[0052] In this embodiment, the length of each frame of voice signal is 20-30 ms, and the length of each frame of voice signal s m Can be expressed as: s m =[s m (0),s m (1),...,s m (L-1)] T ;

[0053] Step 2,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a low-rate voice coding method based on a depth self-coding machine. The method comprises the steps: 1, inputting an original voice signal s (n), carrying out the framing of theoriginal voice signal s (n), and obtaining each frame of voice signal sm; step 2, taking a logarithm amplitude spectrum ym of each frame of voice signal sm after framing processing; 3, constructing adeep neural network model, and training the constructed deep neural network model; and step 4, inputting the logarithm amplitude spectrum ym of each frame of voice signal into the trained deep neuralnetwork model, step 5, obtaining each frame of reconstructed voice signal; and carrying out overlapping operation on each reconstructed frame of voice signal to obtain voice codes, and outputting thevoice codes. According to the invention, a data driving mode is adopted to automatically learn from the voice signals to obtain feature parameters capable of carrying out quantization coding, and digitization and compression coding of the voice signals are realized by carrying out efficient quantization on the feature parameters.

Description

technical field [0001] The invention belongs to the technical field of low-rate vocoders in speech coding, and in particular relates to a low-rate speech coding method based on a deep autoencoder. Background technique [0002] Voice communication is the most natural and convenient means for human beings to communicate with each other. With the rapid development of the mobile Internet, although the volume of data communication business has surpassed the traditional voice communication business, the basic position of voice communication will not change for a long time. Speech coding, which aims at efficiently compressing speech signals by means of digital signal processing to meet the needs of limited communication bandwidth, is one of the core and key technologies of speech communication. With years of in-depth research, many successful speech coding models have been proposed and a series of speech compression coding standards have been formulated, such as the ITU-T G.711 st...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L19/08G10L25/30G10L25/45

CPCG10L19/08G10L25/30G10L25/45

Inventor闵刚张长青解云虹谭薇周怀军吴广恩刘向阳

OwnerNAT UNIV OF DEFENSE TECH

Low-rate voice coding method based on depth self-coding machine

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology