Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice signal reestablishment method based on deep autoencoder

A technology of deep encoder and autoencoder, which is applied in speech analysis, instrumentation, etc., can solve problems such as quantization errors, and achieve the effect of speech evaluation parameter optimization

Active Publication Date: 2019-11-22
ZHEJIANG SHUREN COLLEGE ZHEJIANG SHUREN UNIV
View PDF14 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method directly quantizes the output of the encoding layer to 0 or 1, thereby realizing the binarization of the encoding layer. However, the output distribution of the encoding layer is uncertain during the training process. When the output of the encoding layer is approximately 0-1 distribution , can achieve a better quantization effect, but when the output of the coding layer is not 0-1 distribution, it will lead to a large quantization error

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice signal reestablishment method based on deep autoencoder
  • Voice signal reestablishment method based on deep autoencoder
  • Voice signal reestablishment method based on deep autoencoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] The technical solutions provided by the present invention will be further described below in conjunction with the accompanying drawings.

[0054] see figure 1 , shown is the flow chart of the speech signal reconstruction method based on the depth autoencoder provided by the present invention, comprising the following steps:

[0055] Step S101: Obtain encoded data and input it into a decoding unit;

[0056] Step S102: the decoding unit processes the encoded data through the deep decoder neural network and outputs the decoded data;

[0057] Step S103: Denormalize the decoded data;

[0058] Step S104: performing inverse discrete Fourier transform on the data processed in step S103;

[0059]Step S105: Obtain a reconstructed voice signal by splicing and adding the data processed in step S104;

[0060] see figure 2 , shown as a flow chart of speech signal encoding in the present invention, the encoded data is obtained through the following steps:

[0061] Step S201: Fr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voice signal reestablishment method based on a deep autoencoder. The method comprises the following steps of S101, obtaining encoded data and inputting the encoded data intoa decoding unit; S102, processing the encoded data by the decoding unit through utilization of a deep decoder neural network, and outputting decoded data; S103, carrying out denormalization on the decoded data; S104, carrying out inverse discrete Fourier transform on the data processed by the S103; S, 105, carrying out overlapping-addition on the data processed by the S104, thereby obtaining reestablished voice signals, wherein the coded data is obtained through utilization of the following steps of S201, framing original voice signals; S202, carrying out discrete Fourier transform on the framed data; S203, carrying out normalization on the data processed by the S202; S204, inputting the normalized data into the coding unit; and S205, processing the data normalized by the S203 by an encoding unit through utilization of a deep encoder neural network, wherein obtaining the coded data.

Description

technical field [0001] The invention relates to the technical field of speech signal processing, in particular to a speech signal reconstruction method based on a depth autoencoder. Background technique [0002] In the speech signal transmission technology, the speech coding technology at the encoding end and the speech signal reconstruction at the decoding end are the key technologies. In the prior art, speech coding usually adopts codebook-based vector quantization technology, that is, a pre-trained codebook is stored at both the coding end and the decoding end, and speech coding and decoding is to search for an index according to the codebook or obtain codes according to the index the process of. However, when the right amount of dimensionality is high or the codebook is large, traditional vector quantization techniques will fail. For example, to perform 20-bit quantization on 100-dimensional data, 1,048,576 100-dimensional codebooks are required, and the training of su...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L19/035G10L19/16G10L25/30
CPCG10L19/035G10L19/16G10L25/30
Inventor 吴建锋秦会斌秦宏帅
Owner ZHEJIANG SHUREN COLLEGE ZHEJIANG SHUREN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products