Voice signal reestablishment method based on deep autoencoder

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of deep encoder and autoencoder, which is applied in speech analysis, instrumentation, etc., can solve problems such as quantization errors, and achieve the effect of speech evaluation parameter optimization

Active Publication Date: 2019-11-22

ZHEJIANG SHUREN COLLEGE ZHEJIANG SHUREN UNIV

View PDF14 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

This method directly quantizes the output of the encoding layer to 0 or 1, thereby realizing the binarization of the encoding layer. However, the output distribution of the encoding layer is uncertain during the training process. When the output of the encoding layer is approximately 0-1 distribution , can achieve a better quantization effect, but when the output of the coding layer is not 0-1 distribution, it will lead to a large quantization error

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0053] The technical solutions provided by the present invention will be further described below in conjunction with the accompanying drawings.

[0054] see figure 1 , shown is the flow chart of the speech signal reconstruction method based on the depth autoencoder provided by the present invention, comprising the following steps:

[0055] Step S101: Obtain encoded data and input it into a decoding unit;

[0056] Step S102: the decoding unit processes the encoded data through the deep decoder neural network and outputs the decoded data;

[0057] Step S103: Denormalize the decoded data;

[0058] Step S104: performing inverse discrete Fourier transform on the data processed in step S103;

[0059]Step S105: Obtain a reconstructed voice signal by splicing and adding the data processed in step S104;

[0060] see figure 2 , shown as a flow chart of speech signal encoding in the present invention, the encoded data is obtained through the following steps:

[0061] Step S201: Fr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice signal reestablishment method based on a deep autoencoder. The method comprises the following steps of S101, obtaining encoded data and inputting the encoded data intoa decoding unit; S102, processing the encoded data by the decoding unit through utilization of a deep decoder neural network, and outputting decoded data; S103, carrying out denormalization on the decoded data; S104, carrying out inverse discrete Fourier transform on the data processed by the S103; S, 105, carrying out overlapping-addition on the data processed by the S104, thereby obtaining reestablished voice signals, wherein the coded data is obtained through utilization of the following steps of S201, framing original voice signals; S202, carrying out discrete Fourier transform on the framed data; S203, carrying out normalization on the data processed by the S202; S204, inputting the normalized data into the coding unit; and S205, processing the data normalized by the S203 by an encoding unit through utilization of a deep encoder neural network, wherein obtaining the coded data.

Description

technical field [0001] The invention relates to the technical field of speech signal processing, in particular to a speech signal reconstruction method based on a depth autoencoder. Background technique [0002] In the speech signal transmission technology, the speech coding technology at the encoding end and the speech signal reconstruction at the decoding end are the key technologies. In the prior art, speech coding usually adopts codebook-based vector quantization technology, that is, a pre-trained codebook is stored at both the coding end and the decoding end, and speech coding and decoding is to search for an index according to the codebook or obtain codes according to the index the process of. However, when the right amount of dimensionality is high or the codebook is large, traditional vector quantization techniques will fail. For example, to perform 20-bit quantization on 100-dimensional data, 1,048,576 100-dimensional codebooks are required, and the training of su...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L19/035G10L19/16G10L25/30

CPCG10L19/035G10L19/16G10L25/30

Inventor 吴建锋秦会斌秦宏帅

Owner ZHEJIANG SHUREN COLLEGE ZHEJIANG SHUREN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice signal reestablishment method based on deep autoencoder

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology