Non-reference voice quality objective assessment method based on deep learning voice enhancement

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech enhancement and speech quality technology, applied in speech analysis, instruments, etc., can solve the problem of low correlation of subjective evaluation scores, and achieve the effects of high correlation, small root mean square error, and high adaptability

Active Publication Date: 2017-11-17

BEIJING INSTITUTE OF TECHNOLOGYGY

View PDF5 Cites 25 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0008] The purpose of the present invention is to solve the problem that the correlation between the results of the existing non-reference speech objective evaluation algorithm and the subjective evaluation score is too low, to provide a non-reference speech quality objective evaluation method based on deep learning speech enhancement, and to improve the accuracy of the objective evaluation method. sex

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0048] In this example, narrow-band pure speech signals and their distorted signals under 20 kinds of distortion conditions are used as the training data of the speech enhancement model. These distorted voices are scored subjectively by using the MOS method, and used as the training set and test set for the final mapping link.

[0049] The method steps of the present invention are as figure 1 As shown, the details are as follows:

[0050] Step a, the distorted speech signal to be tested is trained based on the speech enhancement model of deep belief network (DBN), obtains the signal after the enhancement, corresponding figure 1 in (1);

[0051] In this step, it is divided into two phases, training phase and enhancement phase. In the training phase, the logarithmic power spectrum information and phase information are first extracted from the pure speech signal and the distorted speech signal, and then the distorted signal parameters are input into the deep belief network. Th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a non-reference voice quality objective assessment method based on deep learning voice enhancement, belonging to the voice quality assessment technology field. The non-reference voice quality objective assessment method based on deep learning voice enhancement comprises steps of enabling voice to be tested to go through a trained voice enhancement model based on a deep belief network to obtain an enhanced signal, extracting Mel-frequency cepstral coefficients of signals before and after enhancement respectively, obtaining a difference of the two coefficients, taking the parameter as an input and mapping the parameter as a final objective score through a BP neural network model of a second layer and thus realizing an objective assessment of the non-reference voice quality. Compared with a traditional voice quality assessment model, a correlation degree between the non-reference voice quality objective assessment method of the invention and the object quality score is good and a mean square error is smaller.

Description

technical field [0001] The invention relates to a speech enhancement technology based on a deep belief network and an objective evaluation index mapping technology of an artificial neural network, in particular to a no-reference speech quality objective evaluation method based on a deep learning speech enhancement, and belongs to the field of speech quality evaluation technologies. Background technique [0002] With the development of science and technology, voice communication has become an indispensable part of communication. From telephony to VoIP, voice communication permeates many aspects of our lives. During the transmission of voice communication, the voice compression of the channel, modulation and demodulation, and noise interference will greatly reduce the voice quality, thus reducing the human auditory experience. In order to be able to design a communication system with good transmission performance, judging the performance of the voice communication system has ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L25/24G10L25/30G10L25/60

CPCG10L25/24G10L25/30G10L25/60

Inventor 王晶单亚慧孟柳晨谢湘费泽松

Owner BEIJING INSTITUTE OF TECHNOLOGYGY

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Non-reference voice quality objective assessment method based on deep learning voice enhancement

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology