Method and system for training neural network

A neural network and loss function technology, applied in the field of signal processing, can solve problems that cannot be fully reflected in the reconstruction output, performance loss, and metric mismatch

Pending Publication Date: 2020-07-21
SAMSUNG ELECTRONICS CO LTD
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] However, this approach has two key issues: metric mismatch and spectral mismatch
Spectral masking estimation typically minimizes the mean square error (MSE) between the clean and estimated spectral amplitudes, which is critical for signal distortion ratio (SDR) or perceived speech due to metric mismatch. Perceptual evaluation of speech quality (PESQ) maximization is not optimal
For example, it is often observed that SDR or PESQ often degrade despite the reduction of the spectral mean square error
The second spectral mismatch problem arises from the estimation of the spectral domain
As a result, the denoised spectral amplitudes are not fully reflected in the reconstructed output, which can result in substantial performance loss

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for training neural network
  • Method and system for training neural network
  • Method and system for training neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] Hereinafter, embodiments of the present disclosure are explained in detail with reference to the accompanying drawings. It should be noted that the same elements will be indicated by the same reference numerals even though they are shown in different drawings. In the following description, specific details such as detailed configuration and components are provided only to help a comprehensive understanding of the embodiments of the present disclosure. Accordingly, it will be apparent to those skilled in the art that various changes and modifications of the embodiments described herein can be made without departing from the scope of the present disclosure. Also, descriptions of well-known functions and constructions are omitted for clarity and conciseness. Terms described below are terms defined in consideration of functions in the present disclosure, and may vary according to users, users' intentions, or habits. Therefore, the definitions of these terms should be dete...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Disclosed are a method and a system for training a neural network. According to one embodiment, the method includes receiving a noisy signal, generating a denoised output signal, determining a signaldistortion ratio (SDR) loss function based on the denoised output signal, determining a perceptual evaluation of speech quality (PESQ) loss function based on the denoised output signal, and optimizingan overall loss function based on the PESQ loss function and the SDR loss function.

Description

[0001] [priority] [0002] This application is based on, and claims priority to, U.S. Provisional Patent Application, filed January 11, 2019, in the U.S. Patent and Trademark Office and assigned Serial No. 62 / 791,421, said U.S. Provisional Patent Application The entire contents of the application are incorporated by reference. technical field [0003] The present disclosure generally relates to signal processing. In particular, the present disclosure relates to Signal-to-Distortion Ratio (SDR) and Perceptual Speech Quality Estimation (PESQ) optimization. Background technique [0004] Recently, supervised learning based on deep neural networks has achieved substantial improvements in speech enhancement. The main difference from typical statistical approaches is that no prior assumptions about the signal model are necessary. For example, Wiener filters usually assume a Gaussian distribution of speech or noise models, which is often incorrect in real environments. In contra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08
CPCG06N3/082G10L25/60G10L25/30G10L21/0208G06N3/063G06N3/08G10L15/063
Inventor 金宰永李正元莫斯塔法·伊尔-哈米
Owner SAMSUNG ELECTRONICS CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products