Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Broadband speech spectrum inclination characteristic parameter reconstruction method for speech intelligibility enhancement

A feature parameter and inclination technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as the lack of acoustic features of narrowband speech signals, the inability of speech intelligibility enhancement systems to obtain spectral inclination information, and the decline in enhancement effects. Improve scalability and usability

Active Publication Date: 2019-01-15
WUHAN UNIV
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] The present invention provides a reconstruction method for characteristic parameters of wideband speech spectrum inclination for speech intelligibility enhancement, which solves the problem that due to the lack of acoustic features of narrowband speech signals, the directly calculated spectral gradient parameters have larger errors than wideband speech signals. The problem that the speech clarity enhancement system cannot obtain accurate spectrum slope information and the enhancement effect is severely reduced

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Broadband speech spectrum inclination characteristic parameter reconstruction method for speech intelligibility enhancement
  • Broadband speech spectrum inclination characteristic parameter reconstruction method for speech intelligibility enhancement
  • Broadband speech spectrum inclination characteristic parameter reconstruction method for speech intelligibility enhancement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The following describes the embodiments of the present invention in further detail in conjunction with the accompanying drawings in the embodiments of the present invention. It is obvious that the embodiments described herein are only some, not all, embodiments of the present invention. Any embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative work are within the protection scope of the present application.

[0043] The invention is suitable for a speech clarity enhancement system in a real-time speech communication system, and the speech clarity enhancement system improves the speech clarity based on the vocalization mechanism (Lombard effect) against the speaker's noise and the natural speech generation model. The invention provides a method for recovering speech feature parameters in a speech clarity enhancement system, that is, "a method for reconstructing wideband speech spectrum inclination parameters fr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a broadband speech spectrum inclination characteristic parameter reconstruction method for speech intelligibility enhancement, comprising a training stage and a use stage of a spectrum inclination reconstruction network based on a cyclic neural network, wherein a training stage establishes a speech data set, and pretreats speech data in the data set; Input the preprocessed narrowband speech data, perform short-time Fourier transform to obtain narrowband speech spectrum, logarithmize the spectrum information to obtain logarithmic amplitude spectrum; Input the preprocessedwideband speech data, extract the parameters of all-pole model of wideband speech signal spectral inclination, and transform them into linear spectral pair parameters. The spectral tilt reconstruction network is trained and used to reconstruct the all-pole model parameters of the spectral tilt of wideband speech. The invention reconstructs the broadband speech signal spectrum inclination parameter according to the narrowband speech signal, and is suitable for all speech intelligibility enhancement systems based on the spectrum inclination characteristic, and can adapt to multilingual and multimodal speech signals.

Description

technical field [0001] The present invention provides a method for rebuilding characteristic parameters of wideband speech spectrum gradient for speech clarity enhancement, relates to the field of speech signal processing and communication technology, is applicable to all speech clarity enhancement systems based on spectrum gradient features, and can Adapt to multilingual and multimodal voice signals. Background technique [0002] Since the 21st century, mobile communication technology has developed rapidly, and mobile communication devices such as mobile phones have become popular rapidly. With the convenience brought by mobile phones, people can use mobile communication devices for real-time voice communication anytime and anywhere; with this convenience, people inevitably talk in diverse noisy environments such as stations, restaurants, factories, etc., and noise in noisy environments Seriously degraded voice call quality. [0003] The voice communication process can be...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/06G10L21/02G10L25/30
CPCG10L15/063G10L21/02G10L25/30
Inventor 胡瑞敏李罡张锐王晓晨
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products