Unlock instant, AI-driven research and patent intelligence for your innovation.

A Speech Clarity Enhancement Method Based on Acoustic Feature Transformation

A technology of acoustic characteristics and clarity, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as the decline of auditory clarity and perceptual intelligibility, the failure to consider the naturalness of speech, and the lack of naturalness

Active Publication Date: 2021-06-15
WUHAN UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0012] The present invention provides a speech clarity enhancement method based on acoustic feature conversion, which solves the problem that when listening to speech in a noisy environment (such as: answering a phone call, listening to an interactive speech of a smart speaker), the original clear speech signal is masked by the environmental noise. Problems that cause loss of aural clarity and perceived intelligibility
Because the traditional digital signal processing method adopts the rigid transfer of spectral energy between the voice frequency bands, without considering the naturalness of the voice, although the clarity of the voice is improved, the naturalness is seriously lacking.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Speech Clarity Enhancement Method Based on Acoustic Feature Transformation
  • A Speech Clarity Enhancement Method Based on Acoustic Feature Transformation
  • A Speech Clarity Enhancement Method Based on Acoustic Feature Transformation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The following content will further describe the embodiments of the present invention in detail in conjunction with the accompanying drawings in the embodiments of the present invention. It should be understood that the embodiments described here are only some embodiments of the present invention, not all embodiments, and are not intended to limit this invention. Any embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative work are within the scope of protection of the application of the present invention.

[0038] The present invention provides a speech clarity enhancement method suitable for any real-time voice communication terminal and real-time voice interaction equipment, that is, "a speech clarity enhancement method based on acoustic feature conversion", which is used to solve the problem of listening to speech in a noisy environment. When listening to a phone (such as answering a phone call or listening t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech clarity enhancement method based on acoustic feature conversion. The method includes a training stage and a use stage; in the training stage, combined with the speaker's noise-fighting vocalization mechanism, the ordinary speech signal and the anti-noise speech under the Lombard effect are used. The signal is used as a data set to train a mapping model with the ability to convert acoustic features. The mapping features include the frequency spectrum slope, fundamental frequency coefficient and energy coefficient of the speech frame. The long short-term memory network is used to learn the feature mapping ability of the frequency spectrum slope, using Bayesian The Gaussian mixture model learns the feature mapping ability of the fundamental frequency coefficient and the energy coefficient; in the use stage, based on the trained feature mapping model and the corresponding pre-processing and post-processing technology, the decoded voice signal of the communication device or interactive device is frame by frame It is converted into a speech signal with anti-noise characteristics to improve the auditory clarity and perceptual intelligibility of the speech signal when it is played in a noisy environment.

Description

technical field [0001] The present invention provides a voice clarity enhancement method based on acoustic feature conversion, especially relates to the fields of artificial intelligence, machine learning, voice signal processing and network communication, and is applicable to systems related to voice communication such as real-time voice communication systems and human-computer interaction systems with equipment. Background technique [0002] With the rapid development of mobile communication technology, relying on powerful mobile communication networks and high-performance call terminals, mobile voice communication has gradually replaced fixed-line communication as the mainstream voice communication method. Relying on the convenience brought by mobile communication, people can carry out voice communication anytime and anywhere. But what follows is that the caller inevitably performs voice communication under the conditions of being surrounded by noisy environments such as...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L21/02G10L15/06G10L15/20G10L25/18G10L25/30
CPCG10L15/063G10L15/20G10L21/02G10L25/18G10L25/30
Inventor 胡瑞敏李罡张锐柯善发王晓晨
Owner WUHAN UNIV