Hearing aid speech enhancement method based on depth domain adaptive network

An adaptive network and speech enhancement technology, applied in hearing aids, hearing aid signal processing, speech analysis, etc., can solve the problems of limited applicable scenarios and limited migration effects, and achieve ingenious and novel methods, good application prospects, and improved speech enhancement performance. Effect

Active Publication Date: 2020-11-20
NANJING INST OF TECH
View PDF8 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This patent realizes the migration of unknown noise types and unknown signal-to-noise ratios through the migration learning algorithm, but only realizes the migration of one type of noise to another noise, and its applicable scenarios have limitations
Secondly, the algorithm has limited transfer effect when the noise type and signal-to-noise ratio do not match.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hearing aid speech enhancement method based on depth domain adaptive network
  • Hearing aid speech enhancement method based on depth domain adaptive network
  • Hearing aid speech enhancement method based on depth domain adaptive network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The present invention will be further described and explained below in conjunction with the accompanying drawings.

[0047] as attached figure 1 Shown, a kind of hearing aid speech enhancement method based on deep domain adaptive network of the present invention comprises the following steps:

[0048] Step (A), establish training input samples: select multiple sets of data to construct sample sets, each set of data includes noisy speech and clean speech, extract frame-level logarithmic power spectrum feature LPS from noisy speech and clean speech respectively, and set All frame-level logarithmic power spectrum features LPS are used as input samples, which are used as input features and training targets of deep neural networks.

[0049] Step (B), building a baseline speech enhancement model: Construct a deep learning model based on an encoder-decoder structure in a deep neural network as a baseline speech enhancement model, where the encoder-decoder structure is a conca...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a hearing aid speech enhancement method based on a depth domain adaptive network. The method comprises the steps of extracting frame-level logarithm power spectrum features from noisy speech and clean speech respectively; constructing a deep learning model based on an encoder-decoder structure as a baseline speech enhancement model; on the basis of the baseline speech enhancement model, constructing a transfer learning speech enhancement model based on a depth domain adaptive network, wherein the transfer learning speech enhancement model introduces a domain adaptationlayer and a relative discriminator between a feature encoder and a reconstruction decoder; training the transfer learning speech enhancement model by using the domain adversarial loss; and in an enhancement stage, according to the trained depth domain adaptive transfer learning speech enhancement model, inputting frame-level LPS features of noisy speech in a target domain, and reconstructing an enhanced speech waveform. According to the method, the feature encoder is stimulated to generate domain invariance features through domain adversarial training, so that the adaptability of the speech enhancement model to unseen noise is improved.

Description

technical field [0001] The invention relates to the technical field of speech enhancement, in particular to a hearing aid speech enhancement method based on a deep domain adaptive network. Background technique [0002] In a complex environment, the target sound is usually submerged in the noise, and the results of the sound spectrum analysis are seriously affected, which makes the performance of the adaptive frequency reduction algorithm drop sharply. At the same time, some hearing-impaired characteristics of hearing-impaired patients, such as high hearing threshold, difficulty in identifying short-term features, and degenerated auditory periphery, make speech understanding in complex scenes a common problem and difficult problem that affects the usage rate. [0003] Classical single-channel noise suppressors are based on statistical signal processing methods, which focus on how to effectively estimate the noise spectrum from noisy speech to suppress it. Typical algorithms ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0216G10L21/0232G10L25/03G10L25/30G06N3/04G06N3/08H04R25/00
CPCG10L21/0216G10L21/0232G10L25/03G10L25/30G06N3/049G06N3/08G06N3/084H04R25/507H04R2225/43G10L2021/065G06N3/044G06N3/045
Inventor 王青云梁瑞宇程佳鸣孙世若邹采荣唐闺臣谢跃包永强
Owner NANJING INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products