60 results about how to "Improve speech enhancement performance" patented technology

Single-channel speech enhancement method based on joint dictionary learning and sparse representation

Active | CN111508518A | Quality improvement | Increased time-frequency characterization capabilities | Speech analysis | Complex mathematical operations | Dictionary learning | Frequency spectrum
The invention provides a single-channel speech enhancement method based on joint dictionary learning and sparse representation. A dual-tree complex wavelet transform is applied to the clean speech to obtain a group of sub-band signals, and a short-time Fourier transform is applied to each sub-band signal to obtain its time-frequency spectrum. A joint dictionary of the clean speech is learned from the amplitude, the real part, and the imaginary part of the sub-band signals together with the sparsity of speech, and a joint dictionary of the clean noise is learned in the same way. The noisy speech undergoes the same dual-tree complex wavelet transform and short-time Fourier transform to obtain the time-frequency spectrum of each sub-band signal. The phase and the signs of the real and imaginary parts are retained, while the amplitude and the absolute values of the real and imaginary parts are extracted and projected onto the joint dictionaries of clean speech and clean noise to obtain the sparse representation coefficients of the speech and the noise. The final estimate of each sub-band speech time-frequency spectrum is obtained from these coefficients, the time-frequency spectrum phase, the signs of the real and imaginary parts, the mask, the weights, and the like. The enhanced speech signal is then obtained by an inverse short-time Fourier transform and an inverse dual-tree complex wavelet transform, so that the speech enhancement capability is improved.
Owner: UNIV OF SCI & TECH OF CHINA
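
The sign-retention step in the abstract above can be illustrated for a single time-frequency bin. This is only a minimal sketch: the helper names and the fixed `gain` are hypothetical, and the gain merely stands in for the estimate that the patent derives from the sparse representation coefficients, mask, and weights, which are not reproduced here.

```python
import math


def split_sign_magnitude(z):
    """Split a complex bin into (|Re|, |Im|, sign of Re, sign of Im)."""
    sign_re = math.copysign(1.0, z.real)
    sign_im = math.copysign(1.0, z.imag)
    return abs(z.real), abs(z.imag), sign_re, sign_im


def recombine(abs_re, abs_im, sign_re, sign_im):
    """Rebuild a complex bin from processed absolute values and the retained signs."""
    return complex(sign_re * abs_re, sign_im * abs_im)


noisy_bin = complex(-0.8, 0.6)
abs_re, abs_im, s_re, s_im = split_sign_magnitude(noisy_bin)

# Assumption: a fixed gain stands in for the dictionary-based estimate.
gain = 0.5
enhanced_bin = recombine(gain * abs_re, gain * abs_im, s_re, s_im)
```

Only the non-negative absolute values are processed; the retained signs put the estimate back in the original quadrant.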

Speech enhancement method based on time-frequency domain joint loss function

The invention provides a speech enhancement method based on a time-frequency domain joint loss function. The method comprises the following steps: integrating a clean speech data set and a noise data set from an open source data set into a noisy speech data set, converting the noisy speech data set into an amplitude spectrum, a phase spectrum and waveform data through a preprocessing operation, and constructing a training set; constructing a CNN model, taking the noisy speech amplitude spectrum as input and the clean speech amplitude spectrum as the label, and training the model; reconstructing a waveform from the amplitude spectrum estimate output by the model and the noisy speech phase spectrum through an inverse short-time Fourier transform to obtain the time domain waveform of the estimated speech; calculating a frequency domain loss from the clean speech amplitude spectrum and the amplitude spectrum estimate; calculating a time domain loss from the clean speech time domain waveform and the estimated speech time domain waveform; and constructing a time-frequency domain joint loss from the frequency domain loss and the time domain loss to guide the weight optimization of the CNN model. This reduces the mismatch between the estimated amplitude spectrum and the phase spectrum and improves the speech enhancement effect.
Owner: WUHAN UNIV
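
The joint loss described above might be sketched as follows. The abstract does not state the distance measure or how the two losses are combined, so the mean squared error and the weighting factor `alpha` are assumptions, and the function names are hypothetical. Spectra and waveforms are passed as flat lists of floats for simplicity.

```python
def mse(reference, estimate):
    """Mean squared error between two equal-length sequences."""
    return sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)


def joint_loss(clean_mag, est_mag, clean_wave, est_wave, alpha=0.5):
    """Weighted sum of a frequency domain loss and a time domain loss.

    Assumption: both losses are MSE and are mixed by `alpha`;
    the patent abstract does not specify either choice.
    """
    freq_loss = mse(clean_mag, est_mag)    # amplitude spectrum vs. its estimate
    time_loss = mse(clean_wave, est_wave)  # clean waveform vs. reconstructed waveform
    return alpha * freq_loss + (1.0 - alpha) * time_loss
```

In a training setup the estimated waveform would come from the inverse short-time Fourier transform of the estimated amplitude and the noisy phase, so a phase mismatch shows up in the time domain term even when the amplitude term is small.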

Voice enhancement method and device based on multi-frame spectrum and non-negative matrix decomposition

The invention provides a voice enhancement method and device based on multi-frame spectrum and non-negative matrix decomposition, and belongs to the field of voice enhancement and non-negative matrix decomposition. The method comprises the following steps: the pure voice, the noise and the noisy voice are pre-processed to obtain short-time spectra, which are converted into multi-frame spectra; the multi-frame spectra of the noise and of the pure voice are each expressed as the product of a base matrix and a coefficient matrix, and the base matrix of the noise and the base matrix of the pure voice are solved; the two base matrices are combined to form the base matrix of the multi-frame spectrum of the noisy voice, the multi-frame spectrum of the noisy voice is expressed as the product of this base matrix and a coefficient matrix, the coefficient matrix is solved, and initial estimates of the multi-frame spectra of the noise and of the enhanced voice are obtained; the multi-frame spectrum of the enhanced voice is then obtained through Wiener filtering and transformed into a time domain signal to give the enhanced voice. The advantages of the method are that the distinctive voice information is preserved, the voice is better restored, and the voice enhancement effect is improved.
Owner: 北京华控智加科技有限公司
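
The final Wiener filtering step can be sketched for one column of spectral values. This assumes the initial speech and noise estimates are already available from the decomposition and uses the common power-ratio form of the gain. The abstract does not give the exact filter, so the formula and the name `wiener_enhance` are illustrative only.

```python
def wiener_enhance(noisy_mag, speech_est, noise_est, eps=1e-12):
    """Apply a Wiener-style gain to noisy magnitudes.

    Assumption: gain = S^2 / (S^2 + N^2), where S and N are the initial
    speech and noise magnitude estimates; eps avoids division by zero.
    """
    enhanced = []
    for y, s, n in zip(noisy_mag, speech_est, noise_est):
        gain = (s * s) / (s * s + n * n + eps)
        enhanced.append(gain * y)
    return enhanced


# Equal speech and noise estimates halve the bin; a noise-free bin passes through.
result = wiener_enhance([2.0, 3.0], [1.0, 1.0], [1.0, 0.0])
```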

Speech enhancement model training method and device and speech enhancement method and device

The invention relates to the technical field of speech processing, and provides a speech enhancement model training method and device and a speech enhancement method and device. The training method of the speech enhancement model comprises the following steps: acquiring a speech training set, wherein the voice training set comprises noisy voice samples and pure voice samples; acquiring an amplitude spectrum corresponding to the noisy voice sample, inputting the amplitude spectrum into the generation network, and acquiring an enhanced voice amplitude spectrum; acquiring an amplitude spectrum corresponding to the pure voice sample and an enhanced voice amplitude spectrum, and inputting the amplitude spectrum and the enhanced voice amplitude spectrum into a discrimination network to acquire a discrimination result; and adjusting network parameters of the generation network and the discrimination network according to the enhanced voice amplitude spectrum, the amplitude spectrum corresponding to the pure voice sample, the discrimination result and the optimization target, and generating a voice enhancement model. By adopting the method, the performance of the speech enhancement model can be improved, and the speech enhancement effect is further improved.
Owner: SHANGHAI WINGTECH INFORMATION TECH CO LTD

Speech enhancement method and device thereof, equipment and medium

The invention discloses a speech enhancement method and a device thereof, equipment and a medium. The method comprises the following steps: acquiring a target noisy voice signal and performing short-time Fourier transform on the target noisy voice signal to obtain a target frequency domain signal corresponding to the target noisy voice signal; inputting the target feature of the current signal frame of the target frequency domain signal into an encoder in a voice noise suppression model obtained by pre-training to obtain an encoding feature corresponding to the current signal frame of the target frequency domain signal; inputting the coding feature and a decoding feature corresponding to a previous signal frame of a current signal frame of a target frequency domain signal output by a decoder in a voice noise suppression model into the decoder to obtain a decoding feature corresponding to the current signal frame of the target frequency domain signal; and performing signal reconstruction on the decoding features corresponding to each signal frame of the target frequency domain signal to obtain a target enhanced voice signal corresponding to the target noisy voice signal. According to the technical scheme, the speech enhancement effect can be improved, and calculation time and calculation cost are reduced.
Owner: EVERSEC BEIJING TECH
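
The frame-by-frame data flow in the abstract above, where the decoder receives the current encoding feature together with its own output for the previous frame, can be sketched as a simple loop. The toy encoder and decoder below are hypothetical stand-ins for the pre-trained voice noise suppression model, and the zero initial state is an assumption.

```python
def enhance_stream(frames, encoder, decoder, initial_state):
    """Decode each frame from its encoding and the previous decoding feature."""
    prev_dec = initial_state
    decoded = []
    for frame in frames:
        enc = encoder(frame)              # encoding feature of the current frame
        prev_dec = decoder(enc, prev_dec) # decoding feature, fed back on the next frame
        decoded.append(prev_dec)
    return decoded


# Hypothetical toy stand-ins, not the patented networks.
def toy_encoder(frame):
    return [0.5 * x for x in frame]


def toy_decoder(enc, prev_dec):
    return [e + 0.5 * p for e, p in zip(enc, prev_dec)]


frames = [[2.0, 4.0], [2.0, 4.0]]
out = enhance_stream(frames, toy_encoder, toy_decoder, [0.0, 0.0])
```

Because each step needs only the current frame and one stored decoding feature, the loop can run as frames arrive instead of waiting for the whole utterance.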

Environment adaptive voice enhancement algorithm based on attention-driven circulating convolution network

The invention discloses an environment adaptive voice enhancement algorithm based on an attention-driven circulating convolution network. The algorithm comprises the following steps: 1, a voice enhancement task database is selected and the input data are prepared; 2, amplitude information and environment information of the voice are extracted, where the environment information is extracted with the weighted prediction error (WPE) method and the amplitude information is the voice spectrum extracted through a Fourier transform; 3, a deep model is constructed and trained; and 4, the voice is reconstructed, specifically, the voice amplitude predicted in step 3 is converted into a voice waveform. Because the algorithm takes the environment information of the voice into account, its adaptability and robustness across different environments are improved. To retain the real voice signal, an attention mechanism is fused into the network to form the attention-driven circulating convolution network, which describes the time-sequence context information of the voice more precisely and effectively improves voice enhancement performance.
Owner: TIANJIN UNIV

Audio noise reduction method and device, equipment and storage medium

The invention relates to artificial intelligence, and provides an audio noise reduction method and device, equipment and a storage medium. The method comprises the following steps: pre-processing noisy audio to obtain frequency spectrum information; processing the frequency spectrum information with a frequency domain signal processing network to obtain frequency spectrum mask features; acquiring time-frequency features from the frequency spectrum information and the frequency spectrum mask features; processing the time-frequency features with a time domain signal processing network to obtain time-frequency mask features; generating a predicted audio from the time-frequency features and the time-frequency mask features; adjusting the network parameters of a preset learner based on the predicted audio and the pure audio to obtain a noise reduction model; and acquiring a request audio and performing noise reduction on it with the noise reduction model to obtain a target audio. The method can improve the noise reduction accuracy and real-time performance for the request audio. In addition, the invention also relates to blockchain technology, and the target audio can be stored in a blockchain.
Owner: PING AN TECH (SHENZHEN) CO LTD

Low-signal-to-noise-ratio speech enhancement method based on information distillation and aggregation

The invention provides a low-signal-to-noise-ratio speech enhancement method based on information distillation and aggregation. The method comprises the following steps: performing speech feature extraction on an original speech spectrogram to obtain speech information representation; performing multi-stage information distillation processing on the voice information representation to obtain a voice information distillation result after noise component filtering; and performing spectrogram reconstruction on the voice information distillation result. The calibrated information on the information distillation line at each moment in the multi-stage information distillation processing process formed according to an attention mechanism and an information distillation mechanism is used as the input of self-attention information processing sub-modules at the next moment; and through information distillation and recalibration of the N attention information processing sub-modules and N information distillation sub-modules in sequence, the noise component filtering effect is finally achieved. The method can adapt to speech feature extraction in different environments, so that the models can adapt to acoustic features of different noises, and the speech enhancement effect is remarkably improved.
Owner: UNIV OF ELECTRONICS SCI & TECH OF CHINA