Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Bone conduction speech enhancement method based on joint dictionary learning and sparse representation

A sparse representation and dictionary learning technology, applied in speech analysis, speech recognition, bone conduction transducer hearing equipment, etc., can solve the problems of low speech intelligibility and dullness, and achieve the effect of improving hearing quality

Pending Publication Date: 2020-11-20
UNIV OF SCI & TECH OF CHINA
View PDF2 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to factors such as the low-pass performance of human body conduction and the limitation of the sensor technology level, the speech clarity received by the bone conduction microphone is low and sounds dull.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bone conduction speech enhancement method based on joint dictionary learning and sparse representation
  • Bone conduction speech enhancement method based on joint dictionary learning and sparse representation
  • Bone conduction speech enhancement method based on joint dictionary learning and sparse representation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0015] Different from most of the existing single-channel speech enhancement algorithms, the embodiment of the present invention provides a bone conduction speech enhancement method based on joint dictionary learning and sparse representation. In the training phase, the method first uses bone conduction The special-shaped dual-microphone array system composed of microphones and air conduction microphones collects training speech synchronously, const...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a bone conduction speech enhancement method based on joint dictionary learning and sparse representation. In a training stage, in an indoor noise-free environment, a special-shaped double-microphone array composed of a bone conduction microphone and an air conduction microphone is used for synchronously collecting training speech, and a joint training set of the bone conduction speech and the air conduction speech is constructed; and short-time inverse Fourier transform is performed on the training signals of the bone conduction speech and the air conduction speech to obtain a time-frequency spectrum amplitude, and a joint speech dictionary of the bone conduction speech and the air conduction speech is learnt on a time-frequency spectrum. In a detection stage, short-time Fourier transform is performed on the bone conduction speech to obtain a time-frequency spectrum amplitude and a phase; the amplitude is projected on a bone conduction speech sub-dictionary of the joint speech dictionary; the air-guided speech time-frequency spectrum amplitude is reconstructed by using the obtained sparse representation coefficient and the air-guided speech sub-dictionary ofthe joint speech dictionary, two methods are provided for enhancing the time-frequency spectrum of the bone conduction speech, and finally short-time inverse Fourier transform is performed to obtain an enhanced bone conduction speech time-domain signal, so that the speech sharpness is improved.

Description

technical field [0001] The invention relates to the field of single-channel speech enhancement, in particular to a bone conduction speech enhancement method based on joint dictionary learning and sparse representation. Background technique [0002] Voice plays a leading role in people's communication activities. Due to the pollution of environmental noise, the human ear and related smart devices including air conduction microphones (referred to as air conduction microphones) receive noisy voice, and the quality and intelligibility of voice will be significantly reduced, which affects people's subjective Auditory experience and speech recognition rate of smart devices. Speech enhancement technology is the main method to solve this kind of problems. How to restore clean speech from noisy speech has always been a problem that people are trying to solve. The voice received by the air conduction microphone is called air conduction voice for short. [0003] Bone conduction mic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L21/02G10L21/0224G10L21/0316G10L25/18G10L25/27
CPCG10L15/063G10L21/0224G10L21/0316G10L25/18G10L25/27G10L2015/0633G10L2021/02087H04R2460/13
Inventor 叶中付
Owner UNIV OF SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products