Voice processing method and apparatus thereof

A speech processing and speech technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as inapplicable noise removal

Active Publication Date: 2017-11-24
深圳市雅今智慧科技有限公司
View PDF16 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] However, this method solves the problem of denoising in a high-noise background, and is not suitable for denoising in an indoor environment with long-distance communication.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice processing method and apparatus thereof
  • Voice processing method and apparatus thereof
  • Voice processing method and apparatus thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0076] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0077] The sound signal referred to in the present invention refers to digital audio data, that is, the digital audio data obtained by first converting sound waves into analog audio signals through a sound wave conversion circuit, and then converting the above analog audio signals through an analog-to-digital converter.

[0078] refer to figure 1 , the present invention proposes a kind of voice processing method, comprises the following steps:

[0079] S10. Transform the sound signal from the time domain to the frequency domain to obtain a frequency domain signal, calculate the observed signal power spectral density of the frequency domain signal, and estimate the noise power spectral density according to the observed signal power spectral density;

[0080] S20. When it is determined that there is voice activity in t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a voice processing method and an apparatus thereof. The method comprises the following steps of firstly, converting a sound signal into a frequency domain signal, through calculating a signal-to-noise ratio of the frequency domain signal, acquiring an adaptive updating step size of a noise power spectrum, and according to the step size, updating a noise power spectrum density; then, detecting whether a voice activity exists in the sound signal, and if the voice activity exists in the sound signal, using adaptive Kalman filtering to process the frequency domain signal and acquiring a reverberation power spectrum density; after the noise power spectrum density and the reverberation power spectrum density are determined, calculating an optimization and estimation voice spectrum; and finally, carrying out inverse Fourier transform on the optimization and estimation voice spectrum so as to restore an optimized sound signal. In the invention, collected sound signal quality under a remote speaking condition can be effectively optimized and a voice identification rate is increased.

Description

technical field [0001] The invention relates to the field of voice recognition, in particular to a voice processing method and device. Background technique [0002] In recent years, with the vigorous development of Internet technology and intelligent hardware, voice intelligent interactive technologies such as voice recognition, voiceprint recognition, and sound source detection have begun to move from laboratories to users. Because speech recognition technology is the core technology of human-computer interaction system based on speech. At present, the recognition rate has reached the usable accuracy rate under limited conditions. The so-called limited adjustment usually means that the distance between the user and the microphone is relatively close, and the noise interference is small. However, the condition that voice commands must be issued at close range limits the convenience of voice interaction. [0003] In the case of long-distance speaking, since the voice energ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0232G10L21/0208G10L25/21G10L15/20
CPCG10L15/20G10L21/0208G10L21/0232G10L25/21G10L2021/02082
Inventor 蔡钢林
Owner 深圳市雅今智慧科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products