Voice processing method and apparatus thereof

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech processing and speech technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as inapplicable noise removal

Active Publication Date: 2017-11-24

深圳市雅今智慧科技有限公司

View PDF16 Cites 27 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] However, this method solves the problem of denoising in a high-noise background, and is not suitable for denoising in an indoor environment with long-distance communication.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0076] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0077] The sound signal referred to in the present invention refers to digital audio data, that is, the digital audio data obtained by first converting sound waves into analog audio signals through a sound wave conversion circuit, and then converting the above analog audio signals through an analog-to-digital converter.

[0078] refer to figure 1 , the present invention proposes a kind of voice processing method, comprises the following steps:

[0079] S10. Transform the sound signal from the time domain to the frequency domain to obtain a frequency domain signal, calculate the observed signal power spectral density of the frequency domain signal, and estimate the noise power spectral density according to the observed signal power spectral density;

[0080] S20. When it is determined that there is voice activity in t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice processing method and an apparatus thereof. The method comprises the following steps of firstly, converting a sound signal into a frequency domain signal, through calculating a signal-to-noise ratio of the frequency domain signal, acquiring an adaptive updating step size of a noise power spectrum, and according to the step size, updating a noise power spectrum density; then, detecting whether a voice activity exists in the sound signal, and if the voice activity exists in the sound signal, using adaptive Kalman filtering to process the frequency domain signal and acquiring a reverberation power spectrum density; after the noise power spectrum density and the reverberation power spectrum density are determined, calculating an optimization and estimation voice spectrum; and finally, carrying out inverse Fourier transform on the optimization and estimation voice spectrum so as to restore an optimized sound signal. In the invention, collected sound signal quality under a remote speaking condition can be effectively optimized and a voice identification rate is increased.

Description

technical field [0001] The invention relates to the field of voice recognition, in particular to a voice processing method and device. Background technique [0002] In recent years, with the vigorous development of Internet technology and intelligent hardware, voice intelligent interactive technologies such as voice recognition, voiceprint recognition, and sound source detection have begun to move from laboratories to users. Because speech recognition technology is the core technology of human-computer interaction system based on speech. At present, the recognition rate has reached the usable accuracy rate under limited conditions. The so-called limited adjustment usually means that the distance between the user and the microphone is relatively close, and the noise interference is small. However, the condition that voice commands must be issued at close range limits the convenience of voice interaction. [0003] In the case of long-distance speaking, since the voice energ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L21/0232G10L21/0208G10L25/21G10L15/20

CPCG10L15/20G10L21/0208G10L21/0232G10L25/21G10L2021/02082

Inventor蔡钢林

Owner深圳市雅今智慧科技有限公司

Voice processing method and apparatus thereof

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology