Voice signal processing method, device and terminal

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech signal processing and speech signal technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as noise signal, reduce user experience performance, etc., to achieve the effect of improving performance, improving user experience performance, and reducing the probability of false detection

Active Publication Date: 2020-04-21

大众问问(北京)信息科技有限公司

View PDF12 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] In the process of realizing the present invention, the inventor found that the prior art has the following defects: since there is noise anywhere in the natural world, the voice that anyone sends is a voice mixed with noise signals, even in an absolutely quiet environment. The original voice signal acquired by the device will also include certain noise signals

If DRC processing is directly performed on the voice signal obtained by the front end, when the voice signal does not include the target voice signal, non-target voice signals (that is, interference signals) such as noise signals or residual echo signals included in the voice signal will be amplified at the same time, thereby Affects the false detection probability of the back-end speech recognition, misidentification occurs, and reduces user experience performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0029] figure 1 It is a flow chart of a voice signal processing method provided in Embodiment 1 of the present invention. This embodiment is applicable to the case of performing DRC processing on a voice signal including a target voice signal. The method can be executed by a voice signal processing device. The device can be realized by means of software and / or hardware, and can generally be integrated in a terminal (typically, terminals such as various vehicle-mounted devices or intelligent terminal devices). Correspondingly, such as figure 1 As shown, the method includes the following operations:

[0030] S110. Acquire a speech signal to be processed and at least two reference signals.

[0031] Wherein, the speech signal to be processed may be a speech signal requiring DRC processing. Exemplarily, the voice command signal input by the user (that is, the microphone signal) acquired by the vehicle-mounted terminal through the microphone device or the voice command signal col...

Embodiment 2

[0042] figure 2It is a flow chart of a voice signal processing method provided in Embodiment 2 of the present invention. This embodiment is embodied on the basis of the above-mentioned embodiments. In this embodiment, the calculation of the voice signal to be processed and at least two A cross-correlation parameter of the reference signal, and if it is determined according to the cross-correlation parameter that there is a target speech signal in the speech signal to be processed, performing dynamic range compression DRC processing on the speech signal to be processed. Correspondingly, such as figure 2 As shown, the method of this embodiment may include:

[0043] S210. Acquire a speech signal to be processed and at least two reference signals.

[0044] Optionally, the reference signal includes a first reference signal and a second reference signal; the first reference signal is a system audio signal; the second reference signal is a signal obtained through AEC processing o...

Embodiment 3

[0088] Figure 3a It is a flow chart of a speech signal processing method provided by Embodiment 3 of the present invention. This embodiment is embodied on the basis of the above-mentioned embodiments. In this embodiment, the signal energy according to the second reference signal is given. A specific implementation manner of determining whether the target speech signal exists in the speech signal to be processed based on an intermediate determination result with the target speech signal. Correspondingly, such as Figure 3a As shown, the method of this embodiment may include:

[0089] S310. Acquire a speech signal to be processed and at least two reference signals.

[0090] Optionally, the reference signal includes a first reference signal and a second reference signal; the cross-correlation parameter is a cross-correlation spectrum; the first reference signal is a system audio signal; the second reference signal is the Process the signal obtained by processing the speech si...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention discloses a voice signal processing method and device, and a terminal. The method comprises the steps: obtaining a to-be-processed voice signal and at least two reference signals; calculating cross-correlation parameters of the to-be-processed voice signal and the at least two reference signals; and if it is determined that the to-be-processed voice signal has a target voice signal according to the cross-correlation parameterS, performing dynamic range compression (DRC) processing on the to-be-processed voice signal. According to the technical scheme of the embodiment of the invention, the performance of voice signal DRC processing can be improved, so that the false detection probability is reduced, and the user experience performance is improved.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of voice processing, and in particular, to a voice signal processing method, device, and terminal. Background technique [0002] Speech recognition technology continues to develop and has been widely used in various industries, especially in electronic equipment. In the speech recognition process, it is usually necessary to perform DRC (Dynamic Range Control, dynamic range compression) processing on the speech signal acquired by the front end, so that the energy of the output signal can better match the wake-up model and recognition model of the back end. [0003] In the prior art, DRC processing is usually directly performed on the voice signal acquired by the front end, so that the voice signal can effectively obtain gain adjustment. [0004] In the process of realizing the present invention, the inventor found that the prior art has the following defects: since there is noise anywhe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/26G10L21/0264G10L25/06G10L25/18G10L25/60

CPCG10L15/26G10L21/0264G10L25/06G10L25/18G10L25/60G10L2021/02082

Inventor 杨晓霞刘溪

Owner 大众问问(北京)信息科技有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice signal processing method, device and terminal

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology