Speech processing method and device

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A voice processing and voice frame technology, applied in voice analysis, instruments, etc., can solve problems such as low positioning accuracy and voice output signal aliasing, and achieve the effects of improving accuracy, eliminating aliasing, and improving signal-to-noise ratio

Active Publication Date: 2018-03-02

BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

View PDF10 Cites 19 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] For this reason, the first object of the present invention is to propose a speech processing method, by performing subband decomposition on each speech frame and performing beamforming on subband signals under the same frequency band, so that there is no aliasing in the resulting speech output signal. To improve the accuracy of positioning, to solve the problem of aliasing and low positioning accuracy in the existing voice output signals obtained through beamforming

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0059] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0060] The voice processing method and device thereof according to the embodiments of the present invention are described below with reference to the accompanying drawings.

[0061] At present, beamforming algorithms are mostly used in voice positioning, and the voice output obtained by beamforming algorithms often has aliasing, which will affect voice positioning and make positioning accuracy low.

[0062] To solve this problem, the embodiment of the present invention proposes a speech processing method, by performing subband de...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention presents a speech processing method and device. The method includes the following steps: collecting N current speech frames; carrying out sub-band decomposition on each current speech frame to acquire M sub-band signals of each current speech frame, wherein N and M are positive integers; extracting sub-band signals of the same band from the M sub-band signals of each current speech frame; for each band, carrying out beam-forming on the N sub-band signals under the band to get a first speech signal; and carrying out sub-band synthesis on the first speech signals under the bands toacquire an output signal of the current speech frames. According to the method, sub-band decomposition is carried out on each speech frame collected, beam-forming is carried out on the sub-band signals under the same band, and an output signal is obtained through sub-band synthesis. The aliasing in the output signal can be eliminated. Moreover, the signal-to-noise ratio of the output signal is improved, a speech signal with high quality can be output, and the accuracy of speech localization can be improved.

Description

technical field [0001] The invention relates to the technical field of voice processing, in particular to a voice processing method and device thereof. Background technique [0002] Artificial Intelligence (Artificial Intelligence), the English abbreviation is AI. It is a new technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence. Artificial intelligence is a branch of computer science that attempts to understand the essence of intelligence and produce a new intelligent machine that responds in a manner similar to human intelligence. Research in this field includes robotics, speech recognition, image recognition, natural language processing and expert systems, etc. Among them, the most important aspect of artificial intelligence is speech recognition technology. [0003] At present, beamforming algorithms are mostly used in voice positioning, and the voice output obtain...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L21/0208G10L19/008G10L25/78

CPCG10L19/008G10L21/0208G10L25/78

Inventor吴俊楠宋辉崔玮玮

OwnerBAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Speech processing method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology