
Speech processing method and device

A voice processing and voice frame technology, applied in voice analysis, instruments, etc., that addresses problems such as low positioning accuracy and aliasing in the voice output signal, and achieves improved accuracy, elimination of aliasing, and an improved signal-to-noise ratio.

Active Publication Date: 2018-03-02
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

[0005] For this reason, the first object of the present invention is to propose a speech processing method that performs subband decomposition on each speech frame and performs beamforming on the subband signals in the same frequency band, so that the resulting speech output signal contains no aliasing and positioning accuracy is improved. This solves the problem of aliasing and low positioning accuracy in existing voice output signals obtained through beamforming.



Embodiment Construction

[0059] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0060] The voice processing method and device thereof according to the embodiments of the present invention are described below with reference to the accompanying drawings.

[0061] At present, beamforming algorithms are mostly used in voice positioning, and the voice output obtained by beamforming algorithms often has aliasing, which degrades voice positioning and lowers positioning accuracy.

[0062] To solve this problem, the embodiment of the present invention proposes a speech processing method, by performing subband de...


Abstract

The invention presents a speech processing method and device. The method includes the following steps: collecting N current speech frames; carrying out sub-band decomposition on each current speech frame to acquire M sub-band signals of each current speech frame, wherein N and M are positive integers; extracting sub-band signals of the same band from the M sub-band signals of each current speech frame; for each band, carrying out beam-forming on the N sub-band signals under the band to get a first speech signal; and carrying out sub-band synthesis on the first speech signals under the bands to acquire an output signal of the current speech frames. According to the method, sub-band decomposition is carried out on each speech frame collected, beam-forming is carried out on the sub-band signals under the same band, and an output signal is obtained through sub-band synthesis. The aliasing in the output signal can be eliminated. Moreover, the signal-to-noise ratio of the output signal is improved, a speech signal with high quality can be output, and the accuracy of speech localization can be improved.
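The steps in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the source does not name a filter bank or a beamformer, so the sketch groups FFT bins into M uniform bands as a stand-in for sub-band decomposition and uses a plain delay-and-sum beamformer. It also assumes the N frames come from N microphones with known per-channel delays. All function and parameter names (`subband_decompose`, `beamform_band`, `process_frames`, `delays`, `sample_rate`) are hypothetical.

```python
import numpy as np


def subband_decompose(frame, num_bands):
    """Split one frame into num_bands sub-band spectra.

    Assumption: contiguous groups of FFT bins stand in for a real
    analysis filter bank, which the source does not specify.
    """
    spectrum = np.fft.rfft(frame)
    edges = np.linspace(0, spectrum.size, num_bands + 1).astype(int)
    bands = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        band = np.zeros_like(spectrum)
        band[lo:hi] = spectrum[lo:hi]  # keep only this band's bins
        bands.append(band)
    return bands


def beamform_band(band_spectra, delays, frame_len, sample_rate):
    """Delay-and-sum over the N channels of one band (assumed beamformer).

    Each channel is advanced by its known delay (in seconds) through a
    phase shift, then the aligned channels are averaged.
    """
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    aligned = [
        spec * np.exp(2j * np.pi * freqs * delay)
        for spec, delay in zip(band_spectra, delays)
    ]
    return np.mean(aligned, axis=0)


def process_frames(frames, delays, num_bands, sample_rate):
    """Decompose N frames, beamform each band, then synthesize the output."""
    frames = np.asarray(frames, dtype=float)
    num_channels, frame_len = frames.shape

    # N x M grid: per_channel[n][m] is band m of frame n.
    per_channel = [subband_decompose(f, num_bands) for f in frames]

    out_spectrum = np.zeros(frame_len // 2 + 1, dtype=complex)
    for m in range(num_bands):
        # Extract the sub-band signals of the same band from all N frames.
        same_band = [per_channel[n][m] for n in range(num_channels)]
        # Beamforming yields one signal per band; summing the bands is the
        # synthesis step for this bin-grouping decomposition.
        out_spectrum += beamform_band(same_band, delays, frame_len, sample_rate)

    return np.fft.irfft(out_spectrum, n=frame_len)
```

With three channels that are circularly shifted copies of one source and matching delays, `process_frames` returns the source frame, which is a quick sanity check on the alignment and synthesis steps. A practical system would estimate the delays and would likely use a proper analysis/synthesis filter bank in place of the bin grouping.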

Description

Technical field

[0001] The invention relates to the technical field of voice processing, in particular to a voice processing method and device thereof.

Background technique

[0002] Artificial Intelligence, abbreviated AI, is a new technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence. Artificial intelligence is a branch of computer science that attempts to understand the essence of intelligence and produce a new intelligent machine that responds in a manner similar to human intelligence. Research in this field includes robotics, speech recognition, image recognition, natural language processing and expert systems, etc. Among them, the most important aspect of artificial intelligence is speech recognition technology.

[0003] At present, beamforming algorithms are mostly used in voice positioning, and the voice output obtain...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L21/0208, G10L19/008, G10L25/78
CPC: G10L19/008, G10L21/0208, G10L25/78
Inventors: 吴俊楠, 宋辉, 崔玮玮
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD