Method for azimuth estimation, equipment and storage medium

An azimuth, azimuth angle technology, applied in the field of speech processing, can solve the problem of azimuth accuracy sensitivity, etc., to achieve the effect of improving accuracy

Active Publication Date: 2019-08-23
TENCENT TECH (SHENZHEN) CO LTD
View PDF5 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in order to achieve optimal performance, the beamforming algorithm needs to give the azimuth of the target voice, and is very sensitive to the accuracy of the azimuth

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for azimuth estimation, equipment and storage medium
  • Method for azimuth estimation, equipment and storage medium
  • Method for azimuth estimation, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] Embodiments of the present application are described below in conjunction with the accompanying drawings. Apparently, the described embodiments are only part of the embodiments of the present application, not all of the embodiments. Those of ordinary skill in the art know that, with the development of technology and the emergence of new scenarios, the technical solutions provided in the embodiments of the present application are also applicable to similar technical problems.

[0032] An embodiment of the present application provides a method for azimuth estimation, which is used to improve the accuracy of azimuth estimation during voice interaction. The embodiment of the present application also provides a corresponding device and a computer-readable storage medium. Each will be described in detail below.

[0033] The terminal device in the embodiment of the present application is a voice interaction device, which may be a device such as a stereo, a TV, a TV box, or a ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for azimuth estimation. The method comprises the steps that multi-channel sampling signals are acquired and buffered; wake-up word detection is performed on each sampling signal of the multi-channel sampling signals, and a wake-up word detection score of each sampling signal is determined; if it is determined that a wake-up word exists according to the wake-up worddetection score of each sampling signal, spatial spectrum estimation is performed on the buffered multi-channel sampling signals to obtain a spatial spectrum estimation result, and the wake-up word is included in a target speech; and the azimuth of the target speech is determined according to the spatial spectrum estimation result and the highest wake-up word detection score. According to the method for azimuth estimation, the accuracy of azimuth estimation in the process of speech interaction is improved by using the wake-up words to assist in estimation of the azimuth of the target speech.

Description

technical field [0001] The present application relates to the technical field of speech processing, and in particular to a method, device and computer-readable storage medium for azimuth estimation. Background technique [0002] With the popularity of smart speakers and their derivatives, voice interaction between humans and machines, especially far-field voice interaction, has gradually become an important research direction. In the field of voice interaction, far-field voice interaction usually refers to a distance greater than 1 meter. Voice interaction between man and machine is considered to be the most important user traffic entry in the future. Therefore, both Internet platforms and content service providers attach great importance to the exploration and innovation of speech recognition interfaces. [0003] At present, voice interactive smart devices in the field of consumer electronics are mainly smart speakers, smart TVs or TV boxes with voice control functions an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/08G10L25/03G10L25/60
CPCG10L15/08G10L25/03G10L25/60G01S3/8006G10L25/18G10L2015/088G10L15/22G10L25/21
Inventor 郑脊萌高毅于蒙刘二男
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products