Audio data processing method, audio data processing device and storage medium

An audio and voice data processing technology, applied in the Internet field, that addresses problems such as unreliable device wake-up accuracy and degraded sensitivity of the voice interaction system.

Active Publication Date: 2019-11-26
TENCENT TECH (SHENZHEN) CO LTD
Cites: 9 | Cited by: 3

AI Technical Summary

Problems solved by technology

In the voice interaction systems of current smart voice devices, the pickup of wake-up words and command words is inevitably disturbed by external environmental noise and by other people's speech. If the interfering noise is too strong, it seriously degrades the sensitivity of the voice interaction system, so that wake-up failures or false wake-ups occur during device wake-up. In other words, the accuracy of device wake-up cannot be guaranteed.


Embodiment Construction

[0103] The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained from them by those of ordinary skill in the art without creative work fall within the protection scope of the present invention.

[0104] Artificial Intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge, and use knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technology of computer science, which attempts to understand the essence of in...

Abstract

The embodiment of the invention discloses an audio data processing method, an audio data processing device and a storage medium. The method is applied in the field of artificial intelligence and comprises the following steps: obtaining multi-channel audio data in an environment space; obtaining a voice data set from the multi-channel audio data; generating, for each of a plurality of enhancement directions, the enhanced voice information corresponding to the voice data set; matching the hidden voice features in the enhanced voice information against a target matching word; determining the enhancement direction whose enhanced voice information has the highest matching degree with the target matching word as the target audio direction; obtaining the voice spectrum features in the enhanced voice information and, from them, the voice spectrum features in the target audio direction; and performing voice verification on the hidden voice features and the voice spectrum features in the target audio direction, based on the target matching word, to obtain a target verification result. With the method, device and storage medium provided by the invention, the accuracy of voice control can be improved and the misrecognition rate effectively reduced.
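
The steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how the enhancement directions are formed or how features are matched, so the sketch assumes a delay-and-sum beamformer on a uniform linear microphone array, uses cosine similarity against a reference waveform as a stand-in for matching hidden voice features with the target matching word, and uses a naive DFT magnitude as a stand-in for voice spectrum features. All function names, the array geometry, the candidate angles and the 0.8 threshold are hypothetical.

```python
import math
import random

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value

def steering_delays(num_mics, spacing_m, angle_deg, sample_rate):
    """Integer sample delay at each mic of a uniform linear array (assumed geometry)."""
    step = spacing_m * math.cos(math.radians(angle_deg)) / SPEED_OF_SOUND * sample_rate
    return [round(m * step) for m in range(num_mics)]

def delay_and_sum(channels, delays):
    """Enhance one direction: undo each channel's delay, then average the channels."""
    n = len(channels[0])
    out = [0.0] * n
    for channel, delay in zip(channels, delays):
        for t in range(n):
            if 0 <= t + delay < n:
                out[t] += channel[t + delay]
    return [v / len(channels) for v in out]

def cosine(a, b):
    """Cosine similarity; stand-in for a learned matching degree."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return dot / norm if norm else 0.0

def magnitude_spectrum(signal, num_bins=16):
    """Naive DFT magnitudes of the lowest bins; stand-in for spectrum features."""
    n = len(signal)
    mags = []
    for k in range(num_bins):
        re = sum(x * math.cos(2 * math.pi * k * t / n) for t, x in enumerate(signal))
        im = sum(x * math.sin(2 * math.pi * k * t / n) for t, x in enumerate(signal))
        mags.append(math.hypot(re, im))
    return mags

def pick_target_direction(channels, angles, template, spacing_m, sample_rate):
    """Score every enhancement direction and keep the best-matching one."""
    best = None
    for angle in angles:
        delays = steering_delays(len(channels), spacing_m, angle, sample_rate)
        enhanced = delay_and_sum(channels, delays)
        score = cosine(enhanced, template)
        if best is None or score > best[1]:
            best = (angle, score, enhanced)
    return best

def verify(match_score, enhanced, template, threshold=0.8):
    """Second-stage check: both the match score and the spectral score must pass."""
    spectral_score = cosine(magnitude_spectrum(enhanced), magnitude_spectrum(template))
    return match_score >= threshold and spectral_score >= threshold

def simulate(template, true_angle, num_mics, spacing_m, sample_rate):
    """Synthetic multi-channel capture of `template` arriving from `true_angle`."""
    n = len(template)
    delays = steering_delays(num_mics, spacing_m, true_angle, sample_rate)
    return [[template[t - d] if 0 <= t - d < n else 0.0 for t in range(n)]
            for d in delays]

# Toy data: a pseudo-random "wake word" waveform arriving from 60 degrees.
rng = random.Random(0)
template = [rng.uniform(-1.0, 1.0) for _ in range(400)]
channels = simulate(template, 60, num_mics=4, spacing_m=0.05, sample_rate=16000)

angle, score, enhanced = pick_target_direction(
    channels, [0, 60, 120, 180], template, spacing_m=0.05, sample_rate=16000)
accepted = verify(score, enhanced, template)
print(angle, round(score, 3), accepted)
```

In this toy setup the beam steered at the true arrival angle realigns the channels, so it scores highest and then passes the second check. A real system would replace both stand-in scores with trained models operating on the enhanced signal of each direction.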

Description

Technical field

[0001] The present invention relates to the field of Internet technology, and in particular to an audio data processing method, device and storage medium.

Background

[0002] With the development of science and technology, smart voice devices are becoming increasingly widespread. In such devices, voice interaction through voice commands is an important technical means of making the device intelligent. In the voice interaction systems of current smart voice devices, the pickup of wake-up words and command words is inevitably disturbed by external environmental noise and by other people's speech. If the interfering noise is too strong, it seriously degrades the sensitivity of the voice interaction system, which in turn causes wake-up failures or false wake-ups during device wake-up. In other words, the accuracy of device wake-up cannot be ensured. ...

Application Information

Patent Type & Authority: Applications (China)
IPC (8): G10L21/02, G10L21/0208, G10L21/0216, G10L15/02, G10L15/06, G10L15/08, G10L15/20, G10L25/18, G10L15/22
CPC: G10L15/02, G10L15/06, G10L15/08, G10L15/20, G10L21/0208, G10L21/0216, G10L25/18, G10L2021/02166, G10L2021/02087, G10L2021/02082, G10L2015/088, G10L2015/223, G10L15/22, G10L21/0364, G10L21/02, G10L15/32, H04R3/005, G06F3/167, H04R1/406, H04R2201/401, G10L17/02, G10L17/06, G10L17/20, G10L17/22, G10L21/0232
Inventor: 高毅
Owner: TENCENT TECH (SHENZHEN) CO LTD