Audio processing method and device, terminal equipment and computer storage medium

An audio processing and audio signal technology, applied in the field of terminal artificial intelligence, can solve the problems of limited application scenarios and high computing power requirements, and achieve the effect of reducing computing power requirements and reducing the amount of calculation

Inactive Publication Date: 2020-05-01
HUAWEI TECH CO LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In view of this, the embodiment of the present application provides an audio processing method, device, terminal equipment, and computer storage medium to solve the problem that the existing face-based

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio processing method and device, terminal equipment and computer storage medium
  • Audio processing method and device, terminal equipment and computer storage medium
  • Audio processing method and device, terminal equipment and computer storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0106] In the following description, specific details such as specific system structures and technologies are presented for the purpose of illustration rather than limitation, so as to thoroughly understand the embodiments of the present application. It will be apparent, however, to one skilled in the art that the present application may be practiced in other embodiments without these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present application with unnecessary detail.

[0107] In order to illustrate the technical solutions described in this application, specific examples are used below to illustrate.

[0108] It should be understood that when used in this specification and the appended claims, the term "comprising" indicates the presence of described features, integers, steps, operations, elements and / or components, but does not exclude one or more ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of terminal artificial intelligence (AI), in particular, relates to the field of voice recognition, and provides an audio processing method and device, terminal equipment and a computer storage medium, wherein the method comprises the steps: obtaining a to-be-processed face image set and a to-be-denoised audio signal; extracting mouth features of each face imagein the to-be-processed face image set, and extracting frequency spectrum features of the to-be-denoised audio signal; inputting the mouth features of each face image and the frequency spectrum featureof the to-be-denoised audio signal into a preset neural network model to obtain a frequency spectrum mask; and processing the to-be-denoised audio signal by using the spectrum mask to obtain a targetaudio signal. The method can solve the problems that an existing face-based auxiliary noise reduction algorithm has high requirements for the computing power of terminal equipment, is difficult to operate on low-computing-power terminal equipment and is limited in application scene.

Description

technical field [0001] The present application belongs to the field of terminal artificial intelligence (AI), specifically related to the field of speech recognition, and particularly relates to an audio processing method, device, terminal equipment, and computer storage medium. Background technique [0002] Currently, many terminal devices have voice interaction functions, such as voice assistants and voice input methods. When the user uses these terminal devices, if the user is in a relatively quiet environment, the terminal device can more accurately identify the recorded audio data. [0003] However, once the noise level in the environment is high, and the terminal device is not equipped with appropriate noise reduction measures, the recognition accuracy of audio data will drop sharply. [0004] An effective noise reduction method is crucial for terminal devices with voice interaction functions. Some scholars have proposed an auxiliary noise reduction algorithm based o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L21/0208G10L25/30G06N3/04G06K9/00
CPCG10L21/0208G10L25/30G06V40/171G06N3/045
Inventor 耿杰
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products