Audio processing method, vocoder, device, equipment and storage medium

An audio processing and audio technology, applied in the field of audio and video processing, can solve the problems of slow audio synthesis processing speed and lower audio processing efficiency, and achieve the effects of improving efficiency, increasing processing speed, and reducing the number of cycles

Pending Publication Date: 2021-10-22
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current vocoder usually needs to perform multiple cycles based on multiple sampling time points in the audio feature signal to complete speech prediction, and then complete speech synthesis, which leads to slower processing speed of audio synthesis and reduces the efficiency of audio processing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio processing method, vocoder, device, equipment and storage medium
  • Audio processing method, vocoder, device, equipment and storage medium
  • Audio processing method, vocoder, device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] In order to make the present application, the present application will be further described in detail below with reference to the accompanying drawings, and the embodiments described will not be considered in the limitation of the present invention. All other embodiments obtained under the premise of creative labor belong to the scope of this application.

[0057] In the following description, "some embodiments" describe the subset of all possible embodiments, but it can be understood that "some embodiments" may be the same subset or different subset of all possible embodiments, and It can be combined with each other without conflict.

[0058] In the following description, the term "first \ second \ third" involved is only a different object, does not represent specific sorting for the object, can understand, "first \ second \ third" The specific order or prior order can be interchanged in the case of allowing, so that the present application embodiments described herein ca...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an audio processing method, a vocoder, a device, equipment and a storage medium. The method comprises the following steps: performing speech feature conversion on a to-be-processed text to obtain at least one acoustic feature frame; extracting condition features from each acoustic feature frame through a frame rate network; performing frequency band division and time domain downsampling on the current frame to obtain n subframes containing a preset number of sampling points; synchronously performing sampling value prediction on the current m adjacent sampling points corresponding to the n sub-frames in the ith round of prediction process through the sampling prediction network to obtain m * n sub-prediction values, and further obtaining n sub-prediction values corresponding to each sampling point in the preset number of sampling points; obtaining an audio prediction signal corresponding to the current frame according to the n sub-prediction values corresponding to each sampling point; and performing audio synthesis on each acoustic feature frame of the at least one acoustic feature frame to obtain a target audio. According to the invention, the audio processing speed and efficiency can be improved.

Description

Technical field [0001] The present application relates to audio and video processing techniques, and more particularly to an audio processing method, a vocoder, a device, a device, and a storage medium. Background technique [0002] With the rapid development of smart equipment (such as smartphone, smart speakers, etc.), speech interaction technology is a growing application as a natural interaction. As an important part of speech interaction technology, speech synthesis technology has also achieved great progress. Voice synthesis technology converts text into a corresponding audio content through a certain rule or model algorithm. Traditional speech synthesis is based primarily based on splicing methods or statistical parameters. With the continuous breakthrough in the field of speech recognition, deep learning is gradually introduced into the field of speech synthesis. Thanks to this, the neural network-based vocoder (Neural Vocoder) has made great progress. However, the curren...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/04G10L13/02G10L13/08
CPCG10L13/02G10L13/08G10L13/047
Inventor 林诗伦李新辉卢鲤
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products